<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mar. Sci.</journal-id>
<journal-title>Frontiers in Marine Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mar. Sci.</abbrev-journal-title>
<issn pub-type="epub">2296-7745</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmars.2022.1067482</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Marine Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Genetic features of the marine polychaete <italic>Sirsoe methanicola</italic> from metagenomic data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Lim</surname>
<given-names>Shen Jean</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<xref ref-type="author-notes" rid="fn003">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1623009"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Thompson</surname>
<given-names>Luke R.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/205488"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Goodwin</surname>
<given-names>Kelly D.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/396512"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Cooperative Institute for Marine and Atmospheric Studies, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami</institution>, <addr-line>Miami, FL</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration</institution>, <addr-line>Miami, FL</addr-line>, <country>United States</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Northern Gulf Institute, Mississippi State University</institution>, <addr-line>Starkville, MS</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Rachel Przeslawski, New South Wales Department of Primary Industries, Australia</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Wang Minxiao, Institute of Oceanology (CAS), China; Fabiano Thompson, Federal University of Rio de Janeiro, Brazil</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Kelly D. Goodwin, <email xlink:href="mailto:kelly.goodwin@noaa.gov">kelly.goodwin@noaa.gov</email>; Shen Jean Lim, <email xlink:href="mailto:jslim@g.clemson.edu">jslim@g.clemson.edu</email>
</p>
</fn>
<fn fn-type="present-address" id="fn003">
<p>&#x2020;Present address: Shen Jean Lim, College of Marine Science, University of South Florida, St. Petersburg, FL, United States</p>
</fn>
<fn fn-type="other" id="fn002">
<p>This article was submitted to Deep-Sea Environments and Ecology, a section of the journal Frontiers in Marine Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>01</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>9</volume>
<elocation-id>1067482</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>10</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>12</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Lim, Thompson and Goodwin</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Lim, Thompson and Goodwin</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The methane ice worm <italic>Sirsoe methanicola</italic> is the only marine polychaete species observed to colonize the methane hydrates of the Gulf of Mexico. Methane hydrates are ephemeral features of deep-sea cold seeps, and finding worm-colonized hydrates is rare; thus, little is known about these organisms. Recent metagenomic analysis predicted prokaryotic taxa and pathways from <italic>S. methanicola</italic> gut contents and worm fragments. Here, we increase the genetic information known about <italic>S. methanicola</italic> by assembling its nuclear rRNA genes (18S rRNA and 28S rRNA), mitochondrial genome (mitogenome), and other protein-coding genes from metagenomic data. Assembled 18S rRNA and 28S rRNA gene sequences of <italic>S. methanicola</italic> were near-identical to previously reported <italic>S. methanicola</italic> sequences. The 17,403-bp mitogenome of <italic>S. methanicola</italic> is the first mitogenome sequence of the family <italic>Hesionidae</italic>, consisting of 39.03% G+C content, 13 protein-coding genes, 24 tRNAs (including two split <italic>trnM</italic> genes), and 2 rRNA genes. Protein-coding genes in the <italic>S. methanicola</italic> metagenomes assigned to the phylum Annelida were involved in cell adhesion, signaling, ubiquitin system, metabolism, transport, and other processes. From the metagenomes, we also found 42 homologs of the cytochrome P450 (CYP) superfamily putatively involved in polycyclic aromatic hydrocarbon (PAH) metabolism. Our results encourage further studies into the genetic adaptations of <italic>S. methanicola</italic> to its methane hydrate habitat, especially in the context of deep-sea ecology and nutrient cycling.</p>
</abstract>
<kwd-group>
<kwd>deep-sea</kwd>
<kwd>Gulf of Mexico</kwd>
<kwd>methane hydrate</kwd>
<kwd>polychaete</kwd>
<kwd>worm</kwd>
<kwd>mitogenome</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content>
</contract-sponsor>
<contract-sponsor id="cn003">National Institutes of Health<named-content content-type="fundref-id">10.13039/100000002</named-content>
</contract-sponsor>
<counts>
<fig-count count="7"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="68"/>
<page-count count="17"/>
<word-count count="7358"/>
</counts>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>    <p>Methane is a potent greenhouse gas, and methane hydrates represent one of the largest carbon reservoirs in the world; therefore, the dynamics of these deposits garner interest with regard to global carbon cycles, climate change, and as sources of alternative energy (<xref ref-type="bibr" rid="B48">National Energy Technology Laboratory, 2017</xref>). In marine cold seep locations, methane rises from the seafloor and may freeze into a crystalline clathrate structure under sufficiently low temperature and high pressure. These deposits are commonly known as methane hydrates (also called methane clathrates, gas hydrates, methane ice, hydromethane, or fire ice) (<xref ref-type="bibr" rid="B36">Kvenvolden, 1995</xref>; <xref ref-type="bibr" rid="B65">Tunnicliffe et&#xa0;al., 2003</xref>). A variety of species have been identified at cold seeps, such as chemosymbiotic bivalves, polychaetes, shrimps, amphipods, cnidarians, and sponges, but most invertebrates do not interact physically with gas hydrates (<xref ref-type="bibr" rid="B15">Desbruy&#xe8;res and Toulmond, 1998</xref>; <xref ref-type="bibr" rid="B62">Sibuet and Olu, 1998</xref>; <xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B65">Tunnicliffe et&#xa0;al., 2003</xref>; <xref ref-type="bibr" rid="B66">Van Dover et&#xa0;al., 2003</xref>; <xref ref-type="bibr" rid="B39">Levin, 2005</xref>; <xref ref-type="bibr" rid="B16">Dubilier et&#xa0;al., 2008</xref>). A notable exception is the marine polychaete, <italic>Sirsoe methanicola</italic> (previously <italic>Hesiocaeca methanicola</italic>). First discovered in 1997 in the Green Canyon area of the Gulf of Mexico, USA, <italic>S. methanicola</italic> creates burrows in methane hydrates and inhabits them mostly with single occupancy (<xref ref-type="bibr" rid="B15">Desbruy&#xe8;res and Toulmond, 1998</xref>; <xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>). Members of the <italic>Sirsoe</italic> genus have also been found in deep-sea whale falls, vents, and seeps, but <italic>S. methanicola</italic> is the only known <italic>Sirsoe</italic> species to inhabit methane hydrates (<xref ref-type="bibr" rid="B61">Shimabukuro et&#xa0;al., 2019</xref>) and is the only macrofauna known to inhabit deposits in the Gulf of Mexico. Besides <italic>S. methanicola</italic>, a novel alvinocarid shrimp morphospecies has been found atop exposed methane hydrates in the Blake Ridge Diapir of the South Atlantic Bight (<xref ref-type="bibr" rid="B66">Van Dover et&#xa0;al., 2003</xref>).</p>
<p>
<italic>Sirsoe methanicola</italic> could play important ecological roles in the methane hydrate habitat through bioturbation, methane release, and subsequent methane hydrate dissociation. <italic>Sirsoe methanicola</italic> is thought to introduce oxygen to a methane hydrate by generating water currents on the surface with its parapodia (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>). The resulting oxygen can support microaerophilic microbial growth and facilitate depression formation on the methane hydrate surface, leading to its subsequent dissociation (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>).</p>
<p>
<italic>Sirsoe methanicola</italic> possesses a functional digestive system with a gut and appears to be a bacterivore feeding on a variety of bacteria from the surface of gas hydrates, although details of its life history or genetic capacities are not fully understood (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B5">Becker et&#xa0;al., 2013</xref>). Methane hydrates provide a variety of potential substrates to support microbial life, including thermogenic methane, hydrogen sulfide, hydrocarbon gases (such as ethane, propane, isobutane, butane, and pentane), and carbon dioxide (<xref ref-type="bibr" rid="B36">Kvenvolden, 1995</xref>; <xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B38">Lanoil et&#xa0;al., 2001</xref>; <xref ref-type="bibr" rid="B29">Joye et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B47">Mills et&#xa0;al., 2005</xref>). Despite the &#x201c;methane&#x201d; moniker of the ice worm habitat, our recent metagenomic analysis of <italic>S. methanicola</italic> gut contents and worm fragments revealed a paucity of reads assigned to aerobic or anaerobic methanotrophic taxa (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>). Metagenomes associated with <italic>S. methanicola</italic> were instead dominated by <italic>Sulfurospirillum</italic> and included other prokaryotic taxa capable of nitrogen, sulfur, and carbon cycling (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>).</p>
<p>From the <italic>S. methanicola</italic> metagenomes, we identified microbial genes involved in the degradation of hydrocarbon compounds, such as alkanes, benzoate, toluene, xylene, and phenol (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>). Results were consistent with reports of low and high molecular weight hydrocarbons found in the methane hydrate where <italic>S. methanicola</italic> was initially collected (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>) and the emanation of a petroleum smell from the guts of <italic>S. methanicola</italic> individuals during dissection (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>). The high number of metagenomic reads sequenced during the microbiome study (&gt;2M paired-end reads per library) provided an opportunity to mine for host-related genes (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>). Here we recovered and analyzed mitogenome, gene, and cytochrome P450 (CYP) enzyme superfamily annotations to explore the genetic features of this deep sea polychaete in relation to its unique deep-sea ecology.</p>
<p>Although little is known about <italic>S. methanicola</italic>, the shallow marine polychaete <italic>Capitella teleta</italic> has been well studied. <italic>Capitella teleta</italic> feeds on shallow marine sediments rich in organic matter such as fuel oil and other pollutants (<xref ref-type="bibr" rid="B8">Blake et&#xa0;al., 2009</xref>). Degradation of the polycyclic aromatic hydrocarbon (PAH) fluoranthene by <italic>C. teleta</italic> has been demonstrated, likely without aid from its gut microbiome (<xref ref-type="bibr" rid="B19">Forbes et&#xa0;al., 2001</xref>; <xref ref-type="bibr" rid="B59">Selck et&#xa0;al., 2003</xref>; <xref ref-type="bibr" rid="B25">Jang et&#xa0;al., 2020</xref>). Involvement of CYP was postulated because CYP-dependent activity and CYP expression in <italic>C. teleta</italic> increased with PAH exposure (<xref ref-type="bibr" rid="B40">Li et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B14">Dejong and Wilson, 2014</xref>). Sequencing has been described for the <italic>C</italic>. <italic>teleta</italic> genome (<xref ref-type="bibr" rid="B63">Simakov et&#xa0;al., 2013</xref>) with an annotated CYPome (<xref ref-type="bibr" rid="B14">Dejong and Wilson, 2014</xref>) and for the gut microbiome (<xref ref-type="bibr" rid="B22">Hochstein et&#xa0;al., 2019</xref>; <xref ref-type="bibr" rid="B25">Jang et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B26">Jang et&#xa0;al., 2021</xref>). Given the hydrocarbon rich habitat of <italic>S. methanicola</italic> (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>), we hypothesized that the <italic>S. methanicola</italic> genetic repertoire may include analogous hydrocarbon degradation capability.</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<label>2</label>
<title>Materials and methods</title>
<p>Procedures for sample collection, processing, sequencing, and metagenomic analyses were previously documented (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>), including diagrams that illustrated sample collection design. Details of bioinformatic methods were provided in the supplemental information. Methods are re-summarized here for the readers&#x2019; convenience, with additional details provided for analysis of eukaryotic sequences.</p>
<sec id="s2_1">
<label>2.1</label>
<title>Sample collection and processing</title>
<p>Live <italic>S. methanicola</italic> specimens were collected as part of the R/V <italic>Seward Johnson</italic> cruise SJ-2009-GOM, operated by Harbor Branch Oceanographic Institute, Fort Pierce, FL, USA. Using the manipulator arm of the crewed Johnson Sea-Link II, specimens were retrieved from a Gulf of Mexico methane hydrate in the Green Canyon area GC234 (27&#xb0;44.7526&#x2019; N, 91&#xb0;13.3168&#x2019; W) at a depth of 542.8 meters on October 3, 2009 at 10:31 am UTC during Dive #3751. All specimens were rinsed with 0.2-&#x3bc;m filtered seawater prior to aseptic dissection to expose the worm gut, and five out of the seven dissected specimens were spawned on the ship prior to dissection.</p>
<p>Gut contents were extracted from the seven specimens with a sterile syringe without removal of the gut itself. Samples were placed into microcentrifuge tubes, centrifuged at 13,200 rpm for 15 minutes on the ship, and the supernatant was removed. The remaining pellet was preserved in 95% ethanol and frozen at &#x2013;80&#xb0;C on the ship and upon returning to the laboratory. The sample used for Illumina HiSeq sequencing (Tube A) contained gut contents pooled from two unspawned worms. The sample used for Illumina MiSeq sequencing (Tube D) contained the gut contents of one spawned worm. Worm fragments contained various tissues (including heads and bristles, and guts) left over from the dissection and gut content extraction. These were pooled in a 4-ml sterile polycarbonate tube, covered with 95% ethanol, and stored at &#x2013;20&#xb0;C on the ship and at &#x2013;80&#xb0;C upon returning to the laboratory. The sample used for HiSeq sequencing (Tube Loc-2) contained the worm fragments pooled from all seven dissected worms.</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Library preparation and sequencing</title>
<p>DNA was extracted from all processed samples using the Qiagen DNeasy Tissue &amp; Blood Kit (Valencia, CA, USA) on February 6, 2012 and quantified using the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). For Illumina HiSeq sequencing, 1 &#x3bc;g of DNA each from Tube A and Tube Loc-2 was sheared into 200&#x2013;300-bp fragments using the Covaris S2 instrument (Woburn, MA, USA) to produce library G and library W, respectively. The fragments were end-repaired and used for library preparation using the TruSeq DNA Sample Preparation Kit (Illumina) with 12 cycles of amplification. From each library, 200&#x2013;250-bp fragments were selected using gel size selection for paired-end sequencing at Scripps Research (formerly The Scripps Research Institute; La Jolla, CA, USA) on a single lane of the Illumina HiSeq 2000 (2 x 100 bp) platform. For Illumina MiSeq sequencing, the Nextera XT library preparation kit (Illumina) was used to prepare metagenomic library G-Mi from Tube D. This library was quantitated using the Qubit&#x2122; dsDNA assay (Life Technologies, Austin, TX, USA) and sequenced on the Illumina MiSeq (2 x 300 bp) platform at San Diego State University.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Metagenomic analysis</title>
<p>Reads from the three metagenomic libraries (W, G, and G-Mi) were assembled by various software to obtain full-length small subunit (SSU) rRNA sequences, whole metagenomes, and the worm mitogenome, as detailed below.</p>
<sec id="s2_3_1">
<label>2.3.1</label>
<title>Full-length SSU rRNA sequence assembly</title>
<p>Reads from each library were assembled separately into full-length eukaryotic and prokaryotic SSU rRNA sequences using the default parameters of phyloFlash v3.4 (<xref ref-type="bibr" rid="B21">Gruber-Vodicka et&#xa0;al., 2020</xref>). Assembled sequences classified by phyloFlash as <italic>S</italic>. <italic>methanicola</italic> were searched against NCBI GenBank (<xref ref-type="bibr" rid="B6">Benson et&#xa0;al., 2018</xref>) <italic>via</italic> the NCBI blastn web interface (<xref ref-type="bibr" rid="B27">Johnson et&#xa0;al., 2008</xref>) to identify matching genes and sequences (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>).</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>Taxonomic assignments to the polychaete <italic>S. methanicola</italic> based on nuclear 18S rRNA and 28S rRNA gene annotations recovered from the three metagenomic libraries.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" rowspan="2" align="left">GenBank Accession (this study)</th>
<th valign="middle" rowspan="2" align="center">Sequence length (bp)</th>
<th valign="middle" rowspan="2" align="center">Gene</th>
<th valign="middle" rowspan="2" align="center">Assembly method</th>
<th valign="middle" rowspan="2" align="center">Library Name</th>
<th valign="middle" rowspan="2" align="center"># reads mapped to sequence</th>
<th valign="top" colspan="4" align="center">GenBank best hit<break/>[accessed 8/4/22]</th>
</tr>
<tr>
<th valign="middle" align="center">Accession</th>
<th valign="top" align="center">% identity</th>
<th valign="top" align="center">Alignment length (bp)</th>
<th valign="top" align="center">Reference</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">MZ224434</td>
<td valign="top" align="center">1,817</td>
<td valign="middle" rowspan="3" align="center">18S rRNA</td>
<td valign="middle" rowspan="3" align="left">phyloFlash</td>
<td valign="top" align="left">G-Mi</td>
<td valign="top" align="center">1,232</td>
<td valign="middle" rowspan="3" align="left">JN631332</td>
<td valign="middle" rowspan="3" align="center">100</td>
<td valign="middle" rowspan="3" align="center">1,778</td>
<td valign="middle" rowspan="3" align="left">
<xref ref-type="bibr" rid="B55">Pleijel et&#xa0;al., 2012</xref>
</td>
</tr>
<tr>
<td valign="top" align="left">MZ224435</td>
<td valign="top" align="center">1,781</td>
<td valign="top" align="left">W</td>
<td valign="top" align="center">246,713</td>
</tr>
<tr>
<td valign="top" align="left">MZ224436</td>
<td valign="top" align="center">1,817</td>
<td valign="top" align="left">G</td>
<td valign="top" align="center">89,883</td>
</tr>
<tr>
<td valign="top" align="left">OP169017</td>
<td valign="top" align="center">782</td>
<td valign="middle" rowspan="3" align="center">28S rRNA</td>
<td valign="middle" rowspan="3" align="left">MEGAHIT</td>
<td valign="top" align="left">G-Mi</td>
<td valign="top" align="center">not mapped</td>
<td valign="middle" rowspan="3" align="left">DQ442611</td>
<td valign="middle" rowspan="3" align="center">99</td>
<td valign="middle" rowspan="3" align="center">770</td>
<td valign="top" rowspan="3" align="left">
<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>
</td>
</tr>
<tr>
<td valign="top" align="left">OL704460</td>
<td valign="top" align="center">782</td>
<td valign="top" align="left">G</td>
<td valign="top" align="center">not mapped</td>
</tr>
<tr>
<td valign="top" align="left">OL704461</td>
<td valign="top" align="center">782</td>
<td valign="top" align="left">W</td>
<td valign="top" align="center">not mapped</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2_3_2">
<label>2.3.2</label>
<title>Whole metagenome assembly</title>
<p>Reads were trimmed at Q-score threshold of 30 using Trim Galore! V0.6.5 (<uri xlink:href="https://github.com/FelixKrueger/TrimGalore">https://github.com/FelixKrueger/TrimGalore</uri>), a wrapper tool around cutadapt (<xref ref-type="bibr" rid="B45">Martin, 2011</xref>), and FastQC (<uri xlink:href="https://www.bioinformatics.babraham.ac.uk/projects/fastqc/">https://www.bioinformatics.babraham.ac.uk/projects/fastqc/</uri>). Read qualities pre- and post-trimming were assessed with FastQC v0.11.9. Each metagenomic library was individually assembled on the KBase server (<xref ref-type="bibr" rid="B4">Arkin et&#xa0;al., 2018</xref>) using MEGAHIT v1.2.9 (<xref ref-type="bibr" rid="B43">Li et&#xa0;al., 2016</xref>) with the &#x201c;meta-large&#x201d; preset option for large and complex assembly (kmin=21, kmax=99, and kstep=20). Reads from the three metagenomic libraries were not co-assembled because of computational resource limitations. The MiSeq-sequenced library G-Mi from the gut contents in Tube D was additionally assembled using the &#x201c;mega-sensitive&#x201d; option of MEGAHIT (kmin=21, kmax=255, and kstep=20), the default parameters of IDBA-UD v1.1.3 (<xref ref-type="bibr" rid="B52">Peng et&#xa0;al., 2012</xref>), and metaSPAdes v3.13.0 (k=21,33,55,77,99,127) (<xref ref-type="bibr" rid="B51">Nurk et&#xa0;al., 2017</xref>). For consistency with other HiSeq-sequenced metagenomes, the metagenome assembled from G-Mi with the &#x201c;meta-large&#x201d; preset option was used for 18S and 28S rRNA gene sequence retrieval and functional annotations.</p>
<sec id="s2_3_2_1">
<label>2.3.2.1</label>
<title>28S rRNA gene sequence retrieval</title>
<p>The 28S rRNA gene sequence from the ribosomal large subunit (LSU) of <italic>S. methanicola</italic> was retrieved by running blastn searches implemented in BLAST 2.10.1+ (<xref ref-type="bibr" rid="B9">Camacho et&#xa0;al., 2009</xref>). Published 28S rRNA gene sequence (NCBI accession: DQ442611) for <italic>S. methanicola</italic> (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>) were queried against the MEGAHIT-assembled metagenomes (<xref ref-type="bibr" rid="B43">Li et&#xa0;al., 2016</xref>) from the gut contents and worm fragments (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>).</p>
</sec>
<sec id="s2_3_2_2">
<label>2.3.2.2</label>
<title>Worm mitogenome assembly</title>
<p>Draft mitogenomes of <italic>S. methanicola</italic> were recovered by querying previously reported 16S rRNA gene sequence (NCBI accession: DQ442582) and cytochrome <italic>c</italic> oxidase subunit I (<italic>cox1</italic>) sequence (NCBI accession: DQ513295) for <italic>S. methanicola</italic> (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>) against the MEGAHIT-assembled metagenomes from the gut contents and worm fragments using the blastn function in the BLAST+ application (<xref ref-type="bibr" rid="B9">Camacho et&#xa0;al., 2009</xref>). Matching 18,000-bp and 16,108-bp contigs from the gut content and worm fragment metagenomes, respectively, were searched against NCBI GenBank using the web blastn interface (<xref ref-type="bibr" rid="B27">Johnson et&#xa0;al., 2008</xref>). These contigs were deduced to be <italic>S. methanicola</italic> mitogenomic sequences, based on matches to mitochondrial sequences from the genus <italic>Sirsoe</italic> and matches to mitogenomes of Polychaeta species.</p>
<p>Sequences from both <italic>S. methanicola</italic> draft mitogenomes were compared against each other using the web blastn interface and annotated with the MITOS web server (<xref ref-type="bibr" rid="B7">Bernt et&#xa0;al., 2013</xref>) and MitoZ v2.3 (<xref ref-type="bibr" rid="B46">Meng et&#xa0;al., 2019</xref>) based on the invertebrate mitochondrial code. The draft mitogenome assembled from the gut contents contained no missing genes, while the draft mitogenome from the worm fragments was missing 16 genes. The mitogenome assembled from the gut contents was retained for further annotation. Gene annotations produced by MITOS and MitoZ were manually reviewed and corrected by aligning the <italic>S. methanicola</italic> draft mitogenome with the <italic>Goniada japonica</italic> mitogenome (NC_026995/KP867019) (<xref ref-type="bibr" rid="B12">Chen et&#xa0;al., 2016</xref>), which was identified by MitoZ to be the most closely related to the <italic>S. methanicola</italic> draft genomes. Manual mitogenome annotation was aided by the blastn web interface with the Coding Sequences (CDS) feature display and performed according to tutorials published by NCBI (<uri xlink:href="https://support.nlm.nih.gov/knowledgebase/article/KA-05223/en-us">https://support.nlm.nih.gov/knowledgebase/article/KA-05223/en-us</uri>). Internal stop codons were identified in the sequences encoding cytochrome <italic>c</italic> oxidase subunit I (<italic>cox1</italic>) and NADH dehydrogenase subunit 2 (<italic>nd2</italic>). The internal stop codon in <italic>cox1</italic> was due to a 497-bp insertion in the <italic>S. methanicola</italic> draft mitogenome and these bases were manually removed. The internal stop codon in NADH dehydrogenase subunit 2 (<italic>nd2</italic>) was due to an insertion causing a frameshift in the <italic>S. methanicola</italic> draft mitogenome. This frameshift was additionally verified by aligning the translated nucleotide sequence of <italic>nd2</italic> in the <italic>S. methanicola</italic> draft genome with the protein and nucleotide sequences of <italic>nd2</italic> in <italic>Hesionides</italic> sp. <italic>PA-2020</italic> (MN855167/QHT64973) (<xref ref-type="bibr" rid="B2">Alves et&#xa0;al., 2020</xref>) using web blastx and tblastn searches against NCBI nr/nt. To correct the frameshift, a 100-bp gene region that was not homologous to <italic>Goniada japonica</italic> and <italic>Hesionides</italic> sp. <italic>PA-2020 nd2</italic> sequences was removed from the <italic>S. methanicola</italic> draft mitogenome. The original draft mitogenomes recovered from the gut content (18,000 bp) and worm fragment (16,108 bp) metagenomes are provided in the Supplementary Data, and the manually annotated <italic>S. methanicola</italic> mitogenome is deposited to NCBI RefSeq with the accession number NC_064058.</p>
<p>MitoZ (<xref ref-type="bibr" rid="B46">Meng et&#xa0;al., 2019</xref>) was used to compute the G+C content (window size=50) and sequencing depth of each library (library G-Mi, library G, and library W) along the final representative <italic>S. methanicola</italic> mitogenome. The GenBank annotations, G+C content, and depth data files were used to visualize the <italic>S. methanicola</italic> genome with circos v0.69-8 (<xref ref-type="bibr" rid="B34">Krzywinski et&#xa0;al., 2009</xref>), based on the circos configuration file templates generated by MitoZ. MEGAX (<xref ref-type="bibr" rid="B35">Kumar et&#xa0;al., 2018</xref>) was used to calculate the relative synonymous codon usage (RSCU) in the <italic>S. methanicola</italic> mitogenome, which is the frequency of a codon divided by the average frequency of all synonymous codons for an amino acid (<xref ref-type="bibr" rid="B60">Sharp and Li, 1987</xref>). An RSCU value of 1 indicates no codon usage bias, while RSCU values above and below 1 represent positive and negative bias, respectively. Codons with RSCU values &gt;1.6 were considered overrepresented, while codons with RSCU values &lt;0.6 were considered underrepresented (<xref ref-type="bibr" rid="B67">Wong et&#xa0;al., 2010</xref>).</p>
</sec>
<sec id="s2_3_2_3">
<label>2.3.2.3</label>
<title>Phylogenetic analyses</title>
<p>Nucleotide sequences of 16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic> genes assembled from <italic>S. methanicola</italic> (see subsections 2.3.1, 2.3.2.1, and 2.3.2.2) were concatenated and used for phylogenetic analysis. Comparisons utilized a subset of species selected from <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref> of Rouse et&#xa0;al. (<xref ref-type="bibr" rid="B56">Rouse et&#xa0;al., 2018</xref>) that had reference sequences available for all four of the analyzed genes (16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic>). These included 32 species from the family <italic>Hesionidae</italic> and the outgroup species <italic>Dsyponetus caecus</italic> from family <italic>Chrysopetalidae</italic> (<xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>). Sequences from the genus <italic>Sirsoe</italic>, including <italic>S. methanicola</italic>, <italic>S. dalailamai</italic>, <italic>S. munki</italic>, and <italic>S. sirikos</italic>, were part of a larger clade of the hesionid subfamily <italic>Psamathinae</italic> (<xref ref-type="bibr" rid="B53">Pleijel, 1998</xref>).</p>
<table-wrap id="T2" position="float">
<label>Table&#xa0;2</label>
<caption>
<p>GenBank accession numbers for <italic>Hesionidae</italic> and <italic>Chrysopetalidae</italic> (outgroup) species used for phylogenetic analysis. <italic>Sirsoe methanicola</italic> sequences assembled from this study are highlighted in bold.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="bottom" align="left">Species</th>
<th valign="bottom" align="center">18S rRNA</th>
<th valign="bottom" align="center">16S rRNA</th>
<th valign="bottom" align="center">28S rRNA</th>
<th valign="bottom" align="center">
<italic>cox1</italic>
</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="bottom" align="left">
<italic>Dsyponetus caecus</italic> (outgroup)</td>
<td valign="bottom" align="center">AY839568</td>
<td valign="bottom" align="center">EU555047</td>
<td valign="bottom" align="center">EU555028</td>
<td valign="bottom" align="center">AF221568</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Amphiduros fuscescens</italic>
</td>
<td valign="bottom" align="center">DQ442584</td>
<td valign="bottom" align="center">DQ442569</td>
<td valign="bottom" align="center">DQ442598</td>
<td valign="bottom" align="center">DQ442561</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Amphiduros cf. axialensis</italic>
</td>
<td valign="bottom" align="center">MG649239</td>
<td valign="bottom" align="center">MG523356</td>
<td valign="bottom" align="center">MG649243</td>
<td valign="bottom" align="center">MG517505</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Amphiduros pacificus</italic>
</td>
<td valign="bottom" align="center">JN631334</td>
<td valign="bottom" align="center">JN631324</td>
<td valign="bottom" align="center">JN631345</td>
<td valign="bottom" align="center">JN631312</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Gyptis brunnea</italic>
</td>
<td valign="bottom" align="center">JN631335</td>
<td valign="bottom" align="center">JN631323</td>
<td valign="bottom" align="center">JN631346</td>
<td valign="bottom" align="center">JN631313</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Gyptis hians</italic>
</td>
<td valign="bottom" align="center">JN571891</td>
<td valign="bottom" align="center">JN571880</td>
<td valign="bottom" align="center">JN571900</td>
<td valign="bottom" align="center">JN571824</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Gyptis pacifica</italic>
</td>
<td valign="bottom" align="center">JN631337</td>
<td valign="bottom" align="center">JN631322</td>
<td valign="bottom" align="center">JN631348</td>
<td valign="bottom" align="center">JN631314</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Gyptis robertscrippsi</italic> sp. <italic>nov.</italic>
</td>
<td valign="bottom" align="center">MG649238</td>
<td valign="bottom" align="center">MG523360</td>
<td valign="bottom" align="center">MG649247</td>
<td valign="bottom" align="center">MG517513</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Hesiospina aurantiaca</italic>
</td>
<td valign="bottom" align="center">JN631329</td>
<td valign="bottom" align="center">JN631319</td>
<td valign="bottom" align="center">JF317203</td>
<td valign="bottom" align="center">JN631342</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Hesiospina vestimentifera</italic>
</td>
<td valign="bottom" align="center">JN631330</td>
<td valign="bottom" align="center">JN631320</td>
<td valign="bottom" align="center">JN631343</td>
<td valign="bottom" align="center">JN631310</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Leocrates chinensis</italic>
</td>
<td valign="bottom" align="center">DQ442589</td>
<td valign="bottom" align="center">DQ442575</td>
<td valign="bottom" align="center">DQ442605</td>
<td valign="bottom" align="center">DQ442565</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Micropodarke dubia</italic>
</td>
<td valign="bottom" align="center">JN571888</td>
<td valign="bottom" align="center">DQ442576</td>
<td valign="bottom" align="center">JN571899</td>
<td valign="bottom" align="center">JN571825</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Neogyptis carriebowcayi</italic>
</td>
<td valign="bottom" align="center">JN631338</td>
<td valign="bottom" align="center">JN631325</td>
<td valign="bottom" align="center">JN631349</td>
<td valign="bottom" align="center">JN631315</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Neogyptis hinehina</italic>
</td>
<td valign="bottom" align="center">JN631340</td>
<td valign="bottom" align="center">JN631328</td>
<td valign="bottom" align="center">JN631350</td>
<td valign="bottom" align="center">JN631317</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Neogyptis julii</italic>
</td>
<td valign="bottom" align="center">KP745538</td>
<td valign="bottom" align="center">KP745535</td>
<td valign="bottom" align="center">KP745541</td>
<td valign="bottom" align="center">KP745532</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Neogyptis rosea</italic>
</td>
<td valign="bottom" align="center">JN571890</td>
<td valign="bottom" align="center">DQ442574</td>
<td valign="bottom" align="center">DQ442603</td>
<td valign="bottom" align="center">JN571826</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Neogyptis</italic> sp. <italic>A AN-2012</italic>
</td>
<td valign="bottom" align="center">JN631341</td>
<td valign="bottom" align="center">JN631327</td>
<td valign="bottom" align="center">JN631351</td>
<td valign="bottom" align="center">JN631318</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Nereimyra punctata</italic>
</td>
<td valign="bottom" align="center">DQ442591</td>
<td valign="bottom" align="center">DQ442577</td>
<td valign="bottom" align="center">DQ442606</td>
<td valign="bottom" align="center">DQ442566</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Oxydromus flexuosus</italic>
</td>
<td valign="bottom" align="center">DQ442592</td>
<td valign="bottom" align="center">DQ442578</td>
<td valign="bottom" align="center">DQ442607</td>
<td valign="bottom" align="center">DQ442567</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Oxydromus pugettensis</italic>
</td>
<td valign="bottom" align="center">DQ790086</td>
<td valign="bottom" align="center">KJ855069</td>
<td valign="bottom" align="center">KJ855081</td>
<td valign="bottom" align="center">KJ855074</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Podarkeopsis arenicolus</italic>
</td>
<td valign="bottom" align="center">JN571889</td>
<td valign="bottom" align="center">JN571879</td>
<td valign="bottom" align="center">DQ442609</td>
<td valign="bottom" align="center">JN571827</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Podarkeopsis perkinsi</italic>
</td>
<td valign="bottom" align="center">JN571892</td>
<td valign="bottom" align="center">JN571881</td>
<td valign="bottom" align="center">JN571901</td>
<td valign="bottom" align="center">JN571828</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Sirsoe dalailamai</italic>
</td>
<td valign="bottom" align="center">MG649240</td>
<td valign="bottom" align="center">MG523357</td>
<td valign="bottom" align="center">MG649245</td>
<td valign="bottom" align="center">MG517498</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Sirsoe methanicola</italic>
</td>
<td valign="bottom" align="center">JN631332</td>
<td valign="bottom" align="center">DQ442582</td>
<td valign="bottom" align="center">DQ442611</td>
<td valign="bottom" align="center">DQ513295</td>
</tr>
<tr>
<td valign="bottom" align="left">
<bold>
<italic>Sirsoe methanicola</italic> isolate G/G-Mi/W</bold>
</td>
<td valign="bottom" align="center">
<bold>MZ224436</bold>
</td>
<td valign="bottom" align="center">
<bold>NC_064058</bold>
</td>
<td valign="bottom" align="center">
<bold>OL704460</bold>
</td>
<td valign="bottom" align="center">
<bold>NC_064058</bold>
</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Sirsoe munki</italic>
</td>
<td valign="bottom" align="center">MG649241</td>
<td valign="bottom" align="center">MG523358</td>
<td valign="bottom" align="center">MG649246</td>
<td valign="bottom" align="center">MG517510</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Sirsoe sirikos</italic>
</td>
<td valign="bottom" align="center">JN571893</td>
<td valign="bottom" align="center">JN571882</td>
<td valign="bottom" align="center">JN571902</td>
<td valign="bottom" align="center">JN571829</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Syllidia armata</italic>
</td>
<td valign="bottom" align="center">DQ442596</td>
<td valign="bottom" align="center">DQ442583</td>
<td valign="bottom" align="center">DQ442612</td>
<td valign="bottom" align="center">DQ442568</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Vrijenhoekia ahabi</italic>
</td>
<td valign="bottom" align="center">JN571898</td>
<td valign="bottom" align="center">JN571887</td>
<td valign="bottom" align="center">JN571907</td>
<td valign="bottom" align="center">JN571876</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Vrijenhoekia balaenophila</italic>
</td>
<td valign="bottom" align="center">JN571895</td>
<td valign="bottom" align="center">JN571884</td>
<td valign="bottom" align="center">JN571904</td>
<td valign="bottom" align="center">JN571831</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Vrijenhoekia falenothiras</italic>
</td>
<td valign="bottom" align="center">JN571897</td>
<td valign="bottom" align="center">JN571886</td>
<td valign="bottom" align="center">JN571906</td>
<td valign="bottom" align="center">JN571875</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Vrijenhoekia ketea</italic>
</td>
<td valign="bottom" align="center">JN571896</td>
<td valign="bottom" align="center">JN571885</td>
<td valign="bottom" align="center">JN571905</td>
<td valign="bottom" align="center">JN571838</td>
</tr>
<tr>
<td valign="bottom" align="left">
<italic>Vrijenhoekia</italic> sp. <italic>A MS-2015</italic>
</td>
<td valign="bottom" align="center">KP745539</td>
<td valign="bottom" align="center">KP745536</td>
<td valign="bottom" align="center">KP745542</td>
<td valign="bottom" align="center">KP745533</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In addition, amino acid sequences of protein-coding genes (PCGs) annotated in the assembled <italic>S. methanicola</italic> mitogenome (see subsection 2.3.2.2) were compared with published mitogenomes. No assembled <italic>Hesionidae</italic> mitogenomes were available; therefore, 35 mitogenomes from the order <italic>Phyllodocida</italic> (<xref ref-type="supplementary-material" rid="SF1">
<bold>Table S1</bold>
</xref>) were retrieved from NCBI&#x2019;s Organelle Genome Resources (<xref ref-type="bibr" rid="B58">Sayers et&#xa0;al., 2021</xref>). Additionally, the <italic>S. methanicola</italic> mitogenome sequence was queried against NCBI GenBank (<xref ref-type="bibr" rid="B6">Benson et&#xa0;al., 2018</xref>) using the NCBI blastn web interface (<xref ref-type="bibr" rid="B27">Johnson et&#xa0;al., 2008</xref>) to identify two other <italic>Phyllodocida</italic> mitogenomes not listed in Organelle Genome Resources (<xref ref-type="supplementary-material" rid="SF1">
<bold>Table S1</bold>
</xref>). The mitogenome of <italic>Hydroides elegans</italic> from the polychaete order <italic>Sabellida</italic>, retrieved from Organelle Genome Resources, was used as the outgroup (<xref ref-type="supplementary-material" rid="SF1">
<bold>Table S1</bold>
</xref>). Phylogenetic analysis employed 12 of 13 PCGs annotated in these 37 <italic>Phyllodocida</italic> mitogenomes. The <italic>atp8</italic> gene encoding ATP synthase F0 subunit 8 was excluded because it was absent in three mitogenomes (<xref ref-type="supplementary-material" rid="SF1">
<bold>Table S1</bold>
</xref>).</p>
<p>All sequences were downloaded from the NCBI database. Sequences for each gene were separately aligned with MAFFT v7.475 using the L-INS-I option recommended for &lt;200 sequences, which is an iterative refinement method that produces accurate multiple sequence alignments with local pairwise alignment information (<xref ref-type="bibr" rid="B33">Katoh and Standley, 2013</xref>). Conserved blocks within each multiple sequence alignment were identified using gblocks v0.91b (<xref ref-type="bibr" rid="B10">Castresana, 2000</xref>). Using gblocks, conserved blocks for 16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic> were concatenated into a single nucleotide sequence alignment, while conserved blocks for the protein-coding genes in the mitogenomes were concatenated into a single amino acid sequence alignment.</p>
<p>MEGAX (<xref ref-type="bibr" rid="B35">Kumar et&#xa0;al., 2018</xref>) was used to identify the best model for each concatenated alignment. For the concatenated 16S rRNA+18S rRNA+28S rRNA+<italic>cox1</italic> alignment, the best model was the General Time Reversible model (<xref ref-type="bibr" rid="B49">Nei and Kumar, 2000</xref>) with a discrete Gamma distribution that allows for evolutionary invariable sites (GTR+G+I). For the concatenated amino acid sequence alignment from mitogenomes, the best model was the General Reversible Mitochondria model (<xref ref-type="bibr" rid="B1">Adachi and Hasegawa, 1996</xref>) with Gamma distribution and frequencies (mtREV24+G+F). Phylogenetic trees for both concatenated alignments were constructed separately in MEGAX (<xref ref-type="bibr" rid="B35">Kumar et&#xa0;al., 2018</xref>) using the Maximum Likelihood method with 1,000 bootstrap replicates. The Maximum Composite Likelihood (MCL) or Jones-Taylor-Thornton model (<xref ref-type="bibr" rid="B28">Jones et&#xa0;al., 1992</xref>) was used to estimate a pairwise distance matrix for the nucleotide and protein sequence alignments, respectively. These matrices were used to search for initial trees heuristically using Neighbor-Join and BioNJ algorithms. Subsequently, a tree topology with the highest log likelihood value was predicted for each concatenated alignment.</p>
</sec>
<sec id="s2_3_2_4">
<label>2.3.2.4</label>
<title>Mapping to reference genomes</title>
<p>Metagenomic reads obtained from <italic>S. methanicola</italic> gut contents (library G) and worm fragments (library W) were mapped separately to the genome of <italic>C. teleta</italic> (NCBI accession: GCA_000328365). Worm-related sequences were identified in these metagenomes by mapping trimmed reads using the default parameters of Bowtie2 v2.4.1 (<xref ref-type="bibr" rid="B37">Langmead and Salzberg, 2012</xref>) and SAMtools v1.10 (<xref ref-type="bibr" rid="B42">Li et&#xa0;al., 2009</xref>). Reads mapped to each <italic>C. teleta</italic> coding sequence were counted using HTSeq v0.12.4 (<xref ref-type="bibr" rid="B3">Anders et&#xa0;al., 2015</xref>). Each mapped <italic>C. teleta</italic> coding sequence was matched to the corresponding protein or nucleotide sequence in the <italic>S. methanicola</italic> metagenomes through blastp and tblastn searches performed using BLAST+ (<xref ref-type="bibr" rid="B9">Camacho et&#xa0;al., 2009</xref>), respectively. Mapping results of HiSeq-sequenced reads from libraries G and W were reported; results for MiSeq-sequenced reads from library G-Mi were not reported because of low coverage.</p>
</sec>
<sec id="s2_3_2_5">
<label>2.3.2.5</label>
<title>Metagenome annotation</title>
<p>From the metagenomes assembled by MEGAHIT (<xref ref-type="bibr" rid="B43">Li et&#xa0;al., 2016</xref>), 346,292 nucleotide sequences that did not bin into bacterial metagenome-assembled genomes (MAGs) (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>) were retrieved from the three libraries W, G, and G-Mi. These sequences were combined and annotated using the WebAUGUSTUS server (<xref ref-type="bibr" rid="B23">Hoff and Stanke, 2013</xref>). A training set containing the nucleotide and protein sequences in the genome of <italic>C. teleta</italic> (GCA_000328365) (<xref ref-type="bibr" rid="B63">Simakov et&#xa0;al., 2013</xref>) was submitted to WebAUGUSTUS to generate parameters for eukaryotic gene prediction in <italic>S. methanicola</italic>. From the <italic>C. teleta</italic> genomic and protein data, eukaryotic protein-coding genes in the <italic>S. methanicola</italic> metagenomes were predicted <italic>ab initio</italic> by WebAUGUSTUS using the default parameters (report genes on both strands; no alternative transcripts; and predict partial and complete genes). Assembled contigs &gt;1,000 bp from the <italic>S. methanicola</italic> metagenomes were used for eukaryotic gene prediction. Since WebAUGUSTUS has an input limit of 250,000 sequences per prediction job, assembled <italic>S. methanicola</italic> nucleotide sequences from all metagenomes were split into two datasets containing 195,167 contigs that were &#x2265;3,000 bp long and 151,125 contigs that were &lt;3,000 bp.</p>
<p>Predicted protein sequences from the <italic>S. methanicola</italic> metagenomes were submitted to the ghostKOALA web server (<xref ref-type="bibr" rid="B32">Kanehisa et&#xa0;al., 2016</xref>) maintained by the Kyoto Encyclopedia of Genes and Genomes (KEGG) (<xref ref-type="bibr" rid="B31">Kanehisa et&#xa0;al., 2020</xref>) for the assignment of taxonomy and KEGG Orthology (KO) terms. Protein sequences classified by ghostKOALA as Annelids were extracted based on similarities to sequences in the KEGG Genome database (<xref ref-type="bibr" rid="B31">Kanehisa et&#xa0;al., 2020</xref>). All KO terms classified as Annelids were submitted to the KEGG mapper server to identify complete pathway modules. Protein sequences in the complete pathway module M00141 (C1-unit interconversion, eukaryotes) were verified to be homologs of annelid sequences through web blastp searches against NCBI nr (<xref ref-type="bibr" rid="B27">Johnson et&#xa0;al., 2008</xref>).</p>
</sec>
<sec id="s2_3_2_6">
<label>2.3.2.6</label>
<title>Cytochrome P450 annotation</title>
<p>We compared 84 CYP superfamily protein sequences described in <italic>C. teleta</italic> (<xref ref-type="bibr" rid="B14">Dejong and Wilson, 2014</xref>) with those predicted from the <italic>S. methanicola</italic> metagenomes to identify eukaryotic CYP homologs that may respond to or detoxify PAHs. Based on the HTSeq output, no reads were mapped to the P450 genomic regions. Cytochrome P450 protein sequences in <italic>S. methanicola</italic> were alternatively identified by clustering WebAUGUSTUS-predicted protein sequences in the <italic>Sirsoe methanicola</italic> metagenomes with <italic>C. teleta</italic> cytochrome P450 protein sequences sequentially at 100%, 90%, 80%, 70%, 60%, 50%, 40% and 30% (psi-cd-hit command) global identity thresholds using CD-HIT (<xref ref-type="bibr" rid="B41">Li and Godzik, 2006</xref>). <italic>Sirsoe methanicola</italic> protein sequences that clustered with <italic>C. teleta</italic>&#x2019;s cytochrome P450 protein sequences were verified to be annelid CYP sequence homologs through web blastp searches against NCBI nr (<xref ref-type="bibr" rid="B27">Johnson et&#xa0;al., 2008</xref>). Sequence clusters were visualized with the igraph v1.2.6 R package (<xref ref-type="bibr" rid="B13">Cs&#xe1;rdi and Nepusz, 2006</xref>).</p>
</sec>
</sec>
</sec>
</sec>
<sec id="s3" sec-type="results">
<label>3</label>
<title>Results</title>
<p>A total of three metagenomic libraries were sequenced from <italic>S. methanicola</italic> specimens collected from a Gulf of Mexico methane hydrate located at GC234 (27&#xb0;44.7526&#x2019; N, 91&#xb0;13.3168&#x2019; W; <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>). Metagenomic sequencing provided the following numbers of paired-end reads: 1) 236.8M from gut contents pooled from two worms (HiSeq library G, Tube A); 2) 244.1M for non-axenic worm fragments containing gut tissues (HiSeq library W, Tube Loc-2); and 3) 1.3M for a gut content library sequenced with Illumina MiSeq (library G-Mi, Tube D). The text here details assignments associated with the host organism, <italic>S. methanicola</italic>. Analysis of prokaryotic diversity and function for this data set are provided under separate cover (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>
<bold>(A)</bold> <italic>Sirsoe methanicola</italic> individual viewed under a compound microscope (photo credit: R. Emlet/C. Young aboard R/V <italic>Seward Johnson</italic>; <bold>(B)</bold> <italic>S. methanicola</italic> individuals colonizing depressions on a methane hydrate (photo credit: SJ-2009-GOM-JSL2-3751-014, Johnson Sea Link II, Harbor Branch Oceanographic Institute); <bold>(C)</bold> Map showing location of the sampling site (generated from <uri xlink:href="https://www.simplemappr.net">https://www.simplemappr.net</uri>. Accessed December 15, 2022).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g001.tif"/>
</fig>
<sec id="s3_1">
<label>3.1</label>
<title>Nuclear rRNA genes in <italic>S. methanicola</italic>
</title>
<p>Assembly by phyloFlash (<xref ref-type="bibr" rid="B21">Gruber-Vodicka et&#xa0;al., 2020</xref>) yielded 1,817-bp 18S rRNA gene sequences from library G and library G-Mi and a 1,781-bp 18S rRNA gene sequence from library W (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>). Based on web blastn alignments, these sequences were 100% identical to each other and to another 1,778-bp segment of the 18S rRNA gene sequence published for <italic>S. methanicola</italic> (NCBI accession: JN631332) (<xref ref-type="bibr" rid="B55">Pleijel et&#xa0;al., 2012</xref>) (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>). A previous 18S rRNA sequence (1,745 bp) obtained from a clone library using the <italic>S. methanicola</italic> samples collected in this study (<xref ref-type="bibr" rid="B68">Xin, 2013</xref>) showed 99% identity to JN631332.</p>
<p>All three metagenomes assembled with MEGAHIT (<xref ref-type="bibr" rid="B43">Li et&#xa0;al., 2016</xref>) identified 28S rRNA LSU gene sequences that were 782 bp in length and 100% identical to each other (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>). These sequences shared 99% global sequence identity with a 770-bp segment of the 28S rRNA gene sequence published for <italic>S. methanicola</italic> (NCBI accession: DQ442611) (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>), differing only by one gap at position 417 in the alignment with DQ442611 (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>).</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>
<italic>Sirsoe methanicola</italic> mitogenome</title>
<p>From the metagenomes assembled with MEGAHIT (<xref ref-type="bibr" rid="B43">Li et&#xa0;al., 2016</xref>), we recovered draft mitogenomes of <italic>S. methanicola</italic> through sequence searches using the previously reported 16S rRNA gene sequence (NCBI accession: DQ442582) and <italic>cox1</italic> sequence (NCBI accession: DQ513295) for <italic>S. methanicola</italic> (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>) as query. Searching against the gut content metagenome from the MiSeq-sequenced library G-Mi produced no hit. However, matching 16S rRNA and <italic>cox1</italic> sequences were identified in a 18,000-bp contig assembled from the HiSeq-sequenced library G and another 16,108-bp contig assembled from the HiSeq-sequenced library W. Both contigs were deduced to be <italic>S. methanicola</italic> mitogenomic sequences, based on matches to mitochondrial sequences from the genus <italic>Sirsoe</italic> and matches to mitogenomes of Polychaeta species in NCBI GenBank (<xref ref-type="bibr" rid="B6">Benson et&#xa0;al., 2018</xref>). Based on MITOS (<xref ref-type="bibr" rid="B7">Bernt et&#xa0;al., 2013</xref>) and MitoZ (<xref ref-type="bibr" rid="B46">Meng et&#xa0;al., 2019</xref>) annotations, these draft mitogenomes were non-circular. The draft mitogenome assembled from the gut contents contained 38 genes with no missing genes, while the draft mitogenome assembled from the worm fragments contained only 22 genes and was missing 16 genes. Local alignment using the blastn web interface showed a shared region of 10,066 bp between both mitogenomes with 99% sequence identity and one gap. The mitogenome assembled from the gut contents was retained for downstream annotation and analysis to produce a representative mitogenome of <italic>S. methanicola</italic> (NCBI accessions: NC_064058/OM914591; <xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>).</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Map of the linear 17,403-bp <italic>S. methanicola</italic> mitogenome (NCBI accessions: NC_064058/OM91459) visualized by Circos implemented in MitoZ. The mitogenome includes protein-coding (green), rRNA (orange), and tRNA (red) gene regions. The red line on the G+C content track marks the G+C content threshold of 50%. The red lines on the depth distribution for library G-Mi indicate regions with sequencing depths lower than 20.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g002.tif"/>
</fig>
<p>The <italic>S. methanicola</italic> mitogenome was non-circular, consisting of 17,403 bp with 39.03% G+C content. Regions of low sequencing depths across all three libraries were mostly observed at the end of the mitogenome between 16 kb and 17.3 kb (<xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>). The mitogenome contained 13 PCGs, 24 tRNAs (including two split <italic>trnM</italic> genes), and 2 rRNAs (12S rRNA and 16S rRNA). All genes were located on the positive strand of the mitogenome. The full-length 1,302-bp 16S rRNA gene was 99% identical to a 538-bp partial 16S rRNA gene sequence reported for <italic>S. methanicola</italic> (NCBI accession: DQ442582) (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>), differing by four nucleotides. The full-length 1,536-bp <italic>cox1</italic> gene shared 99% identity with a 629-bp partial <italic>cox1</italic> gene sequence (NCBI accession: DQ513295) (<xref ref-type="bibr" rid="B54">Pleijel et&#xa0;al., 2008</xref>) with two nucleotide mismatches.</p>
<p>Most PCGs identified in <italic>S. methanicola</italic> used AUG as the start codon, except for three that used AUC as an alternative start codon. Stop codons used in the <italic>S. methanicola</italic> mitogenome included the truncated U&#x2013; stop codon in seven PCGs, UAA in four PCGs, and UAG in NADH dehydrogenase subunit 6 (<italic>nd6</italic>). Analysis of RSCU values of 64 codons showed positive bias for half of the codons (RSCU &gt;1) and negative bias for the other half (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>). UAA, the preferred stop codon over UAG, and amino acids with only two codons showed positive bias for one over the other (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>). More than one codon was preferred for alanine, glycine, leucine, proline, serine, threonine, and valine (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>). Among these, the UCU codon for serine was over-represented with RSCU &gt;1.6 (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>). Underrepresented codons with RSCU &lt;0.6 included CUG for leucine, AUG for methionine, CCG for proline, ACG for threonine, GCG for alanine and AGU and AGG for serine (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>).</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>Relative synonymous codon usage (RSCU) values for each codon of each amino acid in the <italic>S. methanicola</italic> mitogenome. The black horizontal line on each plot marks the RSCU threshold of 1.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g003.tif"/>
</fig>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Phylogenetic analyses</title>
<p>The phylogeny of concatenated 16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic> nucleotide gene sequences from the <italic>S. methanicola</italic> metagenome in relation to sequences available for the family <italic>Hesionidae</italic> (<xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>) was consistent with the most recent published phylogeny (<xref ref-type="bibr" rid="B56">Rouse et&#xa0;al., 2018</xref>). <italic>Sirsoe methanicola</italic> sequences from this study were most closely related to previously published <italic>S. methanicola</italic> sequences (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>), and <italic>S. methanicola</italic> was most closely related to its sister species <italic>S. dalailamai</italic> (<xref ref-type="bibr" rid="B56">Rouse et&#xa0;al., 2018</xref>).</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>Bootstrapped maximum likelihood phylogenetic tree of concatenated 16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic> nucleotide sequences from <italic>S</italic>. <italic>methanicola</italic> metagenomic libraries G/W/G-Mi (red text) in relation to sequences from other species in the family <italic>Hesionidae</italic>. The tree was constructed by MEGAX from the concatenated 1,446-bp alignment from 33 specimens. The tree with the highest log likelihood (&#x2013;10081.68) is shown, with branch lengths indicating the number of substitutions per site. Bootstrap values on tree nodes indicate the percentage of trees, based on 1,000 replicates, in which taxa from a node are clustered together. The outgroup species used was <italic>Dsyponetus caecus</italic> from the polychaete family <italic>Chrysopetalidae</italic>. Accession numbers of these sequences are provided in <xref ref-type="table" rid="T2">
<bold>Table&#xa0;2</bold>
</xref>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g004.tif"/>
</fig>
<p>The phylogeny of amino acid sequences from the <italic>S</italic>. <italic>methanicola</italic> mitogenome in relation to 12 PCGs available for the order <italic>Phyllodocida</italic> showed the <italic>S. methanicola</italic> mitogenome to be most closely related to the mitogenome of <italic>Goniada japonica</italic> (<xref ref-type="bibr" rid="B12">Chen et&#xa0;al., 2016</xref>) from family <italic>Goniadidae</italic> (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>), as predicted by MitoZ (<xref ref-type="bibr" rid="B46">Meng et&#xa0;al., 2019</xref>). Both mitogenomes were placed in a well-supported clade (98% bootstrap confidence) with the mitogenomes of <italic>Glycera capitata</italic> and <italic>Hemipodia simplex</italic> from the family <italic>Glyceridae</italic> (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). Of the 37 <italic>Phyllodocida</italic> mitogenomes, most (n=19) were from the family <italic>Nereididae</italic> (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). The mitogenomes of the <italic>Chrysopetalidae</italic> species <italic>Craseoschema thyasiricola</italic> and <italic>Chrysopetalum debile</italic> did not cluster together on the phylogenetic tree (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). The <italic>Chrysopetalum debile</italic> mitogenome clustered with mitogenomes from the <italic>Hesionidae</italic>-<italic>Goniadidae</italic>-<italic>Glyceridae</italic> clade with only 62% bootstrap confidence (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). Similar to previously reported phylogeny (<xref ref-type="bibr" rid="B11">Cejp et&#xa0;al., 2022</xref>), the <italic>Craseoschema thyasiricola</italic> mitogenome clustered with the mitogenome of <italic>Iheyomytilidicola lauensis</italic> from the family <italic>Nautiliniellidae</italic> with 100% bootstrap confidence (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). Both of these polychaetes are endosymbionts in deep-sea bivalves (<xref ref-type="bibr" rid="B11">Cejp et&#xa0;al., 2022</xref>).</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Bootstrapped maximum likelihood phylogenetic tree of concatenated protein sequences of 12 PCGs from the <italic>S. methanicola</italic> mitogenome (red text) in relation to mitogenomes available from the order <italic>Phyllodocida</italic>. The tree was constructed by MEGAX from the concatenated 2,648-aa alignment from 35 mitochondrial genomes. The tree with the highest log likelihood (&#x2013;62222.03) is shown, with branch lengths indicating the number of substitutions per site. Bootstrap values on tree nodes indicate the percentage of trees, based on 100 replicates, in which taxa from a node are clustered together. The outgroup species used was <italic>Hydroides elegans</italic> from the polychaete order <italic>Sabellida</italic>. Accession numbers of all mitogenome sequences used are provided in <xref ref-type="supplementary-material" rid="SF1">
<bold>Table S1</bold>
</xref>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g005.tif"/>
</fig>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Mapping to <italic>Capitella teleta</italic> genome</title>
<p>Reads from two <italic>S. methanicola</italic> metagenomic libraries (libraries G and W) predominantly mapped to the 28S-5.8S-18S rRNA operon of <italic>C. teleta</italic>. Mapping was also observed to genes encoding structural components (actin and collagen alpha), signaling proteins (enterin neuropeptide, Fc-receptor like 1 homolog, phosphodiesterase 8B homolog, and ankyrin repeat domain-containing protein 26 homolog), RNA-directed DNA polymerase from transposon BS, cilia- and flagella-associated protein 20, putative glycosyltransferase, acidic repeat-containing protein, and hypothetical proteins (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>).</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>log<sub>10</sub>-transformed counts of reads from library G and library W (x-axis) mapped to genes/gene products in the <italic>Capitella teleta</italic> genome (y-axis), as quantified using HTSeq. Zero read counts are represented as grey cells.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g006.tif"/>
</fig>
<p>Using the nucleotide and protein sequences in the <italic>C. teleta</italic> genome as training data, we predicted 79,493 eukaryotic protein-coding genes in the assembled <italic>S. methanicola</italic> metagenomes using WebAUGUSTUS (<xref ref-type="bibr" rid="B23">Hoff and Stanke, 2013</xref>). Of these protein sequences, ghostKOALA (<xref ref-type="bibr" rid="B32">Kanehisa et&#xa0;al., 2016</xref>) assigned taxonomy to 98% (77,530) and KO terms to 35% (28,270), based on mapping to complete genomes or functionally characterized individual protein sequences in the KEGG Genome database (<xref ref-type="bibr" rid="B31">Kanehisa et&#xa0;al., 2020</xref>). Protein sequences were mostly assigned to the KEGG-defined broad taxonomic groups &#x201c;Animals&#x201d; (77%; <xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>), with 2% (1,799) assigned to the phylum Annelida. All Annelid sequences were predicted based on mapping to sequences in the genome of the freshwater leech <italic>Helobdella robusta</italic> from class Clitellata (KEGG accession T0327 and NCBI accession GCF_000326865.1) (<xref ref-type="bibr" rid="B63">Simakov et&#xa0;al., 2013</xref>). Smaller numbers of protein sequences from the metagenomes were assigned to the groups &#x201c;Bacteria&#x201d;, &#x201c;Plants&#x201d;, &#x201c;Fungi&#x201d;, &#x201c;Protists&#x201d;, &#x201c;Archaea&#x201d; and &#x201c;Viruses&#x201d; (<xref ref-type="table" rid="T3">
<bold>Table&#xa0;3</bold>
</xref>).</p>
<table-wrap id="T3" position="float">
<label>Table&#xa0;3</label>
<caption>
<p>Broad taxonomic classification of protein sequences annotated from the combined <italic>S. methanicola</italic> metagenomes by ghostKOALA, based on mapping to complete genomes or functionally characterized individual protein sequences in the KEGG Genome database.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="left">Predicted taxonomic group</th>
<th valign="top" align="center"># protein sequences</th>
<th valign="top" align="center">% protein sequences</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" align="left">Animals</td>
<td valign="top" align="center">61,262</td>
<td valign="top" align="center">77.07</td>
</tr>
<tr>
<td valign="middle" align="left">Bacteria</td>
<td valign="top" align="center">9,681</td>
<td valign="top" align="center">12.18</td>
</tr>
<tr>
<td valign="middle" align="left">Plants</td>
<td valign="top" align="center">2,592</td>
<td valign="top" align="center">3.26</td>
</tr>
<tr>
<td valign="middle" align="left">Fungi</td>
<td valign="top" align="center">1,680</td>
<td valign="top" align="center">2.11</td>
</tr>
<tr>
<td valign="middle" align="left">Protists</td>
<td valign="top" align="center">1,572</td>
<td valign="top" align="center">1.98</td>
</tr>
<tr>
<td valign="middle" align="left">Archaea</td>
<td valign="top" align="center">453</td>
<td valign="top" align="center">0.57</td>
</tr>
<tr>
<td valign="middle" align="left">Viruses</td>
<td valign="top" align="center">280</td>
<td valign="top" align="center">0.35</td>
</tr>
<tr>
<td valign="middle" align="left">Others (not in complete genomes)</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0.01</td>
</tr>
<tr>
<td valign="middle" align="left">Not assigned</td>
<td valign="top" align="center">1,963</td>
<td valign="top" align="center">2.47</td>
</tr>
<tr>
<td valign="middle" align="left">
<bold>Total</bold>
</td>
<td valign="top" align="center">
<bold>79,493</bold>
</td>
<td valign="top" align="center">
<bold>100.00</bold>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Of the Annelid sequences, the most abundant KEGG Orthology term mapped to <italic>H. robusta</italic> from <italic>S. methanicola</italic> metagenomes was innexin (<xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>). Other abundant KEGG Orthology terms were involved in cell adhesion, signaling, the ubiquitin system, metabolism, transport, and other processes (<xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>). KEGG mapper analysis mapping all Annelid KO terms to KEGG pathway modules revealed one complete pathway module associated with eukaryotic C1-unit interconversion (M00141). This module comprised two genes assigned to K00600 (glycine hydroxymethyltransferase) and two genes assigned to K00288 (methylenetetrahydrofolate dehydrogenase (NADP+)/methenyltetrahydrofolate cyclohydrolase/formyltetrahydrofolate synthetase).</p>
<table-wrap id="T4" position="float">
<label>Table&#xa0;4</label>
<caption>
<p>Most abundant KEGG Orthology (KO) terms mapped by ghostKOALA from the <italic>S. methanicola</italic> metagenomes to Annelid sequences in the KEGG Genome database.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="bottom" align="left">KO</th>
<th valign="bottom" align="center">Count</th>
<th valign="bottom" align="center">Name</th>
<th valign="bottom" align="center">Category</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="bottom" align="left">K22037</td>
<td valign="bottom" align="center">32</td>
<td valign="bottom" align="left">Innexin</td>
<td valign="bottom" align="left">Transporters</td>
</tr>
<tr>
<td valign="bottom" align="left">K04437</td>
<td valign="bottom" align="center">17</td>
<td valign="bottom" align="left">Filamin</td>
<td valign="bottom" align="left">Signaling</td>
</tr>
<tr>
<td valign="bottom" align="left">K16498</td>
<td valign="bottom" align="center">17</td>
<td valign="bottom" align="left">Protocadherin delta 1</td>
<td valign="bottom" align="left">Cell adhesion molecules</td>
</tr>
<tr>
<td valign="bottom" align="left">K11997</td>
<td valign="bottom" align="center">15</td>
<td valign="bottom" align="left">Tripartite motif-containing protein 2/3</td>
<td valign="bottom" align="left">Ubiquitin system</td>
</tr>
<tr>
<td valign="bottom" align="left">K00710</td>
<td valign="bottom" align="center">14</td>
<td valign="bottom" align="left">Polypeptide N-acetylgalactosaminyltransferase</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K07380</td>
<td valign="bottom" align="center">9</td>
<td valign="bottom" align="left">Contactin associated protein-like 2</td>
<td valign="bottom" align="left">Cell adhesion molecules</td>
</tr>
<tr>
<td valign="bottom" align="left">K06756</td>
<td valign="bottom" align="center">9</td>
<td valign="bottom" align="left">Neuronal cell adhesion molecule</td>
<td valign="bottom" align="left">Cell adhesion molecules</td>
</tr>
<tr>
<td valign="bottom" align="left">K11536</td>
<td valign="bottom" align="center">9</td>
<td valign="bottom" align="left">Pyrimidine nucleoside transport protein</td>
<td valign="bottom" align="left">Transporters</td>
</tr>
<tr>
<td valign="bottom" align="left">K14165</td>
<td valign="bottom" align="center">8</td>
<td valign="bottom" align="left">Atypical dual specificity phosphatase</td>
<td valign="bottom" align="left">Protein phosphatases and associated proteins</td>
</tr>
<tr>
<td valign="bottom" align="left">K02183</td>
<td valign="bottom" align="center">8</td>
<td valign="bottom" align="left">Calmodulin</td>
<td valign="bottom" align="left">Signaling</td>
</tr>
<tr>
<td valign="bottom" align="left">K01049</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">Acetylcholinesterase</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K11643</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">Chromodomain-helicase-DNA-binding protein 4</td>
<td valign="bottom" align="left">Cancers</td>
</tr>
<tr>
<td valign="bottom" align="left">K13811</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">3&#x2019;-phosphoadenosine 5&#x2019;-phosphosulfate synthase</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K00873</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">Pyruvate kinase</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K21991</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">Protein unc-45</td>
<td valign="bottom" align="left">Chaperones and folding catalysts</td>
</tr>
<tr>
<td valign="bottom" align="left">K20526</td>
<td valign="bottom" align="center">6</td>
<td valign="bottom" align="left">Transgelin</td>
<td valign="bottom" align="left">Membrane trafficking</td>
</tr>
<tr>
<td valign="bottom" align="left">K01298</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">Carboxypeptidase A2</td>
<td valign="bottom" align="left">Pancreatic secretion</td>
</tr>
<tr>
<td valign="bottom" align="left">K00029</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">Malate dehydrogenase (oxaloacetate-decarboxylating)(NADP+)</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K00643</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">5-aminolevulinate synthase</td>
<td valign="bottom" align="left">Metabolism</td>
</tr>
<tr>
<td valign="bottom" align="left">K24048</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">MAGUK p55 subfamily member 2/6</td>
<td valign="bottom" align="left">Signaling</td>
</tr>
<tr>
<td valign="bottom" align="left">K06569</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">Melanoma-associated antigen p97</td>
<td valign="bottom" align="left">Signaling</td>
</tr>
<tr>
<td valign="bottom" align="left">K22078</td>
<td valign="bottom" align="center">5</td>
<td valign="bottom" align="left">Protein-glucosylgalactosylhydroxylysine glucosidase</td>
<td valign="bottom" align="left">Glycosidase</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec id="s3_4_1">
<label>3.4.1</label>
<title>Cytochrome P450 homologs in <italic>S. methanicola</italic>
</title>
<p>Using reference <italic>C. teleta</italic> CYP genomic annotations (<xref ref-type="bibr" rid="B14">Dejong and Wilson, 2014</xref>) to identify potential CYP sequences in <italic>S. methanicola</italic> that may respond to or detoxify PAHs, we identified 42 predicted protein sequences from the <italic>S. methanicola</italic> metagenomes that were homologous (30% to 79% identical) to 37 cytochrome P450 sequences in <italic>C. teleta</italic> (<xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref> and <xref ref-type="supplementary-material" rid="SF2">
<bold>Table S2</bold>
</xref>). In <italic>C. teleta</italic>, expression of both CYP331A1 and CYP4AT1 was shown to increase with exposure to PAHs (<xref ref-type="bibr" rid="B40">Li et&#xa0;al., 2004</xref>). From the <italic>S. methanicola</italic> metagenomes, we identified a 177-aa protein sequence sharing ~38% local sequence identity and ~55% local sequence similarity to CYP331A1. This sequence was part of a 6,485-bp contig encoding only one protein product. The sequence also shared 49% identity to an unnamed protein product of the polychaete <italic>Owenia fusiformis</italic> (NCBI accession: CAH1774988), as well as 50% identity and 69% similarity to CYP 3A29-like sequences from the brachiopod <italic>Linguna</italic> found inhabiting an intertidal zone in Kasari Bay, Japan (NCBI accessions: XP_023933140, XP_013408119, XP_013408125, XP_013408132, XP_013408139, XP_013408146, XP_013408154). We also identified another 63-aa protein sequence with ~37% local sequence identity and ~62% local sequence similarity to CYP4AT1. This sequence was part of a 5,913-bp contig encoding only one protein product. The sequence was 56% identical and 76% similar to a hypothetical protein predicted in the <italic>Helobdella robusta</italic> genome (<xref ref-type="bibr" rid="B63">Simakov et&#xa0;al., 2013</xref>), and 48% identical and 82% similar to an unnamed protein product of <italic>Owenia fusiformis</italic>.</p>
<fig id="f7" position="float">
<label>Figure&#xa0;7</label>
<caption>
<p>Clusters of cytochrome P450 (CYP) protein sequences from <italic>Capitella teleta</italic> compared to those identified in the <italic>S. methanicola</italic> metagenomes, visualized using the igraph R package. Each node represents a protein sequence, and connected nodes represent protein sequences sharing the specified range of global % sequence identity. Protein sequences in the <italic>S. methanicola</italic> metagenomes in contigs &#x2265;3,000 bp contain the prefix b_, while those in contigs &lt;3,000 bp contain the prefix s_. Clusters containing protein sequences similar to CYP331A1 and CYP4AT1 in <italic>C. teleta</italic> are highlighted in blue text. Protein sequences of CYP homologs recovered from the <italic>S. methanicola</italic> metagenomes are listed in <xref ref-type="supplementary-material" rid="SF2">
<bold>Table S2</bold>
</xref>.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmars-09-1067482-g007.tif"/>
</fig>
<p>CYP sequences are considered to be the same family and subfamily if they share 40% and 55% identity, respectively, according to the CYP nomenclature committee (<xref ref-type="bibr" rid="B50">Nelson et&#xa0;al., 1996</xref>). Based on these criteria, both homologs from the <italic>S. methanicola</italic> metagenomes did not belong to the CYP331 and CYP4 families. The protein sequence similar to CYP331A may belong to the CYP3 family, while the other protein sequence similar to CYP4AT1 was too short (66a) for CYP family assignment.</p>
</sec>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<label>4</label>
<title>Discussion</title>
<p>The genomes of deep sea invertebrates are poorly represented in genetic databases (<xref ref-type="bibr" rid="B64">Taylor and Roterman, 2017</xref>). Here, we advance knowledge concerning the genetic repertoire of a rarely studied bristle worm that can be found inhabiting Gulf of Mexico methane hydrates (<xref ref-type="bibr" rid="B18">Fisher et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B5">Becker et&#xa0;al., 2013</xref>). This polychaete has previously been barcoded using 16S rRNA, 18S rRNA, 28S rRNA, and <italic>cox1</italic> genes (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>). In this study, 18S rRNA and 28S rRNA genes, the mitogenome, and certain protein-coding genes were assembled using shotgun metagenomic sequencing of <italic>S. methanicola</italic> gut contents and worm body fragments. The resulting nuclear 18S rRNA and 28S rRNA (<xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>) and mitochondrial 16S rRNA and <italic>cox1</italic> gene sequences were 99% to 100% identical to marker gene sequences previously published for <italic>S. methanicola</italic> (<xref ref-type="bibr" rid="B57">Ruta et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B54">Pleijel et&#xa0;al., 2008</xref>). The phylogeny inferred from the concatenated alignment of these genes (<xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>) was consistent with the most recent phylogeny available for <italic>Hesionidae</italic>, which clustered <italic>S. methanicola</italic> with <italic>S. dalailamai</italic> (<xref ref-type="bibr" rid="B56">Rouse et&#xa0;al., 2018</xref>) and placed all <italic>Sirsoe</italic> species within the hesionid group Psamathinae (<xref ref-type="bibr" rid="B53">Pleijel, 1998</xref>).</p>
<p>The <italic>S. methanicola</italic> mitogenome reported here is the first from the family <italic>Hesionidae</italic> (<xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>). Further polymerase chain reaction (PCR) attempts are needed to finish the <italic>S. methanicola</italic> mitogenome, particularly to verify regions with missing sequences, low coverage, and manually corrected annotations. The mitogenome of <italic>S. methanicola</italic> was found most closely related to the mitogenome of <italic>Goniada japonica</italic> (<xref ref-type="bibr" rid="B12">Chen et&#xa0;al., 2016</xref>) from the family <italic>Goniadidae</italic>, and these were clustered with the mitogenomes of <italic>Glycera capitata</italic> and <italic>Hemipodia simplex</italic> from the family <italic>Glyceridae</italic> (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). Among these <italic>S. methanicola</italic> relatives, only the bristle worm <italic>G</italic>. <italic>capitata</italic> has been reported in the northern Gulf of Mexico at shallow (12&#x2013;18 m) depth (<xref ref-type="bibr" rid="B17">Fauchald et&#xa0;al., 2009</xref>). Other <italic>Goniadidae</italic> and <italic>Glyceridae</italic> species have been found in shallow sediments (<xref ref-type="bibr" rid="B17">Fauchald et&#xa0;al., 2009</xref>) and deep-sea oil platform sediments (<xref ref-type="bibr" rid="B20">Granadosbarba and Solisweiss, 1997</xref>) of the Gulf of Mexico.</p>
<p>The <italic>S. methanicola</italic> mitogenome shared similar features with other reported annelid mitogenomes, including the common usage of AUG as the start codon, as well as the frequent occurrence of truncated U&#x2013; stop codons which may be completed by alternative polyadenylation (<xref ref-type="bibr" rid="B12">Chen et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B11">Cejp et&#xa0;al., 2022</xref>). Additionally, UAA and UAG stop codons found in the <italic>S. methanicola</italic> mitogenome are the most common stop codons in polychaete mitogenomes species (<xref ref-type="bibr" rid="B11">Cejp et&#xa0;al., 2022</xref>). We also identified several codon biases in the <italic>S. methanicola</italic> mitogenome. Previous mitogenomic analysis of the polychaete family <italic>Chrysopetalidae</italic> revealed relaxed selection, particularly in the cytochrome <italic>c</italic> oxidase subunit III (<italic>cox3</italic>) gene, in deep-sea compared to shallow-water species (<xref ref-type="bibr" rid="B11">Cejp et&#xa0;al., 2022</xref>). Similar codon usage comparisons within the family <italic>Hesionidae</italic> could reveal habitat-specific adaptations; however in-depth mitogenome analysis of <italic>S. methanicola</italic> was hampered by the paucity of mitogenomes taxonomically or ecologically related to the species.</p>
<p>This study provides the first functional profile for <italic>S. methanicola</italic>. Annotations of the metagenomic data included genes for putative cell adhesion, signaling, ubiquitin system, metabolism, and transport, as well as genes homologous to innexins and the CYP superfamily (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>, <xref ref-type="table" rid="T4">
<bold>Table&#xa0;4</bold>
</xref>, and <xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref>). Innexins form gap junctions between neurons and are potentially useful for studying annelid phylogeny (<xref ref-type="bibr" rid="B30">Kandarian et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B24">Hughes, 2014</xref>). Using reference sequences from <italic>C. teleta</italic>, we predicted 42 CYP protein sequences in <italic>S. methanicola</italic> (<xref ref-type="fig" rid="f7">
<bold>Figure&#xa0;7</bold>
</xref> and <xref ref-type="supplementary-material" rid="SF2">
<bold>Table S2</bold>
</xref>). Based on sequence identities, none of these sequences were assigned to the same family or subfamily as CYP331A1 and CYP4AT1, whose expression was shown to increase with PAH exposure in <italic>C. teleta</italic> (<xref ref-type="bibr" rid="B40">Li et&#xa0;al., 2004</xref>). Although we hypothesized that the worm may detoxify or consume organic compounds from the environment or its gut (<xref ref-type="bibr" rid="B44">Lim et&#xa0;al., 2022</xref>), further studies are required to validate the expression and functions of the cytochrome P450 protein sequences in <italic>S. methanicola</italic>.</p>
<p>This study provides the first mitogenome, protein-coding gene, and CYP enzyme superfamily annotations for <italic>S. methanicola</italic> living on the surface of a methane hydrate in the Gulf of Mexico. Future sampling will improve the genetic annotations of this poorly understood polychaete species, which has proven difficult to locate in the deep sea. Previous studies on <italic>C. teleta</italic> polychaetes harboring gut microbes had revealed important host-microbiome interface properties relevant to the cycling of environmental compounds (<xref ref-type="bibr" rid="B14">Dejong and Wilson, 2014</xref>; <xref ref-type="bibr" rid="B25">Jang et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B26">Jang et&#xa0;al., 2021</xref>). Our results encourage further comparative studies on the genomic and microbiome adaptations of this deep-sea worm to its unique habitat and how these adaptations contribute to the ecology and nutrient cycling of methane hydrates.</p>
</sec>
<sec id="s5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Material</bold>
</xref>.</p>
</sec>
<sec id="s6" sec-type="author-contributions">
<title>Author contributions</title>
<p>SL conducted bioinformatics analyses and wrote the manuscript, under the guidance of LT. KG conceived this project and led manuscript editing. All authors edited the manuscript and approved the submission.</p>
</sec>
</body>
<back>
<sec id="s7" sec-type="funding-information">
<title>Funding</title>
<p>This research was carried out [in part] under the auspices of the Cooperative Institute for Marine and Atmospheric Studies (CIMAS), a Cooperative Institute of the University of Miami and the National Oceanic and Atmospheric Administration (cooperative agreement # NA20OAR4320472), with support from the NOAA Office of Oceanic and Atmospheric Research&#x2019;s &#x2018;Omics Initiative. LT was supported by award NA21OAR4320190 to the Northern Gulf Institute from NOAA&#x2019;s Office of Oceanic and Atmospheric Research, U.S. Department of Commerce. Sample collection was supported in part by NSF OCE-0527139. MiSeq sequencing was performed through NSF DUE-1323809 that supported a Sequencing Technology Education using Microbial Metagenomes (STEMM) workshop at San Diego State University organized by Dr. Elizabeth Dinsdale. NIH grant 1R21-ES024105 supported preliminary computational analysis.</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>We are indebted to Dr. Terry Gaasterland for orchestrating the HiSeq sequencing and Dr. Craig M. Young for organizing the cruise that provided the methane ice worm samples. We thank Wei Xin of Smith College and Lee Edsall and Nick Smith of the University of California, San Diego (UCSD) for early data analysis efforts. We thank Dr. Michelle Wood (Co-PI on NSF OCE-0527139) for inviting KG on the cruise and for promoting metagenomic research, including this methane ice worm study. We acknowledge Dr. Richard Emlet at University of Oregon for the skilled dissection that provided the Tube A sample. Nucleic acid processing and HiSeq 2000 sequencing efforts were aided by Steve Head and Terry Gelbart at the Scripps Research Institute.</p>
</ack>
<sec id="s8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmars.2022.1067482/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmars.2022.1067482/full#supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
<supplementary-material xlink:href="Table_1.docx" id="SF1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
<supplementary-material xlink:href="Table_2.docx" id="SF2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adachi</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Hasegawa</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>Model of amino acid substitution in proteins encoded by mitochondrial DNA</article-title>. <source>J. Mol. Evol.</source> <volume>42</volume> (<issue>4</issue>), <fpage>459</fpage>&#x2013;<lpage>468</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/BF02498640</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alves</surname> <given-names>P. R.</given-names>
</name>
<name>
<surname>Halanych</surname> <given-names>K. M.</given-names>
</name>
<name>
<surname>Santos</surname> <given-names>C. S. G.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>The phylogeny of nereididae (Annelida) based on mitochondrial genomes</article-title>. <source>Zool. Scripta</source> <volume>49</volume> (<issue>3</issue>), <fpage>366</fpage>&#x2013;<lpage>378</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/zsc.12413</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Anders</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Pyl</surname> <given-names>P. T.</given-names>
</name>
<name>
<surname>Huber</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>HTSeq&#x2013;a Python framework to work with high-throughput sequencing data</article-title>. <source>Bioinformatics</source> <volume>31</volume> (<issue>2</issue>), <fpage>166</fpage>&#x2013;<lpage>169</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btu638</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Arkin</surname> <given-names>A. P.</given-names>
</name>
<name>
<surname>Cottingham</surname> <given-names>R. W.</given-names>
</name>
<name>
<surname>Henry</surname> <given-names>C. S.</given-names>
</name>
<name>
<surname>Harris</surname> <given-names>N. L.</given-names>
</name>
<name>
<surname>Stevens</surname> <given-names>R. L.</given-names>
</name>
<name>
<surname>Maslov</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>KBase: The united states department of energy systems biology knowledgebase</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume> (<issue>7</issue>), <fpage>566</fpage>&#x2013;<lpage>569</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nbt.4163</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Becker</surname> <given-names>E. L.</given-names>
</name>
<name>
<surname>Cordes</surname> <given-names>E. E.</given-names>
</name>
<name>
<surname>Macko</surname> <given-names>S. A.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>R. W.</given-names>
</name>
<name>
<surname>Fisher</surname> <given-names>C. R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Using stable isotope compositions of animal tissues to infer trophic interactions in gulf of Mexico lower slope seep communities</article-title>. <source>PloS One</source> <volume>8</volume> (<issue>12</issue>), <elocation-id>e74459</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0074459</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benson</surname> <given-names>D. A.</given-names>
</name>
<name>
<surname>Cavanaugh</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Clark</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Karsch-Mizrachi</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Ostell</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Pruitt</surname> <given-names>K. D.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>GenBank</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume> (<issue>D1</issue>), <fpage>D41</fpage>&#x2013;<lpage>D47</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkx1094</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bernt</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Donath</surname> <given-names>A.</given-names>
</name>
<name>
<surname>J&#xfc;hling</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Externbrink</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Florentz</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Fritzsch</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>MITOS: Improved <italic>de novo</italic> metazoan mitochondrial genome annotation</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>69</volume> (<issue>2</issue>), <fpage>313</fpage>&#x2013;<lpage>319</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ympev.2012.08.023</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Blake</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Grassle</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Eckelbarger</surname> <given-names>K. J.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>
<italic>Capitella teleta</italic>, a new species designation for the opportunistic and experimental <italic>Capitella</italic> sp. I, with a review of the literature for confirmed records</article-title>. <source>Zoosymposia</source> <volume>2</volume> (<issue>1</issue>), <fpage>25</fpage>&#x2013;<lpage>53</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.11646/zoosymposia.2.1.6</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Camacho</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Coulouris</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Avagyan</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Papadopoulos</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bealer</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>BLAST+: architecture and applications</article-title>. <source>BMC Bioinf.</source> <volume>10</volume>, <elocation-id>421</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Castresana</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</article-title>. <source>Mol. Biol. Evol.</source> <volume>17</volume> (<issue>4</issue>), <fpage>540</fpage>&#x2013;<lpage>552</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a026334</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cejp</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Ravara</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Aguado</surname> <given-names>M. T.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>First mitochondrial genomes of chrysopetalidae (Annelida) from shallow-water and deep-sea chemosynthetic environments</article-title>. <source>Gene</source> <volume>815</volume>, <elocation-id>146159</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.gene.2021.146159</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Guo</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Meng</surname> <given-names>Z.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>The complete mitochondrial genome of the polychaete, <italic>Goniada japonica</italic> (Phyllodocida, goniadidae)</article-title>. <source>Mitochondrial. DNA A DNA Mapp Seq Anal.</source> <volume>27</volume> (<issue>4</issue>), <fpage>2850</fpage>&#x2013;<lpage>2851</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3109/19401736.2015.1053124</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cs&#xe1;rdi</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Nepusz</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>The igraph software package for complex network research</article-title>. <source>InterJournal Complex Syst.</source> <volume>1695</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>.</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dejong</surname> <given-names>C. A.</given-names>
</name>
<name>
<surname>Wilson</surname> <given-names>J. Y.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>The cytochrome P450 superfamily complement (CYPome) in the annelid <italic>Capitella teleta</italic>
</article-title>. <source>PloS One</source> <volume>9</volume> (<issue>11</issue>), <elocation-id>e107728</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0107728</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desbruy&#xe8;res</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Toulmond</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>A new species of hesionid worm, <italic>Hesiocaeca methanicola</italic> sp. nov. (Polychaeta: Hesionidae), living in ice-like methane hydrates in the deep gulf of Mexico</article-title>. <source>Cah. Biol. Mar.</source> <volume>39</volume>, <fpage>93</fpage>&#x2013;<lpage>98</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.21411/CBM.A.BA5D76AF</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dubilier</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Bergin</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Lott</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Symbiotic diversity in marine animals: the art of harnessing chemosynthesis</article-title>. <source>Nat.Rev.Microbiol</source> <volume>6</volume> (<issue>10</issue>), <fpage>725</fpage>&#x2013;<lpage>740</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nrmicro1992</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Fauchald</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Granados-Barba</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Solis-Weiss</surname> <given-names>V.</given-names>
</name>
</person-group> (<year>2009</year>). &#x201c;<article-title>Polychaeta (Annelida) of the gulf of Mexico</article-title>,&#x201d; in <source>Gulf of Mexico origin, waters, and biota: Biodiversity</source>. Eds. <person-group person-group-type="editor">
<name>
<surname>Felder</surname> <given-names>D. L.</given-names>
</name>
<name>
<surname>Camp</surname> <given-names>D. K.</given-names>
</name>
</person-group> (<publisher-loc>College Station, USA</publisher-loc>: <publisher-name>Texas A&amp;M University Press</publisher-name>), <fpage>751</fpage>&#x2013;<lpage>788</lpage>.</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fisher</surname> <given-names>C. R.</given-names>
</name>
<name>
<surname>MacDonald</surname> <given-names>I. R.</given-names>
</name>
<name>
<surname>Sassen</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Young</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Macko</surname> <given-names>S. A.</given-names>
</name>
<name>
<surname>Hourdez</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2000</year>). <article-title>Methane ice worms: <italic>Hesiocaeca methanicola</italic> colonizing fossil fuel reserves</article-title>. <source>Naturwissenschaften</source> <volume>87</volume> (<issue>4</issue>), <fpage>184</fpage>&#x2013;<lpage>187</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s001140050700</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
<name>
<surname>Andreassen</surname> <given-names>M. S. H.</given-names>
</name>
<name>
<surname>Christensen</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Metabolism of the polycyclic aromatic hydrocarbon fluoranthene by the polychaete <italic>Capitella capitata</italic> species I</article-title>. <source>Environ. Toxicol. Chem.</source> <volume>20</volume> (<issue>5</issue>), <fpage>1012</fpage>&#x2013;<lpage>1021</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/etc.5620200511</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Granadosbarba</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Solisweiss</surname> <given-names>V.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>The polychaetous annelids from oil platforms areas in the southeastern gulf of Mexico: Phyllodocidae, glyceridae, goniadidae, hesionidae, and pilargidae, with description of <italic>Ophioglycera lyra</italic>, a new species, and comments on <italic>Goniada distorta</italic> Moore and <italic>Scoloplos texana</italic> maciolek &amp; Holland</article-title>. <source>Proc. Biol. Soc. Washington</source> <volume>110</volume> (<issue>3</issue>), <fpage>457</fpage>&#x2013;<lpage>470</lpage>.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gruber-Vodicka</surname> <given-names>H. R.</given-names>
</name>
<name>
<surname>Seah</surname> <given-names>B. K. B.</given-names>
</name>
<name>
<surname>Pruesse</surname> <given-names>E.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>phyloFlash: rapid small-subunit rRNA profiling and targeted assembly from metagenomes</article-title>. <source>mSystems</source> <volume>5</volume> (<issue>5</issue>), <fpage>e00920</fpage>&#x2013;<lpage>e00920</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/mSystems.00920-20</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hochstein</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Sadowsky</surname> <given-names>M. J.</given-names>
</name>
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>The deposit feeder <italic>Capitella teleta</italic> has a unique and relatively complex microbiome likely supporting its ability to degrade pollutants</article-title>. <source>Sci. Total Environ.</source> <volume>670</volume>, <fpage>547</fpage>&#x2013;<lpage>554</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.scitotenv.2019.03.255</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoff</surname> <given-names>K. J.</given-names>
</name>
<name>
<surname>Stanke</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>WebAUGUSTUS&#x2013;a web service for training AUGUSTUS and predicting genes in eukaryotes</article-title>. <source>Nucleic Acids Res.</source> <volume>41</volume> (<issue>Web Server issue</issue>), <fpage>W123</fpage>&#x2013;<lpage>W128</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkt418</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hughes</surname> <given-names>A. L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Evolutionary diversification of insect innexins</article-title>. <source>J. Insect Sci.</source> <volume>14</volume>, <fpage>1</fpage>&#x2013;<lpage>5</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/jisesa/ieu083</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
<name>
<surname>Sadowsky</surname> <given-names>M. J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Lack of evidence for the role of gut microbiota in PAH biodegradation by the polychaete <italic>Capitella teleta</italic>
</article-title>. <source>Sci. Total Environ.</source> <volume>725</volume>, <elocation-id>138356</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.scitotenv.2020.138356</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Hochstein</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
<name>
<surname>Sadowsky</surname> <given-names>M. J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Bioturbation by the marine polychaete <italic>Capitella teleta</italic> alters the sediment microbial community by ingestion and defecation of sediment particles</article-title>. <source>Sci. Total Environ.</source> <volume>752</volume>, <elocation-id>142239</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.scitotenv.2020.142239</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Zaretskaya</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Raytselis</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Merezhuk</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>McGinnis</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Madden</surname> <given-names>T. L.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>NCBI BLAST: a better web interface</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume> (<issue>Web Server issue</issue>), <fpage>W5</fpage>&#x2013;<lpage>W9</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkn201</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname> <given-names>D. T.</given-names>
</name>
<name>
<surname>Taylor</surname> <given-names>W. R.</given-names>
</name>
<name>
<surname>Thornton</surname> <given-names>J. M.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>The rapid generation of mutation data matrices from protein sequences</article-title>. <source>Comput. Appl. Biosci.</source> <volume>8</volume> (<issue>3</issue>), <fpage>275</fpage>&#x2013;<lpage>282</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/8.3.275</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Joye</surname> <given-names>S. B.</given-names>
</name>
<name>
<surname>Boetius</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Orcutt</surname> <given-names>B. N.</given-names>
</name>
<name>
<surname>Montoya</surname> <given-names>J. P.</given-names>
</name>
<name>
<surname>Schulz</surname> <given-names>H. N.</given-names>
</name>
<name>
<surname>Erickson</surname> <given-names>M. J.</given-names>
</name>
<etal/>
</person-group>. (<year>2004</year>). <article-title>The anaerobic oxidation of methane and sulfate reduction in sediments from gulf of Mexico cold seeps</article-title>. <source>Chem. Geol.</source> <volume>205</volume> (<issue>3</issue>), <fpage>219</fpage>&#x2013;<lpage>238</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.chemgeo.2003.12.019</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kandarian</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Sethi</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Baker</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Yazdani</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Kym</surname> <given-names>E.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>The medicinal leech genome encodes 21 innexin genes: different combinations are expressed by identified central neurons</article-title>. <source>Dev. Genes Evol.</source> <volume>222</volume> (<issue>1</issue>), <fpage>29</fpage>&#x2013;<lpage>44</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s00427-011-0387-z</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanehisa</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Furumichi</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Sato</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Ishiguro-Watanabe</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Tanabe</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>KEGG: integrating viruses and cellular organisms</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume> (<issue>D1</issue>), <fpage>D545</fpage>&#x2013;<lpage>D551</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkaa970</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanehisa</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Sato</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Morishima</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences</article-title>. <source>J. Mol. Biol.</source> <volume>428</volume> (<issue>4</issue>), <fpage>726</fpage>&#x2013;<lpage>731</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.jmb.2015.11.006</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Katoh</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Standley</surname> <given-names>D. M.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability</article-title>. <source>Mol. Biol. Evol.</source> <volume>30</volume> (<issue>4</issue>), <fpage>772</fpage>&#x2013;<lpage>780</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krzywinski</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Schein</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Birol</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Connors</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Gascoyne</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Horsman</surname> <given-names>D.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>Circos: an information aesthetic for comparative genomics</article-title>. <source>Genome Res.</source> <volume>19</volume> (<issue>9</issue>), <fpage>1639</fpage>&#x2013;<lpage>1645</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1101/gr.092759.109</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Stecher</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Knyaz</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Tamura</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>MEGA X: Molecular evolutionary genetics analysis across computing platforms</article-title>. <source>Mol. Biol. Evol.</source> <volume>35</volume> (<issue>6</issue>), <fpage>1547</fpage>&#x2013;<lpage>1549</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msy096</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kvenvolden</surname> <given-names>K. A.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>A review of the geochemistry of methane in natural gas hydrate</article-title>. <source>Organic Geochem.</source> <volume>23</volume> (<issue>11</issue>), <fpage>997</fpage>&#x2013;<lpage>1008</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/0146-6380(96)00002-2</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Langmead</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Salzberg</surname> <given-names>S. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with bowtie 2</article-title>. <source>Nat. Methods</source> <volume>9</volume> (<issue>4</issue>), <fpage>357</fpage>&#x2013;<lpage>359</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lanoil</surname> <given-names>B. D.</given-names>
</name>
<name>
<surname>Sassen</surname> <given-names>R.</given-names>
</name>
<name>
<surname>La Duc</surname> <given-names>M. T.</given-names>
</name>
<name>
<surname>Sweet</surname> <given-names>S. T.</given-names>
</name>
<name>
<surname>Nealson</surname> <given-names>K. H.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Bacteria and archaea physically associated with gulf of Mexico gas hydrates</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>67</volume> (<issue>11</issue>), <fpage>5143</fpage>&#x2013;<lpage>5153</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/AEM.67.11.5143-5153.2001</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Levin</surname> <given-names>L. A.</given-names>
</name>
</person-group> (<year>2005</year>). &#x201c;<article-title>Ecology of cold seep sediments: interactions of fauna with flow, chemistry</article-title>,&#x201d; in <source>Oceanography and marine biology</source>. Eds. <person-group person-group-type="editor">
<name>
<surname>Gibson</surname> <given-names>R. N.</given-names>
</name>
<name>
<surname>Atkinson</surname> <given-names>R. J. A.</given-names>
</name>
<name>
<surname>Gordon</surname> <given-names>J. D. M.</given-names>
</name>
</person-group> (<publisher-loc>Boca Raton, FL, USA</publisher-loc>: <publisher-name>Taylor &amp; Francis</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>46</lpage>.</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Bisgaard</surname> <given-names>H. C.</given-names>
</name>
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Identification and expression of two novel cytochrome P450 genes, belonging to CYP4 and a new CYP331 family, in the polychaete <italic>Capitella capitata</italic> sp.I</article-title>. <source>Biochem. Biophys. Res. Commun.</source> <volume>325</volume> (<issue>2</issue>), <fpage>510</fpage>&#x2013;<lpage>517</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.bbrc.2004.10.066</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Godzik</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>CD-HIT: a fast program for clustering and comparing large sets of protein or nucleotide sequences</article-title>. <source>Bioinformatics</source> <volume>22</volume> (<issue>13</issue>), <fpage>1658</fpage>&#x2013;<lpage>1659</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btl158</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Handsaker</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Wysoker</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Fennell</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Ruan</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Homer</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>The sequence Alignment/Map format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume> (<issue>16</issue>), <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Luo</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Leung</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Ting</surname> <given-names>H. F.</given-names>
</name>
<name>
<surname>Sadakane</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices</article-title>. <source>Methods</source> <volume>102</volume>, <fpage>3</fpage>&#x2013;<lpage>11</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ymeth.2016.02.020</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lim</surname> <given-names>S. J.</given-names>
</name>
<name>
<surname>Thompson</surname> <given-names>L. R.</given-names>
</name>
<name>
<surname>Young</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>Gaasterland</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Goodwin</surname> <given-names>K. D.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Dominance of <italic>Sulfurospirillum</italic> in metagenomes associated with the methane ice worm (<italic>Sirsoe methanicola</italic>)</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>88</volume> (<issue>15</issue>), <elocation-id>e0029022</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/aem.00290-22</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martin</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Cutadapt removes adapter sequences from high-throughput sequencing reads</article-title>. <source>EMBnet Journal</source> <volume>17</volume> (<issue>1</issue>), <page-range>10&#x2013;12</page-range>. doi:&#xa0;<pub-id pub-id-type="doi">10.14806/ej.17.1.200</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>MitoZ: a toolkit for animal mitochondrial genome assembly, annotation and visualization</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume> (<issue>11</issue>), <fpage>e63</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkz173</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mills</surname> <given-names>H. J.</given-names>
</name>
<name>
<surname>Martinez</surname> <given-names>R. J.</given-names>
</name>
<name>
<surname>Story</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Sobecky</surname> <given-names>P. A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Characterization of microbial community structure in gulf of Mexico gas hydrates: comparative analysis of DNA- and RNA-derived clone libraries</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>71</volume> (<issue>6</issue>), <fpage>3235</fpage>&#x2013;<lpage>3247</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1128/AEM.71.6.3235-3247.2005</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="book">
<person-group person-group-type="author">
<collab>National Energy Technology Laboratory</collab>
</person-group> (<year>2017</year>). <source>Methane hydrate science and technology: a 2017 update</source>. (<publisher-loc>U.S. Department of Energy</publisher-loc>). Available at: <uri xlink:href="https://www.netl.doe.gov/sites/default/files/netl-file/2017-Methane-Hydrate-Primer%5B1%5D.pdf">https://www.netl.doe.gov/sites/default/files/netl-file/2017-Methane-Hydrate-Primer%5B1%5D.pdf</uri>.</citation>
</ref>
<ref id="B49">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Nei</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Kumar</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2000</year>). <source>Molecular evolution and phylogenetics</source> (<publisher-loc>New York</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>).</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nelson</surname> <given-names>D. R.</given-names>
</name>
<name>
<surname>Koymans</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Kamataki</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Stegeman</surname> <given-names>J. J.</given-names>
</name>
<name>
<surname>Feyereisen</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Waxman</surname> <given-names>D. J.</given-names>
</name>
<etal/>
</person-group>. (<year>1996</year>). <article-title>P450 superfamily: update on new sequences, gene mapping, accession numbers and nomenclature</article-title>. <source>Pharmacogenetics</source> <volume>6</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>42</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1097/00008571-199602000-00002</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nurk</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Meleshko</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Korobeynikov</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Pevzner</surname> <given-names>P. A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>metaSPAdes: a new versatile metagenomic assembler</article-title>. <source>Genome Res.</source> <volume>27</volume> (<issue>5</issue>), <fpage>824</fpage>&#x2013;<lpage>834</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1101/gr.213959.116</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peng</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Leung</surname> <given-names>H. C.</given-names>
</name>
<name>
<surname>Yiu</surname> <given-names>S. M.</given-names>
</name>
<name>
<surname>Chin</surname> <given-names>F. Y.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>IDBA-UD: a <italic>de novo</italic> assembler for single-cell and metagenomic sequencing data with highly uneven depth</article-title>. <source>Bioinformatics</source> <volume>28</volume> (<issue>11</issue>), <fpage>1420</fpage>&#x2013;<lpage>1428</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/bts174</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pleijel</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Phylogeny and classification of hesionidae (Polychaeta)</article-title>. <source>Zool. Scripta</source> <volume>27</volume> (<issue>2</issue>), <fpage>89</fpage>&#x2013;<lpage>163</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1463-6409.1998.tb00433.x</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pleijel</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Rouse</surname> <given-names>G. W.</given-names>
</name>
<name>
<surname>Ruta</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Wiklund</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Nygren</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>
<italic>Vrijenhoekia balaenophila</italic>, a new hesionid polychaete from a whale fall off California</article-title>. <source>Zool. J. Linn. Soc.</source> <volume>152</volume> (<issue>4</issue>), <fpage>625</fpage>&#x2013;<lpage>634</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1096-3642.2007.00360.x</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pleijel</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Rouse</surname> <given-names>G. W.</given-names>
</name>
<name>
<surname>Sundkvist</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Nygren</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>A partial revision of <italic>Gyptis</italic> (Gyptini, ophiodrominae, hesionidae, aciculata, Annelida), with descriptions of a new tribe, a new genus and five new species</article-title>. <source>Zool. J. Linn. Soc.</source> <volume>165</volume> (<issue>3</issue>), <fpage>471</fpage>&#x2013;<lpage>494</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1096-3642.2012.00819.x</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rouse</surname> <given-names>G. W.</given-names>
</name>
<name>
<surname>Carvajal</surname> <given-names>J. I.</given-names>
</name>
<name>
<surname>Pleijel</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Phylogeny of hesionidae (Aciculata, Annelida), with four new species from deep-sea eastern pacific methane seeps, and resolution of the affinity of <italic>Hesiolyra</italic>
</article-title>. <source>Invertebrate Systemat.</source> <volume>32</volume> (<issue>5</issue>), <fpage>1050</fpage>&#x2013;<lpage>1068</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1071/IS17092</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruta</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Nygren</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Rousset</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Sundberg</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Tillier</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Wiklund</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>2007</year>). <article-title>Phylogeny of <italic>Hesionidae</italic> (Aciculata, polychaeta), assessed from morphology, 18S rDNA, 28S rDNA, 16S rDNA and COI</article-title>. <source>Zool. Scripta</source> <volume>36</volume> (<issue>1</issue>), <fpage>99</fpage>&#x2013;<lpage>107</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1463-6409.2006.00255.x</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sayers</surname> <given-names>E. W.</given-names>
</name>
<name>
<surname>Beck</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Bolton</surname> <given-names>E. E.</given-names>
</name>
<name>
<surname>Bourexis</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Brister</surname> <given-names>J. R.</given-names>
</name>
<name>
<surname>Canese</surname> <given-names>K.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Database resources of the national center for biotechnology information</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume> (<issue>D1</issue>), <fpage>D10</fpage>&#x2013;<lpage>D17</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkaa892</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Selck</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Palmqvist</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Forbes</surname> <given-names>V. E.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Biotransformation of dissolved and sediment-bound fluoranthene in the polychaete, <italic>Capitella</italic> sp. I</article-title>. <source>Environ. Toxicol. Chem.</source> <volume>22</volume> (<issue>10</issue>), <fpage>2364</fpage>&#x2013;<lpage>2374</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1897/02-272</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sharp</surname> <given-names>P. M.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>W. H.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>The codon adaptation index&#x2013;a measure of directional synonymous codon usage bias, and its potential applications</article-title>. <source>Nucleic Acids Res.</source> <volume>15</volume> (<issue>3</issue>), <fpage>1281</fpage>&#x2013;<lpage>1295</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/15.3.1281</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shimabukuro</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Carrerette</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Alfaro-Lucas</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Rizzo</surname> <given-names>A. E.</given-names>
</name>
<name>
<surname>Halanych</surname> <given-names>K. M.</given-names>
</name>
<name>
<surname>Sumida</surname> <given-names>P. Y. G.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Diversity, distribution and phylogeny of hesionidae (Annelida) colonizing whale falls: new species of <italic>Sirsoe</italic> and connections between ocean basins</article-title>. <source>Front. Mar. Sci.</source> <volume>6</volume> (<issue>478</issue>). doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fmars.2019.00478</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sibuet</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Olu</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Biogeography, biodiversity and fluid dependence of deep-sea cold-seep communities at active and passive margins</article-title>. <source>Deep Sea Res. Part II: Topical Stud. Oceanogr.</source> <volume>45</volume> (<issue>1</issue>), <fpage>517</fpage>&#x2013;<lpage>567</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S0967-0645(97)00074-X</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Simakov</surname> <given-names>O.</given-names>
</name>
<name>
<surname>Marletaz</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Cho</surname> <given-names>S.-J.</given-names>
</name>
<name>
<surname>Edsinger-Gonzales</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Havlak</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Hellsten</surname> <given-names>U.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Insights into bilaterian evolution from three spiralian genomes</article-title>. <source>Nature</source> <volume>493</volume> (<issue>7433</issue>), <fpage>526</fpage>&#x2013;<lpage>531</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nature11696</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taylor</surname> <given-names>M. L.</given-names>
</name>
<name>
<surname>Roterman</surname> <given-names>C. N.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Invertebrate population genetics across earth's largest habitat: the deep-sea floor</article-title>. <source>Mol. Ecol.</source> <volume>26</volume> (<issue>19</issue>), <fpage>4872</fpage>&#x2013;<lpage>4896</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/mec.14237</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Tunnicliffe</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Juniper</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Sibuet</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2003</year>). &#x201c;<article-title>Reducing environments of the deep-sea floor</article-title>,&#x201d; in <source>Ecosystems of the world</source>. Ed. <person-group person-group-type="editor">
<name>
<surname>Tyler</surname> <given-names>P. A.</given-names>
</name>
</person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>The Netherlands: Elsevier Science</publisher-name>), <fpage>81</fpage>&#x2013;<lpage>110</lpage>.</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Dover</surname> <given-names>C. L.</given-names>
</name>
<name>
<surname>Aharon</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Bernhard</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Caylor</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Doerries</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Flickinger</surname> <given-names>W.</given-names>
</name>
<etal/>
</person-group>. (<year>2003</year>). <article-title>Blake Ridge methane seeps: characterization of a soft-sediment, chemosynthetically based ecosystem</article-title>. <source>Deep Sea Res. Part I: Oceanogr. Res. Pap.</source> <volume>50</volume> (<issue>2</issue>), <fpage>281</fpage>&#x2013;<lpage>300</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S0967-0637(02)00162-0</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wong</surname> <given-names>E. H.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>D. K.</given-names>
</name>
<name>
<surname>Rabadan</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Peiris</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Poon</surname> <given-names>L. L.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Codon usage bias and the evolution of influenza a viruses. codon usage biases of influenza virus</article-title>. <source>BMC Evol. Biol.</source> <volume>10</volume>, <elocation-id>253</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2148-10-253</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Xin</surname> <given-names>W.</given-names>
</name>
</person-group> (<year>2013</year>). <source>Metagenomics of the methane ice worm, sirsoe methanicola, and its associated microbial community</source> (<publisher-loc>Smith College</publisher-loc>: <publisher-name>Honors Project</publisher-name>).</citation>
</ref>
</ref-list>
</back>
</article>