<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2023.1203780</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Evolutionary dynamics of plastomes in coscinodiscophycean diatoms revealed by comparative genomics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Liu</surname> <given-names>Feng</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/988788/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Yichao</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Huang</surname> <given-names>Hailong</given-names></name>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1687450/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chen</surname> <given-names>Nansheng</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1058005/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences</institution>, <addr-line>Qingdao, Shandong</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Marine Ecology and Environmental Science Laboratory, Laoshan Laboratory</institution>, <addr-line>Qingdao, Shandong</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>Center for Ocean Mega-Science, Chinese Academy of Sciences</institution>, <addr-line>Qingdao, Shandong</addr-line>, <country>China</country></aff>
<aff id="aff4"><sup>4</sup><institution>Chinese Academy of Fishery Sciences</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<aff id="aff5"><sup>5</sup><institution>School of Marine Sciences, Ningbo University</institution>, <addr-line>Ningbo, Zhejiang</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Ondrej Prasil, Academy of Sciences of the Czech Republic (ASCR), Czechia</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Tore Brembu, Norwegian University of Science and Technology, Norway; Ra&#x00FA;l A. Gonz&#x00E1;lez-Pech, The Pennsylvania State University (PSU), United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Feng Liu, <email>liufeng@qdio.ac.cn</email></corresp>
<corresp id="c002">Nansheng Chen, <email>chenns@qdio.ac.cn</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>06</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1203780</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>04</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>06</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Liu, Wang, Huang and Chen.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Liu, Wang, Huang and Chen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>To understand the evolution of coscinodiscophycean diatoms, plastome sequences of six coscinodiscophycean diatom species were constructed and analyzed in this study, doubling the number of constructed plastome sequences in Coscinodiscophyceae (radial centrics). The platome sizes varied substantially in Coscinodiscophyceae, ranging from 119.1 kb of <italic>Actinocyclus subtilis</italic> to 135.8 kb of <italic>Stephanopyxis turris</italic>. Plastomes in Paraliales and Stephanopyxales tended to be larger than those in Rhizosoleniales and Coscinodiacales, which were due to the expansion of the inverted repeats (IRs) and to the marked increase of the large single copy (LSC). Phylogenomic analysis indicated that <italic>Paralia</italic> and <italic>Stephanopyxis</italic> clustered tightly to form the Paraliales-Stephanopyxales complex, which was sister to the Rhizosoleniales-Coscinodiscales complex. The divergence time between Paraliales and Stephanopyxales was estimated at 85 MYA in the middle Upper Cretaceous, indicating that Paraliales and Stephanopyxales appeared later than Coscinodiacales and Rhizosoleniales according to their phylogenetic relationships. Frequent losses of housekeeping protein-coding genes (PCGs) were observed in these coscinodiscophycean plastomes, indicating that diatom plastomes showed an ongoing reduction in gene content during evolution. Two <italic>acpP</italic> genes (<italic>acpP1</italic> and <italic>acpP2</italic>) detected in diatom plastomes were found to be originated from an early gene duplication event occurred in the common progenitor after diatom emergence, rather than multiple independent gene duplications occurring in different lineages of diatoms. The IRs in <italic>Stephanopyxis turris</italic> and <italic>Rhizosolenia fallax</italic>-<italic>imbricata</italic> exhibited a similar trend of large expansion to the small single copy (SSC) and slightly small contraction from the LSC, which eventually led to the conspicuous increase in IR size. Gene order was highly conserved in Coscinodiacales, while multiple rearrangements were observed in Rhizosoleniales and between Paraliales and Stephanopyxales. Our results greatly expanded the phylogenetic breadth in Coscinodiscophyceae and gained novel insights into the evolution of plastomes in diatoms.</p>
</abstract>
<kwd-group>
<kwd>diatom</kwd>
<kwd>plastome</kwd>
<kwd>Coscinodiscophyceae</kwd>
<kwd>gene duplication</kwd>
<kwd>inverted repeats</kwd>
<kwd>phylogenomic analysis</kwd>
</kwd-group>
<counts>
<fig-count count="7"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="63"/>
<page-count count="13"/>
<word-count count="7749"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Aquatic Microbiology</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Diatoms are one of the most successful phytoplankton groups in contemporary oceans, accounting for roughly 20% of the primary productivity on Earth (<xref ref-type="bibr" rid="B11">Falkowski et al., 1998</xref>), as well as being the primary biological mediators of the silica cycle in oceans (<xref ref-type="bibr" rid="B51">Sumper and Brunner, 2008</xref>). Some diatom species are widely used as initial feeding in aquaculture, while others have demonstrated great potential to be important metabolites in the fields of biomedicine, bioenergy and biomaterials (e.g., <xref ref-type="bibr" rid="B18">Hemaiswarya et al., 2011</xref>; <xref ref-type="bibr" rid="B23">Kiatmetha et al., 2011</xref>; <xref ref-type="bibr" rid="B43">Nurachman et al., 2012</xref>). Some diatoms species proliferate rapidly and massively in regional waters, resulting in the development of harmful algal blooms (HABs) which have serious negative impact on economy and the ecological environment (<xref ref-type="bibr" rid="B1">Armbrust, 2009</xref>).</p>
<p>Diatoms are an extraordinarily diverse lineage in evolution and morphology, and it is believed that there are as many as 100,000 extant species around the world (<xref ref-type="bibr" rid="B36">Mann and Droop, 1996</xref>; <xref ref-type="bibr" rid="B37">Mann et al., 2021</xref>). Up to now, more than 18,200 species of diatoms have been effectively described in taxonomy around the world (<xref ref-type="bibr" rid="B15">Guiry and Guiry, 2023</xref>). There is no doubt that a large number of cryptic species have not yet been properly resolved due to their morphological similarities. Current classification system of diatoms based on molecular biological data, combined with morphological features, sexual reproduction, and fossil evidence (<xref ref-type="bibr" rid="B41">Medlin and Kaczmarska, 2004</xref>; <xref ref-type="bibr" rid="B40">Medlin and Desdevises, 2020</xref>), has revealed that the Bacillariophyta phylum could be divided into two subphyla, Bacillariophytina and Coscinodiscophytina. Bacillariophytina harbors two classes, Bacillariophyceae (pennates) and Mediophyceae (polar centrics), harboring 78.7 and 10.2% of diatom species, respectively. Coscinodiscophytina contains a single class Coscinodiscophyceae (radial centrics), which thus far has 1,548 species, occupying 8.5% (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>; <xref ref-type="bibr" rid="B15">Guiry and Guiry, 2023</xref>).</p>
<p>Diatoms are unicellular photosynthetic heterokont algae that are globally distributed in marine and freshwater environments. Diatoms represent a lineage of photosynthetic heterokonts which acquired their chloroplasts from a red algal ancestor by secondary endosymbiosis that took place around one billion years ago (<xref ref-type="bibr" rid="B4">Bhattacharya et al., 2007</xref>), thus their chloroplasts are surrounded by four layers of membranes. Diatom nuclear genomes sequenced thus far harbored genes from a heterotrophic host cell, bacterial donors, and red algal endosymbionts, displaying mosaic nature of their genetic material (<xref ref-type="bibr" rid="B2">Armbrust et al., 2004</xref>; <xref ref-type="bibr" rid="B6">Bowler et al., 2008</xref>).</p>
<p>Nevertheless, in the latest 10 years, the number of diatom chloroplast genomes (plastid genomes, plastomes) increased rather rapidly driven by high throughput DNA sequencing technologies. As of 1st May 2023, more than 150 diatom plastomes from at least 130 species have been deposited in the GenBank database. The sequenced diatom plastomes display a canonical quadripartite organization with two inverted repeats (IRs) between a large single copy (LSC) region and a small single copy (SSC) region, and harbor a core set of 150 - 165 canonical genes (e.g., <xref ref-type="bibr" rid="B24">Kowallik et al., 1995</xref>; <xref ref-type="bibr" rid="B44">Oudot-Le Secq et al., 2007</xref>; <xref ref-type="bibr" rid="B16">Hamsher et al., 2019</xref>). Diatom plastomes show a reduced gene content in comparison with red algal plastomes (230&#x2013;254), indicating that many chloroplast genes have been lost or transferred to the nuclear genome after secondary endosymbiosis (<xref ref-type="bibr" rid="B35">Lommer et al., 2010</xref>; <xref ref-type="bibr" rid="B27">Lang and Nedelcu, 2012</xref>). Meanwhile, some genes were acquired via lateral gene transfer from different sources (e.g., plasmids, bacteria), leading to the expansion of plastome sizes (<xref ref-type="bibr" rid="B7">Brembu et al., 2014</xref>; <xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>). Although they show a high degree of similarity in genome architecture and core gene set (<xref ref-type="bibr" rid="B47">Sabir et al., 2014</xref>; <xref ref-type="bibr" rid="B45">Ruck et al., 2017</xref>; <xref ref-type="bibr" rid="B58">Wang et al., 2022</xref>), diatom plastomes have undergone tremendous alterations in genome size, IR size and gene content, and gene order through evolutionary history of diatoms (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>; <xref ref-type="bibr" rid="B13">Gastineau et al., 2021</xref>). Plastomes of some closely related intrageneric species or even intra-order species displayed highly conserved gene order and genome structure (<xref ref-type="bibr" rid="B47">Sabir et al., 2014</xref>; <xref ref-type="bibr" rid="B16">Hamsher et al., 2019</xref>; <xref ref-type="bibr" rid="B59">Xu et al., 2021</xref>; <xref ref-type="bibr" rid="B63">Zhang and Chen, 2022</xref>).</p>
<p>Thus far, only six plastomes from four genera in two orders (Rhizosoleniales and Coscinodiacales) in coscinodiscophycean diatoms have been sequenced (<xref ref-type="table" rid="T1">Table 1</xref>). Considering the rich morphological diversity of coscinodiscophycean diatom species, the important evolutionary status of this class which represents a basal lineage in the evolution of early diatoms, and their global distribution and important ecological significance, it is very necessary to fill this gap to understand their evolution trend and relationships. This study focused on the evolution of plastomes in coscinodiscophycean diatoms. Plastomes from six coscinodiscophycean diatom species including <italic>Guinardia delicatula</italic>, <italic>Guinardia striata</italic>, <italic>Actinocyclus</italic> sp., <italic>Coscinodiscus granii</italic>, <italic>Stephanopyxis turris</italic>, and <italic>Paralia sulcate</italic>, which were collected in the Jiaozhou Bay of China, were constructed in this study and compared with plastome sequences deposited in the GenBank database to ascertain their evolutionary dynamics.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>General features of 12 diatom plastomes in Coscinodiscophyceae for comparative analysis.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Species</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Order</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Accession<break/> number</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Size<break/> (bp)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">G + C (%)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">LSC<break/> (bp)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">SSC<break/> (bp)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">IR<break/>(bp)</td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">Canonical genes<xref ref-type="table-fn" rid="t1fns2">&#x002A;&#x002A;</xref></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;">References</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"><bold>PCGs/rRNAs/tRNAs/sRNAs</bold></td>
<td valign="top" align="center" style="color:#ffffff;background-color: #7f8080;"></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Guinardia delicatula</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="OM827252">OM827252</ext-link></td>
<td valign="top" align="center">123,772</td>
<td valign="top" align="center">32.39</td>
<td valign="top" align="center">61,109</td>
<td valign="top" align="center">39,097</td>
<td valign="top" align="center">11,783</td>
<td valign="top" align="center">130/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Guinardia striata</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="OM827251">OM827251</ext-link></td>
<td valign="top" align="center">121,778</td>
<td valign="top" align="center">31.68</td>
<td valign="top" align="center">59,939</td>
<td valign="top" align="center">38,657</td>
<td valign="top" align="center">11,591</td>
<td valign="top" align="center">130/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Guinardia striata</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="MG755796">MG755796</ext-link></td>
<td valign="top" align="center">122,145</td>
<td valign="top" align="center">32.26</td>
<td valign="top" align="center">59,710</td>
<td valign="top" align="center">38,869</td>
<td valign="top" align="center">11,783<xref ref-type="table-fn" rid="t1fns1">&#x002A;</xref></td>
<td valign="top" align="center">129/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B61">Yu et al., 2018</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Rhizosolenia setigera</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="MG755793">MG755793</ext-link></td>
<td valign="top" align="center">121,011</td>
<td valign="top" align="center">32.17</td>
<td valign="top" align="center">58,541</td>
<td valign="top" align="center">38,332</td>
<td valign="top" align="center">12,069</td>
<td valign="top" align="center">129/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B61">Yu et al., 2018</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Rhizosolenia fallax</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="MG755802">MG755802</ext-link></td>
<td valign="top" align="center">125,283</td>
<td valign="top" align="center">30.20</td>
<td valign="top" align="center">59,165</td>
<td valign="top" align="center">28,184</td>
<td valign="top" align="center">18,967</td>
<td valign="top" align="center">122/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B61">Yu et al., 2018</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Rhizosolenia imbricata</italic></td>
<td valign="top" align="center">Rhizosoleniales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="KJ958482">KJ958482</ext-link></td>
<td valign="top" align="center">120,956</td>
<td valign="top" align="center">31.76</td>
<td valign="top" align="center">61,244</td>
<td valign="top" align="center">27,482</td>
<td valign="top" align="center">16,115</td>
<td valign="top" align="center">123/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B47">Sabir et al., 2014</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Actinocyclus</italic> sp.</td>
<td valign="top" align="center">Coscinodiacales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="OM827248">OM827248</ext-link></td>
<td valign="top" align="center">120,465</td>
<td valign="top" align="center">31.04</td>
<td valign="top" align="center">59,224</td>
<td valign="top" align="center">38,731</td>
<td valign="top" align="center">11,255</td>
<td valign="top" align="center">129/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Actinocyclus subtilis</italic></td>
<td valign="top" align="center">Coscinodiacales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="MG755799">MG755799</ext-link></td>
<td valign="top" align="center">119,120</td>
<td valign="top" align="center">29.42</td>
<td valign="top" align="center">59,040</td>
<td valign="top" align="center">38,042</td>
<td valign="top" align="center">11,019</td>
<td valign="top" align="center">130/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B61">Yu et al., 2018</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Coscinodiscus granii</italic></td>
<td valign="top" align="center">Coscinodiacales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="MW561225">MW561225</ext-link></td>
<td valign="top" align="center">123,615</td>
<td valign="top" align="center">31.18</td>
<td valign="top" align="center">60,117</td>
<td valign="top" align="center">37,498</td>
<td valign="top" align="center">13,000</td>
<td valign="top" align="center">131/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Coscinodiscus radiatus</italic></td>
<td valign="top" align="center">Coscinodiacales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="KC509521">KC509521</ext-link></td>
<td valign="top" align="center">122,213</td>
<td valign="top" align="center">30.41</td>
<td valign="top" align="center">60,402</td>
<td valign="top" align="center">36,643</td>
<td valign="top" align="center">12,584</td>
<td valign="top" align="center">131/3/27/2</td>
<td valign="top" align="center"><xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Paralia sulcata</italic></td>
<td valign="top" align="center">Paraliales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="OM827250">OM827250</ext-link></td>
<td valign="top" align="center">132,157</td>
<td valign="top" align="center">30.97</td>
<td valign="top" align="center">68,214</td>
<td valign="top" align="center">36,673</td>
<td valign="top" align="center">13,635</td>
<td valign="top" align="center">128/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Stephanopyxis turris</italic></td>
<td valign="top" align="center">Stephanopyxales</td>
<td valign="top" align="center"><ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="OM827249">OM827249</ext-link></td>
<td valign="top" align="center">135,791</td>
<td valign="top" align="center">30.37</td>
<td valign="top" align="center">68,199</td>
<td valign="top" align="center">27,372</td>
<td valign="top" align="center">20,110</td>
<td valign="top" align="center">128/3/27/2</td>
<td valign="top" align="center">This study</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t1fns1"><p>&#x002A;One of IR gene clusters was inverted in the <italic>Guinardia striata</italic> plastome (MG755796), which resulted in two IR gene clusters being arranged in a forward direction instead of a reverse direction.</p></fn>
<fn id="t1fns2"><p>&#x002A;&#x002A;Multicopy genes were counted once, e.g., the canonical genes located in IRs as well as the tRNA with duplication (<xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>).</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="S2.SS1">
<title>Sampling and isolation of diatoms</title>
<p>Water samples were collected in Jiaozhou Bay (36&#x00B0;00&#x2032;&#x2013;36&#x00B0;12&#x2032;N, 120&#x00B0;10&#x2032;&#x2013;120&#x00B0;24&#x2032;E) of China from August 2020 to December 2021 onboard the R/V Chuangxin, which was operated by the Jiaozhou Bay National Marine Ecosystem Research Station. Single cells of each diatom species were isolated using single-cell capillary methods from water samples. Six unialgal diatom strains (CNS00558, CNS00513, CNS00114, CNS00554, CNS00378, and CNS00428) were successfully cultured in L1 medium with 1 &#x2030; volume fraction Na<sub>2</sub>SiO<sub>3</sub>. The culture was maintained at 18&#x2013;20&#x00B0;C, 2,000&#x2013;3,000 Lux in the photoperiod of 12 h light-12 h dark.</p>
</sec>
<sec id="S2.SS2">
<title>DNA extraction, and sequencing</title>
<p>Total genomic DNA for each diatom strain was extracted using the DNAsecure Plant Kit (Tiangen Biotech, Beijing, China). After purification, genomic DNA samples were fragmented into a size of 350 bp using Covaris S220 ultrasonic crater (Covaris, USA) for library construction. The DNA libraries were sequenced using a NovaSeq 6000 platform (Illumina, San Diego, CA, USA), yielding about 5-Gb sequencing data of paired-end reads with 150 bp in length. Illumina sequencing raw data were trimmed using Trimmomatic v0.39 with the parameters: LEADING:3 TRAILING:3 SLIDING WINDOW:4:15 MINLEN:75 (<xref ref-type="bibr" rid="B5">Bolger et al., 2014</xref>).</p>
</sec>
<sec id="S2.SS3">
<title>Identification of coscinodiscophycean diatom species</title>
<p>Species identification of six coscinodiscophycean diatoms was performed based on comparative analysis of their full-length 18S rDNA sequences as well as their morphological characteristics (<xref ref-type="fig" rid="F1">Figure 1</xref>). Full-length 18S rDNA sequences of these diatom strains were assembled using Illumina sequencing results using the GetOrganelle v1.7.4.1 (<xref ref-type="bibr" rid="B19">Jin et al., 2020</xref>) and SPAdes v3.14.0 (<xref ref-type="bibr" rid="B3">Bankevich et al., 2012</xref>). The 18S rDNA sequences of coscinodiscophycean diatoms downloaded from the GenBank database were used as reference sequences (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). Multiple sequence alignments of 18S rDNA sequences were conducted by using ClustalX v1.83 with the default settings (<xref ref-type="bibr" rid="B54">Thompson et al., 1997</xref>). Evolutionary relationships were evaluated based on the analysis of the similarity of the 18S rDNA sequences using MEGA v7.0 (<xref ref-type="bibr" rid="B26">Kumar et al., 2016</xref>). 18S rDNA sequences of strains CNS00558, CNS00513, CNS00554, CNS00378, and CNS00428 displayed very high similarities (&#x003E;99.6%) with the reference sequences deposited in the GenBank database, respectively (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>). According to molecular data and their morphological characteristics, these five strains were identified as <italic>Guinardia delicatula</italic>, <italic>Guinardia striata</italic>, <italic>Coscinodiscus granii</italic>, <italic>Stephanopyxis turris</italic>, and <italic>Paralia sulcate</italic>, respectively. Strain CNS00114 shared the highest similarity with <italic>Actinocyclus</italic> sp. (X85401), reaching 98.86% (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>), suggesting that this strain represents an undescribed species in <italic>Actinocyclus</italic>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Micrographs of six diatom species in class Coscinodiscophyceae. <bold>(A)</bold> <italic>Guinardia delicatula</italic>. <bold>(B)</bold> <italic>Guinardia striata</italic>. <bold>(C)</bold> <italic>Actinocyclus</italic> sp. <bold>(D)</bold> <italic>Coscinodiscus granii</italic>. <bold>(E)</bold> <italic>Stephanopyxis turris</italic>. <bold>(F)</bold> <italic>Paralia sulcate</italic>. Bar = 20 &#x03BC;m.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g001.tif"/>
</fig>
</sec>
<sec id="S2.SS4">
<title>Construction and annotation of plastomes</title>
<p>Clean reads were used to assemble complete plastome sequences using GetOrganelle v1.7.4.1 (<xref ref-type="bibr" rid="B19">Jin et al., 2020</xref>). To verify the completeness of these plastomes, GetOrganelle generally exported consistent assembly results with the same raw reads using different parameters, when a complete plastome was obtained (<xref ref-type="bibr" rid="B19">Jin et al., 2020</xref>). Whether an assembled plastome was circular was further confirmed by aligning short reads against the assembled plastome sequence using the MEM algorithm of BWA v0.7.17 (<xref ref-type="bibr" rid="B28">Li and Durbin, 2009</xref>) and SAMtools v1.10 (<xref ref-type="bibr" rid="B29">Li et al., 2009</xref>). Circular plastome was supported by perfect alignments of short reads along the entire length of assembled plastome sequence. The alignment was visualized using IGV v2.7.2 (<xref ref-type="bibr" rid="B55">Thorvaldsdottir et al., 2013</xref>). Annotation was conducted using MFannot<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> and NCBI&#x2019;s ORF Finder,<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> which was further improved using NCBI&#x2019;s Sequin v15.10. For best accuracy of comparative analysis, we had checked and re-annotated coscinodiscophycean diatom plastomes deposited in the GenBank database.</p>
</sec>
<sec id="S2.SS5">
<title>Phylogenomic and synteny analysis</title>
<p>Due to the absence of four protein-coding genes (PCGs) (<italic>bas1</italic>, <italic>ycf88</italic>, <italic>ycf89</italic>, and <italic>ycf90</italic>) in <italic>Triparma laevis</italic> (Bolidophyceae, Ochrophyta) plastome when compared with coscinodiscophycean plastomes, a total of 116 PCGs from 12 coscinodiscophycean plastomes as well as <italic>Triparma laevis</italic> plastome (<xref ref-type="bibr" rid="B52">Tajima et al., 2016</xref>) were extracted and concatenated for phylogenomic analysis. The amino acid (AA) sequences of each PCG were individually aligned and checked using MAFFT v7.471 (<xref ref-type="bibr" rid="B22">Katoh and Standley, 2013</xref>), and ambiguously aligned regions were trimmed by trimAl v1.4 (<xref ref-type="bibr" rid="B9">Capella-Gutierrez et al., 2009</xref>). These AA datasets were concatenated using PhyloSuite v1.2.2 (<xref ref-type="bibr" rid="B62">Zhang et al., 2020</xref>). The best-fit model was tested and identified using ModelFinder (<xref ref-type="bibr" rid="B21">Kalyaanamoorthy et al., 2017</xref>). Phylogenomic tree was constructed by IQ-TREE v1.6.12 (<xref ref-type="bibr" rid="B56">Trifinopoulos et al., 2016</xref>) using default parameters with 5,000 ultrafast bootstrap analysis (<xref ref-type="bibr" rid="B42">Minh et al., 2013</xref>). <italic>T. laevis</italic> was used as the outgroup. The AA sequences of AcpP (AcpP1 and AcpP2) were aligned by ClustalX v1.83 with default settings (<xref ref-type="bibr" rid="B54">Thompson et al., 1997</xref>). The phylogenetic relationships were inferred with the Maximum Likelihood (ML) method based on the JTT matrix-based model (<xref ref-type="bibr" rid="B20">Jones et al., 1992</xref>) using MEGA v7.0 (<xref ref-type="bibr" rid="B26">Kumar et al., 2016</xref>). There were 113 positions in the final dataset of AcpP. Synteny analysis of plastomes was performed using the progressiveMauve software of the package Mauve v2.3.1 (<xref ref-type="bibr" rid="B10">Darling et al., 2010</xref>). Comparative illustration of coscinodiscophycean plastomes was conducted using circos v0.69 (<xref ref-type="bibr" rid="B25">Krzywinski et al., 2009</xref>).</p>
</sec>
<sec id="S2.SS6">
<title>Divergence time estimation</title>
<p>Phylogenetic relationship and molecular dating were analyzed by calculating the codon evolution rate of nucleotide (nt) sequences of 109 PCGs which were shared by plastomes of selected diatom species and <italic>Ectocarpus siliculosus</italic> (Phaeophyceae, Ochrophyta). <italic>E. siliculosus</italic> was used as outgroup with its known fossil time. The nt sequences were aligned using MAFFT v7.471 and concatenated using PhyloSuite v1.2.2 (<xref ref-type="bibr" rid="B62">Zhang et al., 2020</xref>). The phylogenetic tree was constructed using IQ-TREE v1.6.12 (<xref ref-type="bibr" rid="B56">Trifinopoulos et al., 2016</xref>), and molecular dating was conducted using the PAML package v4.8a (<xref ref-type="bibr" rid="B60">Yang, 2007</xref>). Estimation of substitution rate was performed using baseml, and estimation of divergence times with the approximate likelihood method was carried out using mcmctree. The phylogenetic tree was displayed in Figtree v1.4.3 and visualized with 95% highest posterior density interval (HPD) for each node. Three calibrations of internal nodes, including <italic>E. siliculosus</italic> at 176&#x2013;202 MYA (<xref ref-type="bibr" rid="B38">Matari and Blair, 2014</xref>), <italic>R. setigera</italic> at 90&#x2013;93 MYA (<xref ref-type="bibr" rid="B49">Sinninghe-Damst&#x00E9; et al., 2004</xref>), and Thalassiosirales at 40&#x2013;50 MYA (<xref ref-type="bibr" rid="B48">Sims et al., 2006</xref>), were conducted in divergence time estimation.</p>
</sec>
</sec>
<sec id="S3" sec-type="results|discussion">
<title>Results and discussion</title>
<sec id="S3.SS1">
<title>Molecular features of coscinodiscophycean plastomes</title>
<p>Complete plastome sequences of six coscinodiscophycean diatom species which exhibited rich diversity in cell morphology and cell size (<xref ref-type="fig" rid="F1">Figure 1</xref>) were successfully assembled. These plastomes ranged in size from 120.5 kb in <italic>A. curvatulus</italic> to 135.8 kb in <italic>S. turris</italic>, and their G + C content was from 30.37% in <italic>S. turris</italic> to 32.39% in <italic>G. delicatula</italic> (<xref ref-type="table" rid="T1">Table 1</xref>), which were in the range of reported diatom plastomes (<xref ref-type="bibr" rid="B45">Ruck et al., 2017</xref>; <xref ref-type="bibr" rid="B33">Liu et al., 2021a</xref>,<xref ref-type="bibr" rid="B34">b</xref>; <xref ref-type="bibr" rid="B17">He et al., 2022</xref>; <xref ref-type="bibr" rid="B58">Wang et al., 2022</xref>). Similar to the reported diatom plastomes (e.g., <xref ref-type="bibr" rid="B24">Kowallik et al., 1995</xref>; <xref ref-type="bibr" rid="B44">Oudot-Le Secq et al., 2007</xref>; <xref ref-type="bibr" rid="B53">Tanaka et al., 2011</xref>; <xref ref-type="bibr" rid="B16">Hamsher et al., 2019</xref>), these newly assembled plastomes were mapped as canonical circular quadripartite structures with two IRs separating LSC and SSC. The conserved <italic>psbD-psbC</italic> overlapping region was detected in these coscinodiscophycean plastomes, but also in plastomes of Bacillariophyta and Ochrophyta (<xref ref-type="bibr" rid="B24">Kowallik et al., 1995</xref>; <xref ref-type="bibr" rid="B32">Liu et al., 2017</xref>). However, comparative analysis reveals that the <italic>psbD-psbC</italic> overlapping regions in the <italic>Stephanopyxis</italic> and <italic>Paralia</italic> plastomes reduced from the original 53 to 17 bp (GTGGAAACGCCCTTTAA), due to a single base mutation (T&#x2192;C) which should have occurred in the common progenitor of <italic>Stephanopyxis</italic> and <italic>Paralia</italic>, and <italic>psbC</italic> started with GTG instead of ATG.</p>
<p>Combined the other six plastomes of coscinodiscophycean diatom species deposited in the GenBank database, we found that these 12 plastomes represented four orders in the class Coscinodiscophyceae (<xref ref-type="table" rid="T1">Table 1</xref>), which enabled us to obtain valuable evolutionary clues. The 135.8-kb <italic>Stephanopyxis</italic> plastome was the largest one found in Coscinodiscophyceae thus far, followed by the 132.2-kb <italic>Paralia</italic> plastome, and then the plastomes in Rhizosoleniales and Coscinodiacales. There is no significant difference in plastome sizes between Rhizosoleniales and Coscinodiacales (<xref ref-type="table" rid="T1">Table 1</xref>). Plastomes in Paraliales and Stephanopyxales tended to be larger than those in Rhizosoleniales and Coscinodiacales (<xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>; <xref ref-type="bibr" rid="B47">Sabir et al., 2014</xref>; <xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>), which was caused not only by the expansion of the IR but also by the marked increase of the LSC. The size of LSC in Paraliales and Stephanopyxales is at least 8.0 kb larger than that in Rhizosoleniales and Coscinodiacales (<xref ref-type="table" rid="T1">Table 1</xref>). Numerous expanded intergenic regions were caused by accepting foreign DNA fragments in the <italic>Stephanopyxis</italic> and <italic>Paralia</italic> plastomes, which made them less compact when compared with plastomes in Rhizosoleniales and Coscinodiacales. However, it is difficult to trace the exact origin of some foreign fragments detected in diatom plastomes thus far, although a small fraction has been identified to be from diatom plasmids (<xref ref-type="bibr" rid="B7">Brembu et al., 2014</xref>; <xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>).</p>
</sec>
<sec id="S3.SS2">
<title>Divergence time estimation of coscinodiscophycean plastomes</title>
<p>Phylogenomic analysis based on AA sequences of 116 PCGs shared by the 12 coscinodiscophycean plastomes as well as the <italic>T. laevis</italic> plastome (<xref ref-type="bibr" rid="B52">Tajima et al., 2016</xref>) as the outgroup revealed that the sampled coscinodiscophycean diatom species were grouped into four clades representing four orders (<xref ref-type="fig" rid="F2">Figure 2</xref>). <italic>Guinardia</italic> and <italic>Rhizosolenia</italic> were recovered as the monophyletic clade Rhizosoleniales. In Rhizosoleniales, <italic>R. setigera</italic> was sister to <italic>Guinardia</italic> plus the left <italic>Rhizosolenia</italic> with high bootstrap support (100%), which shows that <italic>R. setigera</italic> may represent a hitherto undescribed genus (<italic>pseudo-Rhizosolenia</italic>) independent of <italic>Guinardia</italic> and <italic>Rhizosolenia</italic>. The controversial results that intraspecific genetic distance in <italic>G. striata</italic> (OM827251 and MG755796) exceeds interspecific genetic distance between <italic>G. striata</italic> and <italic>G. delicatula</italic> indicated that the <italic>G. striata</italic> strain (OM827251) we identified is not the same species as that (MG755796). More work needs to be carried out to reveal whether there are cryptic species in <italic>Guinardia. Actinocyclus</italic> and <italic>Coscinodiscus</italic> clustered together to form the clade Coscinodiacales which was sister to Rhizosoleniales. <italic>Paralia</italic> and <italic>Stephanopyxis</italic>, which represented two distinct orders, clustered tightly to form the Paraliales-Stephanopyxales complex. The Paraliales-Stephanopyxales complex was sister to Rhizosoleniales-Coscinodiscales complex with 100% bootstrap support (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Maximum Likelihood (ML) phylogenomic tree of coscinodiscophycean diatom species based on concatenated amino acid sequences of 116 PCGs shared by plastomes of these diatoms as well as <italic>Triparma laevis</italic> as outgroup. Numbers at the branches represented bootstrap values. Branch lengths were proportional to the amount of sequence change, which were indicated by the scale bar below the trees.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g002.tif"/>
</fig>
<p>A molecular dating tree was constructed to evaluate the divergence time in diatom lineages based on nt sequences of 109 PCGs shared by the plastomes of selected diatoms and <italic>E. siliculosus</italic>. Our results showed that ancient diatoms (Bacillariophyta) were estimated to appear at about 186 MYA, which was within the range of previous reports (<xref ref-type="bibr" rid="B50">Sorhannus, 2007</xref>; <xref ref-type="bibr" rid="B39">Medlin, 2015</xref>), while the Coscinodiscophyceae was separated from the Bacillariophyceae-Mediophyceae complex at about 142 MYA in the early Lower Cretaceous (<xref ref-type="fig" rid="F3">Figure 3</xref>). These coscinodiscophycean diatom species were well recovered as a monophyletic clade (<xref ref-type="fig" rid="F3">Figure 3</xref>). In Coscinodiscophyceae, the Paraliales-Stephanopyxales complex split at 124 MYA, followed by Coscinodiacales and Rhizosoleniales that split 114 MYA (<xref ref-type="fig" rid="F3">Figure 3</xref>). The divergence time between Paraliales and Stephanopyxales was estimated at 85 MYA in the middle Upper Cretaceous, which was consistent with the time reported previously (<xref ref-type="bibr" rid="B50">Sorhannus, 2007</xref>). These results emphasize that Paraliales and Stephanopyxales appeared later than Coscinodiacales and Rhizosoleniales. In Coscinodiacales, <italic>Actinocyclus</italic> and <italic>Coscinodiscus</italic> split at about 101 MYA. In Rhizosoleniales, <italic>R. setigera</italic> emerged first at about 92 MYA and then the split between <italic>Guinardia</italic> and other <italic>Rhizosolenia</italic> occurred at 84 MYA.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Time-calibrated phylogenetic analysis based on nucleotide sequences of 109 PCGs shared by the plastomes of selected diatom species as well as <italic>Ectocarpus siliculosus</italic> as outgroup. The fossil calibration taxa were indicated by red dots at corresponding nodes. Horizontal bars in blue color represented the 95% highest posterior density (HPD) values of the estimated divergence time.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g003.tif"/>
</fig>
</sec>
<sec id="S3.SS3">
<title>Variations of gene repertoires in coscinodiscophycean plastomes</title>
<p>Gene repertoires in coscinodiscophycean plastomes showed substantial variations, ranging from 154 genes in <italic>R. fallax</italic> to 163 genes in two <italic>Coscinodiscus</italic> plastomes (<xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>). Comparative analysis revealed that 120 PCGs, three ribosomal RNA genes (rRNAs), 27 transfer RNA genes (tRNAs) and two regulatory small RNA genes (sRNAs) were shared by these coscinodiscophycean plastomes, with 13 PCGs lost to various degrees in different lineages of Coscinodiscophyceae (<xref ref-type="fig" rid="F4">Figure 4</xref> and <xref ref-type="table" rid="T2">Table 2</xref>), which is most likely due to their horizontal transfer to corresponding nuclear genomes (<xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>; <xref ref-type="bibr" rid="B33">Liu et al., 2021a</xref>,<xref ref-type="bibr" rid="B34">b</xref>). Twenty-seven tRNAs were sufficient for messenger RNA translation in diatom plastomes which contained one more tRNA (<italic>trnR3</italic>) than the plastomes of brown algae (<xref ref-type="bibr" rid="B32">Liu et al., 2017</xref>). In addition to two copies of <italic>trnN(guu)</italic> located at IRa and IRb, respectively, another perfect copy of <italic>trnN(guu)</italic> which shared the same sequence as the formers was situated in SSC of the <italic>Stephanopyxis</italic> plastome, but was absent in other coscinodiscophycean plastomes, indicating that the duplication of <italic>trnN(guu)</italic> occurred only in the <italic>Stephanopyxis</italic> plastome. Two sRNAs, <italic>ffs</italic> and <italic>ssrA</italic>, were conservatively located in the adjacent upstream (5&#x2032;) region of <italic>psbX</italic> and adjacent downstream (3&#x2032;) region of <italic>trnR3(ccg)</italic>, respectively. <italic>ffs</italic> is a signal-recognition particle RNA that participates in the transmembrane transport of nuclear-encoded genes with plastid localization, and <italic>ssrA</italic> is a small regulatory RNA that interacts with stalled ribosomes to resume translation (<xref ref-type="bibr" rid="B12">Galachyants et al., 2012</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Changes in gene content of PCGs in the 12 coscinodiscophycean plastomes. The Maximum Likelihood (ML) tree on the left showed their phylogenetic relationships, and the matrix on the right showed the presence (blue) and absence (white) of 16 PCGs.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g004.tif"/>
</fig>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Genes identified in these 12 diatom plastomes in Coscinodiscophyceae.</p></caption>
<table cellspacing="5" cellpadding="5" frame="box" rules="all">
<thead>
<tr>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Functional classification</td>
<td valign="top" align="left" style="color:#ffffff;background-color: #7f8080;">Genes<xref ref-type="table-fn" rid="t2fns1">&#x002A;</xref></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="2" style="background-color: #dcdcdc;"><bold>Protein-coding genes (PCGs: 133)<xref ref-type="table-fn" rid="t2fns2">&#x002A;&#x002A;</xref></bold></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Transcription and translation (52)</td>
<td valign="top" align="left"><italic>rpl1</italic>, <italic>rpl2</italic>, <italic>rpl3</italic>, <italic>rpl4</italic>, <italic>rpl5</italic>, <italic>rpl6</italic>, <italic>rpl11</italic>, <italic>rpl12</italic>, <italic>rpl13</italic>, <italic>rpl14</italic>, <italic>rpl16</italic>, <italic>rpl18</italic>, <italic>rpl19</italic>, <italic>rpl20</italic>, <italic>rpl21</italic>, <italic>rpl22</italic>, <italic>rpl23</italic>, <italic>rpl24</italic>, <italic>rpl27</italic>, <italic>rpl29</italic>, <italic>rpl31</italic>, <italic>rpl32</italic>, <italic><underline>rpl33</underline></italic>, <italic>rpl34</italic>, <italic>rpl35</italic>, <italic><underline>rpl36</underline></italic>, <italic>rps2</italic>, <italic>rps3</italic>, <italic>rps4</italic>, <italic>rps5</italic>, <italic>rps6</italic>, <italic>rps7</italic>, <italic>rps8</italic>, <italic>rps9</italic>, <italic>rps10</italic>, <italic>rps11</italic>, <italic>rps12</italic>, <italic>rps13</italic>, <italic>rps14</italic>, <italic>rps16</italic>, <italic>rps17</italic>, <italic>rps18</italic>, <italic>rps19</italic>, <italic>rps20</italic>, <italic>dnaB</italic>, <italic>rpoA</italic>, <italic>rpoB</italic>, <italic>rpoC1</italic>, <italic>rpoC2</italic>, <italic><underline>syfB</underline></italic>, <italic><underline>tsf</underline></italic>, <italic><underline>tufA</underline></italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Photosystem I (12)</td>
<td valign="top" align="left"><italic>psaA</italic>, <italic>psaB</italic>, <italic>psaC</italic>, <italic>psaD</italic>, <italic><underline>psaE</underline></italic>, <italic>psaF</italic>, <italic><underline>psaI</underline></italic>, <italic>psaJ</italic>, <italic>psaL</italic>, <italic><underline>psaM</underline></italic>, <italic>ycf3</italic>, <italic>ycf4</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Photosystem II (19)</td>
<td valign="top" align="left"><italic>psbA</italic>, <italic>psbB</italic>, <italic>psbC</italic>, <italic>psbD</italic>, <italic>psbE</italic>, <italic>psbF</italic>, <italic>psbH</italic>, <italic>psbI</italic>, <italic>psbJ</italic>, <italic>psbK</italic>, <italic>psbL</italic>, <italic>psbN</italic>, <italic>psbT</italic>, <italic>psbV</italic>, <italic>psbW</italic>, <italic>psbX</italic>, <italic>psbY</italic>, <italic>psbZ</italic>, <italic>ycf12</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Electron transport and ATP synthesis (18)</td>
<td valign="top" align="left"><italic>atpA</italic>, <italic>atpB</italic>, <italic>atpD</italic>, <italic>atpE</italic>, <italic>atpF</italic>, <italic>atpG</italic>, <italic>atpH</italic>, <italic>atpI</italic>, <italic>ccs1</italic>, <italic>ccsA</italic>, <italic>petA</italic>, <italic>petB</italic>, <italic>petD</italic>, <italic>petF</italic>, <italic>petG</italic>, <italic>petL</italic>, <italic>petM</italic>, <italic>petN</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Carbon assimilation and metabolism (8)</td>
<td valign="top" align="left"><italic>acpP1</italic>, <italic><underline>acpP2</underline></italic>, <italic><underline>ilvB</underline></italic>, <italic><underline>ilvH</underline></italic>, <italic>rbcL</italic>, <italic>rbcS</italic>, <italic>thiG</italic>, <italic>thiS</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Light harvesting and chl biosynthesis (1)</td>
<td valign="top" align="left"><italic>chlI</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Signal transduction (2)</td>
<td valign="top" align="left"><italic>cbbX</italic>, <italic>rbcR</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Protein import (4)</td>
<td valign="top" align="left"><italic>secA</italic>, <italic>secG</italic>, <italic>secY</italic>, <italic>tatC</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Fe-S assembly (2)</td>
<td valign="top" align="left"><italic>sufB</italic>, <italic>sufC</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Chaperones (2)</td>
<td valign="top" align="left"><italic>dnaK</italic>, <italic>groEL</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Antioxidase and proteolysis (3)</td>
<td valign="top" align="left"><italic>bas1</italic>, <italic>clpC</italic>, <italic>ftsH</italic></td>
</tr>
<tr>
<td valign="top" align="left">&#x2003;Conserved chloroplast PCGs (10)</td>
<td valign="top" align="left"><italic>ycf33</italic>, <italic><underline>ycf35</underline></italic>, <italic>ycf39</italic>, <italic>ycf41</italic>, <italic>ycf45</italic>, <italic>ycf46</italic>, <italic><underline>ycf66</underline></italic>, <italic>ycf88</italic>, <italic>ycf89</italic>, <italic>ycf90</italic></td>
</tr>
<tr>
<td valign="top" align="left">Ribosomal RNA genes (rRNAs: 3)</td>
<td valign="top" align="left"><italic>rnl</italic>, <italic>rns</italic>, <italic>rrn5</italic></td>
</tr>
<tr>
<td valign="top" align="left">Transfer RNA genes (tRNAs: 27)</td>
<td valign="top" align="left"><italic>trnA(ugc)</italic>, <italic>trnC(gca)</italic>, <italic>trnD(guc)</italic>, <italic>trnE(uuc)</italic>, <italic>trnF(gaa)</italic>, <italic>trnG1(gcc)</italic>, <italic>trnG2(ucc)</italic>, <italic>trnH(gug)</italic>, <italic>trnI1(gau)</italic>, <italic>trnK(uuu)</italic>, <italic>trnL1(uaa)</italic>, <italic>trnL2(uag)</italic>, <italic>trnM1(cau)</italic>, <italic>trnM2(cau)</italic>, <italic>trnM3(cau)</italic>, <italic>trnN(guu)</italic>, <italic>trnP(ugg)</italic>, <italic>trnQ(uug)</italic>, <italic>trnR1(acg)</italic>, <italic>trnR2(ucu)</italic>, <italic>trnR3(ccg)</italic>, <italic>trnS1(gcu)</italic>, <italic>trnS2(uga)</italic>, <italic>trnT(ugu)</italic>, <italic>trnV(uac)</italic>, <italic>trnW(cca)</italic>, <italic>trnY(gua)</italic></td>
</tr>
<tr>
<td valign="top" align="left">Small RNA genes (sRNAs: 2)</td>
<td valign="top" align="left"><italic>ffs</italic>, <italic>ssrA</italic></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t2fns1"><p>&#x002A;Genes are classified according to their function. The underlined genes represent that these genes have experienced varying degrees of loss in different lineages of Coscinodiscophyceae.</p></fn>
<fn id="t2fns2"><p>&#x002A;&#x002A;Numbers within parentheses indicate the number of genes in a specific functional group.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>All these coscinodiscophycean cpDNAs belong to intron-less plastomes. The vast majority of sequenced diatom plastomes contain no intron, and only several diatom plastomes harbor a small number of introns. So far, plastid introns were detected in six housekeeping genes including <italic>atpB</italic>, <italic>groEL</italic>, <italic>petB</italic>, <italic>petD</italic>, <italic>psaA</italic>, and <italic>rnl</italic> among sequenced diatom plastomes (<xref ref-type="bibr" rid="B7">Brembu et al., 2014</xref>; <xref ref-type="bibr" rid="B45">Ruck et al., 2017</xref>; <xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>; <xref ref-type="bibr" rid="B16">Hamsher et al., 2019</xref>; <xref ref-type="bibr" rid="B13">Gastineau et al., 2021</xref>; <xref ref-type="bibr" rid="B14">G&#x00F3;recka et al., 2021</xref>; <xref ref-type="bibr" rid="B33">Liu et al., 2021a</xref>). Overall, diatom plastomes exhibit an essential characteristic of being uncontaminated by intron. This characteristic is relatively similar in chloroplasts originating from secondary endosymbiosis (e.g., Phaeophyceae) (<xref ref-type="bibr" rid="B32">Liu et al., 2017</xref>), but significantly different from chloroplasts originating from primary endosymbiosis (e.g., Ulvophyceae and land plants), which typically contain more group I or/and group II introns in housekeeping genes (<xref ref-type="bibr" rid="B31">Liu et al., 2023</xref>).</p>
<p>Frequent loss of housekeeping PCGs were observed in these coscinodiscophycean plastomes, suggesting that diatom plastomes showed an ongoing reduction in gene content during evolution. Five PCGs including <italic>ilvB</italic>, <italic>ilvH</italic>, <italic>tsf</italic>, <italic>syfB</italic>, and <italic>acpP2</italic> were absent from plastomes of both <italic>Paralia</italic> and <italic>Stephanopyxis</italic>. Considering their phylogenetic relationship in Coscinodiscophyceae (<xref ref-type="fig" rid="F4">Figure 4</xref>), it is most likely that the loss of these five PCGs happened after the split of the Paraliales-Stephanopyxales complex from the Rhizosoleniales-Coscinodiscales complex. Three PCGs including <italic>ilvB</italic>, <italic>ilvH</italic>, and <italic>tsf</italic> were missing in all of six Rhizosoleniales plastomes, suggesting that the loss of these three genes might occur earlier than the loss of other 10 PCGs in Rhizosoleniales. The plastomes of <italic>Rhizosolenia fallax</italic>-<italic>imbricata</italic> lost more PCGs compared with the other lineages (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>). Especially, some housekeeping PCGs associated with photosystem I (<italic>psaE</italic>, <italic>psaI</italic>, and <italic>psaM</italic>) and ribosome (<italic>rpl33</italic> and <italic>rpl36</italic>) were lost. Combined with phylogenetic analysis, it can be seen that the loss of these genes in the <italic>R. fallax</italic>-<italic>imbricata</italic> plastomes should be recent events. Considering the important function of these genes, they are most likely transferred into the nuclear genome through endosymbiotic gene transfer. Similar events have been identified in other diatoms. For example, endosymbiotic gene transfers of <italic>petF</italic> into nuclear genomes were identified to occur in the plastomes of <italic>Thalassiosira</italic> and <italic>Skeletonema</italic> species (<xref ref-type="bibr" rid="B33">Liu et al., 2021a</xref>,<xref ref-type="bibr" rid="B34">b</xref>). The transfer of <italic>petF</italic> from plastome to nuclear genome is linked to ecological success of <italic>Thalassiosira oceanica</italic> (<xref ref-type="bibr" rid="B35">Lommer et al., 2010</xref>). In Coscinodiscophyceae, two PCGs, <italic>ilvB</italic> and <italic>ilvH</italic>, were only present in <italic>Coscinodiscus</italic> plastomes, while <italic>tsf</italic> were found only in <italic>A. subtilis</italic> plastome. These three PCGs have been lost multiple times in the evolution of diatoms, and nowadays are rarely scattered in plastomes of specific species in different lineages of diatoms (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>). It was found that <italic>tsf</italic> was present in nuclear genomes of some diatoms (e.g., <italic>Phaeodactylum</italic> and <italic>Thalassiosira</italic> species) with <italic>tsf</italic>-lacking plastomes (<xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>).</p>
<p>Based on the distribution pattern of two <italic>acpP</italic> genes (<italic>acpP1</italic> and <italic>acpP2</italic>) in diatom plastomes, <italic>acpP</italic> was previously considered to have experienced multiple independent duplications in the <italic>acpP1/2</italic>-containning plastomes, meanwhile it was believed to have undergone multiple losses in the <italic>acpP</italic>-lacking plastomes during the evolution of diatoms (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>). Our phylogenetic analysis based on the AA sequences of AcpP1 and AcpP2 in the plastomes of diatoms and <italic>T. laevis</italic> revealed that the <italic>acpP</italic> genes were clearly grouped into two clades representing <italic>acpP1</italic> and <italic>acpP2</italic> lineages, respectively (<xref ref-type="fig" rid="F5">Figure 5</xref>). Each clade harbored related members from Bacillariophyceae, Mediophyceae and Coscinodiscophyceae. These results indicated that <italic>acpP1</italic> and <italic>acpP2</italic> in diatom plastomes were not derived from multiple gene duplications occurring in different lineages of diatoms, instead they originated from an early gene duplication event occurred in the common progenitor after diatom emergence, followed by frequent loss of one or both genes (<italic>acpP1</italic> or/and <italic>acpP2</italic>) in different diatom lineages (<xref ref-type="bibr" rid="B46">Ruck et al., 2014</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Unrooted Maximum Likelihood (ML) phylogenetic tree based on the amino acid sequences of AcpP1 and AcpP2 found in plastomes of diatoms and <italic>Triparma laevis</italic>. Numbers at the branches represented bootstrap values. The bootstrap support values greater than 70% were displayed at branches. Branch lengths were proportional to the amount of sequence change, which were indicated by the scale bar below the trees.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g005.tif"/>
</fig>
</sec>
<sec id="S3.SS4">
<title>Variations of IRs and gene order in coscinodiscophycean plastomes</title>
<p>Size changes in these coscinodiscophycean plastomes were mainly caused by variations of IRs, which ranged from 11.3 kb in <italic>A. curvatulus</italic> to 20.1 kb in <italic>S. turris</italic>. Similar to most of the IR-containing plastomes, diatom IRs were composed of the conserved <italic>rns</italic>-<italic>trnI</italic>-<italic>trnA</italic>-<italic>rnl</italic>-<italic>rrn5</italic> gene block and additional genes that flanked the ribosomal gene operon. However, IRs of these coscinodiscophycean plastomes shared a much larger conserved gene block containing nine genes, i.e., <italic>acpP</italic>-<italic>trnP</italic>-<italic>ycf89</italic>- <italic>rns</italic>-<italic>trnI</italic>-<italic>trnA</italic>-<italic>rnl</italic>-<italic>rrn5</italic>-<italic>psbA</italic>. At the intrageneric level of <italic>Guinardia</italic>, <italic>Actinocyclus</italic> and <italic>Coscinodiscus</italic>, the structure and size of IRs show little fluctuations (<xref ref-type="fig" rid="F6">Figure 6</xref>), suggesting that IRs were generally conserved among closely related species. However, IRs in <italic>S. turris</italic> and <italic>R. fallax</italic>-<italic>imbricata</italic> exhibited a similar trend of large expansion to SSCs and slightly small contraction from LSCs, which eventually led to the conspicuous increase in IR sizes. The most obvious expansion of IRs was observed in <italic>Stephanopyxis</italic> plastome with the 20.1-kb IR which harbored 23 genes and was much longer than those in other coscinodiscophycean lineages (<xref ref-type="fig" rid="F6">Figure 6</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Comparison of inverted repeats (IRs) in the 12 coscinodiscophycean plastomes. The Maximum Likelihood (ML) tree on the left showed their phylogenetic relationships, and the photo on the right showed the structure and size of IRs. Annotated genes were colored according to their functional categories.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g006.tif"/>
</fig>
<p>In the evolution of diatoms, the contraction and expansion of IRs occurred frequently in many lineages, which resulted in a huge variations of IR sizes, ranging from 6.8 kb in <italic>Synedra acus</italic> to 79.0 kb in <italic>Climaconeis cf. scalaris</italic>, an eleven-fold difference (<xref ref-type="bibr" rid="B12">Galachyants et al., 2012</xref>; <xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>; <xref ref-type="bibr" rid="B13">Gastineau et al., 2021</xref>). Interestingly, it was found that the circular <italic>Pseudo-nitzschia multiseries</italic> plastome (KR709240) could lack IR (<xref ref-type="bibr" rid="B8">Cao et al., 2016</xref>; <xref ref-type="bibr" rid="B17">He et al., 2022</xref>), which may represent a new trend in the evolution of diatom plastomes if the assembly was correct. As found in the green algae (e.g., <italic>Ulva</italic>), IR is not an essential feature of plastomes (<xref ref-type="bibr" rid="B30">Liu and Melton, 2021</xref>; <xref ref-type="bibr" rid="B31">Liu et al., 2023</xref>), and it would change considerably in different eukaryotic photosynthetic lineages.</p>
<p>Mauve alignments showed that <italic>Actinocyclus</italic> and <italic>Coscinodiscus</italic> plastomes shared the same gene order and showed high collinearity at the intra-order level of Coscinodiacales (<xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>). At the intrageneric level of <italic>Rhizosolenia</italic>, multiple inversion and translocation events were identified (<xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>). Within the genus <italic>Guinardia</italic>, we observed that one of IR gene clusters composing of 14 genes was inverted in the <italic>G. striata</italic> plastome (MG755796) (<xref ref-type="bibr" rid="B61">Yu et al., 2018</xref>), which resulted in two IR gene clusters being arranged in a forward direction instead of a reverse direction (<xref ref-type="fig" rid="F7">Figure 7</xref> and <xref ref-type="supplementary-material" rid="DS2">Supplementary Figure 2</xref>). If this was not due to assembly issues, the emergence of this unusual IR rearrangement represents new evolutionary features that are significantly different from our newly sequenced <italic>Guinardia</italic> plastomes (OM827251 and OM827252) which maintain a usual arrangement of IRs.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>Pair-wise comparison of gene order and genome rearrangement between coscinodiscophycean plastomes. <bold>(A)</bold> Comparison of <italic>Actinocyclus</italic> sp. (OM827248) and <italic>Actinocyclus subtilis</italic> (MG755799). <bold>(B)</bold> Comparison of <italic>Guinardia delicatula</italic> (OM827252) and <italic>Guinardia striata</italic> (MG755796). <bold>(C)</bold> Comparison of <italic>Guinardia striata</italic> (OM827251) and <italic>Guinardia striata</italic> (MG755796). <bold>(D)</bold> Comparison of <italic>Stephanopyxis turris</italic> (OM827249) and <italic>Paralia sulcate</italic> (OM827250).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1203780-g007.tif"/>
</fig>
<p>Different diatom lineages showed significantly different evolution rates in gene order. Coscinodiacales and Rhizosoleniales appeared earlier than Paraliales and Stephanopyxales in the evolution of diatoms (<xref ref-type="fig" rid="F3">Figure 3</xref>). Gene order in Coscinodiacales were highly conserved at the intra-order level, while multiple rearrangements were observed in Rhizosoleniales and between Paraliales and Stephanopyxales (<xref ref-type="fig" rid="F7">Figure 7</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>). Considering all instances of IR variations, it seems that variations of gene orders matched well with variations of IRs in these diatom plastomes, consistent with previous understanding that. IR regions might play an important role in stabilizing the architecture of plastomes (<xref ref-type="bibr" rid="B57">Turmel and Lemieux, 2018</xref>). It is reasonable to assume that accelerating IR changes involves frequent genome recombination and rearrangement (<xref ref-type="fig" rid="F7">Figure 7</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>), when considering substantial variation of the additional genes that flanked the ribosomal gene operon (<italic>rns</italic>-<italic>trnI</italic>-<italic>trnA</italic>-<italic>rnl</italic>-<italic>rrn5</italic>). However, despite frequent plastome rearrangements in these diatoms, the affiliation of gene clusters has not changed, suggesting that rearrangements were strictly restricted within LSC and SSC. This is different from what was observed in the IR-losing plastomes in green algae where gene clusters have undergone a larger range of frequent changes (<xref ref-type="bibr" rid="B30">Liu and Melton, 2021</xref>; <xref ref-type="bibr" rid="B31">Liu et al., 2023</xref>).</p>
</sec>
</sec>
<sec id="S4" sec-type="conclusion">
<title>Conclusion</title>
<p>Construction of six plastomes of diatom species in the class Coscinodiscophyceae in our work substantially boosted the number of assembled plastomes in this class, which in turn helped to reveal new evolutionary trends and details of diatom plastomes. The coscinodiscophycean plastomes in different lineages exhibited different evolutionary trends. The plastomes in Coscinodiacales were highly conserved in genome size, gene content, IR structure and gene order at the intra-order level, but plastomes in Rhizosoleniales and Paraliales-Stephanopyxales complex displayed multiple changes in the above aspects, even at the intrageneric level (e.g., <italic>Rhizosolenia</italic>). Comparative analysis revealed that 120 protein-coding genes (PCGs), three ribosomal RNA genes (rRNAs), 27 transfer RNA genes (tRNAs) and two regulatory small RNA genes (sRNAs) were shared by these coscinodiscophycean plastomes, but the other 13 PCGs were lost to varying degrees in different coscinodiscophycean lineages. The lost housekeeping PCGs could have been transferred into the nuclear genome via endosymbiotic gene transfer. Frequent loss of housekeeping PCGs were observed in these coscinodiscophycean plastomes, suggesting that diatom plastomes showed an ongoing reduction in gene content during evolution. The IRs in <italic>S. turris</italic> and <italic>R. fallax</italic>-<italic>imbricata</italic> showed a trend of expansion to the SSC and contraction from the LSC, which eventually led to the conspicuous increase in IR size. Different lineages of diatoms show significantly different evolution rates in gene order. As more genomic data accumulate in future, we will be able to depict more comprehensive evolutionary pathways and reveal evolution mechanism of species diversity in diatoms.</p>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in this article/<xref ref-type="supplementary-material" rid="TS1">Supplementary material</xref>.</p>
</sec>
<sec id="S6" sec-type="author-contributions">
<title>Author contributions</title>
<p>NC designed the study. YW, HH, and FL performed the experiments. FL and YW performed the analysis. FL wrote the manuscript with contributions from coauthors. All authors have read and approved the final version of the manuscript.</p>
</sec>
</body>
<back>
<sec id="S7" sec-type="funding-information">
<title>Funding</title>
<p>This work was financially supported by the National Natural Science Foundation of China (Nos. 42176162 and 42276133), the Strategic Priority Research Program of Chinese Academy of Sciences (Nos. XDB42000000, XDA23050302, and XDA23050403), the Science and Technology Basic Resources Investigation Program of China (No. 2018FY100200), the Chinese Academy of Sciences Pioneer Hundred Talents Program (to NC), and the Taishan Scholar Project Special Fund (to NC).</p>
</sec>
<ack><p>We are grateful to colleagues from the Jiaozhou Bay National Marine Ecosystem Research Station for their help in field sampling, and all staffs of marine ecological environment genomics research group in Institute of Oceanology, Chinese Academy of Sciences. Statistical analyses were supported by Oceanographic Data Center, Institute of Oceanology, Chinese Academy of Sciences (IOCAS).</p>
</ack>
<sec id="S8" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S9" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S10" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2023.1203780/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2023.1203780/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.pdf" id="TS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.pdf" id="TS2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.pdf" id="TS3" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="DS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.PDF" id="DS2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://github.com/BFL-lab/Mfannot">https://github.com/BFL-lab/Mfannot</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/orffinder/">https://www.ncbi.nlm.nih.gov/orffinder/</ext-link></p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Armbrust</surname> <given-names>E. V.</given-names></name></person-group> (<year>2009</year>). <article-title>The life of diatoms in the world&#x2019;s oceans.</article-title> <source><italic>Nature</italic></source> <volume>459</volume> <fpage>185</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1038/nature08057</pub-id> <pub-id pub-id-type="pmid">19444204</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Armbrust</surname> <given-names>E. V.</given-names></name> <name><surname>Berges</surname> <given-names>J. A.</given-names></name> <name><surname>Bowler</surname> <given-names>C.</given-names></name> <name><surname>Green</surname> <given-names>B. R.</given-names></name> <name><surname>Martinez</surname> <given-names>D.</given-names></name> <name><surname>Putnam</surname> <given-names>N. H.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism.</article-title> <source><italic>Science</italic></source> <volume>306</volume> <fpage>79</fpage>&#x2013;<lpage>86</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bankevich</surname> <given-names>A.</given-names></name> <name><surname>Nurk</surname> <given-names>S.</given-names></name> <name><surname>Antipov</surname> <given-names>D.</given-names></name> <name><surname>Gurevich</surname> <given-names>A. A.</given-names></name> <name><surname>Dvorkin</surname> <given-names>M.</given-names></name> <name><surname>Kulikov</surname> <given-names>A. S.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.</article-title> <source><italic>J. Comput. Biol.</italic></source> <volume>19</volume> <fpage>455</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1089/cmb.2012.0021</pub-id> <pub-id pub-id-type="pmid">22506599</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhattacharya</surname> <given-names>D.</given-names></name> <name><surname>Archibald</surname> <given-names>J. M.</given-names></name> <name><surname>Weber</surname> <given-names>A. P.</given-names></name> <name><surname>Reyes-Prieto</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>How do endosymbionts become organelles? Understanding early events in plastid evolution</article-title>. <source><italic>Bioessays</italic></source> <volume>29</volume> <fpage>1239</fpage>&#x2013;<lpage>1246</lpage>. <pub-id pub-id-type="doi">10.1002/bies.20671</pub-id> <pub-id pub-id-type="pmid">18027391</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolger</surname> <given-names>A. M.</given-names></name> <name><surname>Lohse</surname> <given-names>M.</given-names></name> <name><surname>Usadel</surname> <given-names>B.</given-names></name></person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illumina sequence data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>2114</fpage>&#x2013;<lpage>2120</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id> <pub-id pub-id-type="pmid">24695404</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowler</surname> <given-names>C.</given-names></name> <name><surname>Allen</surname> <given-names>A. E.</given-names></name> <name><surname>Badger</surname> <given-names>J. H.</given-names></name> <name><surname>Grimwood</surname> <given-names>J.</given-names></name> <name><surname>Jabbari</surname> <given-names>K.</given-names></name> <name><surname>Kuo</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>The Phaeodactylum genome reveals the evolutionary history of diatom genomes.</article-title> <source><italic>Nature</italic></source> <volume>456</volume> <fpage>239</fpage>&#x2013;<lpage>244</lpage>.</citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brembu</surname> <given-names>T.</given-names></name> <name><surname>Winge</surname> <given-names>P.</given-names></name> <name><surname>Tooming-Klunderud</surname> <given-names>A.</given-names></name> <name><surname>Nederbragt</surname> <given-names>A. J.</given-names></name> <name><surname>Jakobsen</surname> <given-names>K. S.</given-names></name> <name><surname>Bones</surname> <given-names>A. M.</given-names></name></person-group> (<year>2014</year>). <article-title>The chloroplast genome of the diatom Seminavis robusta: new features introduced through multiple mechanisms of horizontal gene transfer.</article-title> <source><italic>Mar. Genomics</italic></source> <volume>16</volume> <fpage>17</fpage>&#x2013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1016/j.margen.2013.12.002</pub-id> <pub-id pub-id-type="pmid">24365712</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cao</surname> <given-names>M.</given-names></name> <name><surname>Yuan</surname> <given-names>X.-L.</given-names></name> <name><surname>Bi</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>Complete sequence and analysis of plastid genomes of pseudo-nitzschia multiseries (bacillariophyta).</article-title> <source><italic>Mitochondrial DNA Part A</italic></source> <volume>27</volume> <fpage>2897</fpage>&#x2013;<lpage>2898</lpage>. <pub-id pub-id-type="doi">10.3109/19401736.2015.1060428</pub-id> <pub-id pub-id-type="pmid">26403336</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capella-Gutierrez</surname> <given-names>S.</given-names></name> <name><surname>Silla-Martinez</surname> <given-names>J. M.</given-names></name> <name><surname>Gabaldon</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1972</fpage>&#x2013;<lpage>1973</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp348</pub-id> <pub-id pub-id-type="pmid">19505945</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Darling</surname> <given-names>A. E.</given-names></name> <name><surname>Mau</surname> <given-names>B.</given-names></name> <name><surname>Perna</surname> <given-names>N. T.</given-names></name></person-group> (<year>2010</year>). <article-title>ProgressiveMauve: multiple genome alignment with gene gain. Loss and rearrangement</article-title>. <source><italic>PLoS One</italic></source> <volume>5</volume> <issue>e11147</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0011147</pub-id> <pub-id pub-id-type="pmid">20593022</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Falkowski</surname> <given-names>P. G.</given-names></name> <name><surname>Barber</surname> <given-names>R. T.</given-names></name> <name><surname>Smetacek</surname> <given-names>V.</given-names></name></person-group> (<year>1998</year>). <article-title>Biogeochemical controls and feedbacks on ocean primary production.</article-title> <source><italic>Science</italic></source> <volume>281</volume> <fpage>200</fpage>&#x2013;<lpage>206</lpage>. <pub-id pub-id-type="doi">10.1126/science.281.5374.200</pub-id> <pub-id pub-id-type="pmid">9660741</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galachyants</surname> <given-names>Y. P.</given-names></name> <name><surname>Morozov</surname> <given-names>A. A.</given-names></name> <name><surname>Mardanov</surname> <given-names>A. V.</given-names></name> <name><surname>Beletsky</surname> <given-names>A. V.</given-names></name> <name><surname>Ravin</surname> <given-names>N. V.</given-names></name> <name><surname>Petrova</surname> <given-names>D. P.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Complete Chloroplast genome sequence of freshwater Araphid pennate diatom alga Synedra acus from lake baikal.</article-title> <source><italic>International Journal of Biology.</italic></source> <volume>4</volume> <fpage>27</fpage>&#x2013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.5539/ijb.v4n1p27</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gastineau</surname> <given-names>R.</given-names></name> <name><surname>Davidovich</surname> <given-names>N.</given-names></name> <name><surname>Davidovich</surname> <given-names>O.</given-names></name> <name><surname>Lemieux</surname> <given-names>C.</given-names></name> <name><surname>Turmel</surname> <given-names>M.</given-names></name> <name><surname>Wr&#x00F3;bel</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Extreme enlargement of the inverted repeat region in the plastid genomes of diatoms from the genus climaconeis.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>22</volume> <issue>7155</issue>. <pub-id pub-id-type="doi">10.3390/ijms22137155</pub-id> <pub-id pub-id-type="pmid">34281209</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>G&#x00F3;recka</surname> <given-names>E.</given-names></name> <name><surname>Gastineau</surname> <given-names>R.</given-names></name> <name><surname>Davidovich</surname> <given-names>N.</given-names></name> <name><surname>Davidovich</surname> <given-names>O.</given-names></name> <name><surname>Ashworth</surname> <given-names>M.</given-names></name> <name><surname>Sabir</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Mitochondrial and plastid genomes of the Monoraphid diatom Schizostauron trachyderma.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>22</volume> <issue>11139</issue>. <pub-id pub-id-type="doi">10.3390/ijms222011139</pub-id> <pub-id pub-id-type="pmid">34681800</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guiry</surname> <given-names>M. D.</given-names></name> <name><surname>Guiry</surname> <given-names>G. M.</given-names></name></person-group> (<year>2023</year>). <source><italic>AlgaeBase. World-wide electronic publication.</italic></source> <publisher-loc>Galway</publisher-loc>: <publisher-name>National University of Ireland</publisher-name>.</citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamsher</surname> <given-names>S. E.</given-names></name> <name><surname>Keepers</surname> <given-names>K. G.</given-names></name> <name><surname>Pogoda</surname> <given-names>C. S.</given-names></name> <name><surname>Stepanek</surname> <given-names>J. G.</given-names></name> <name><surname>Kane</surname> <given-names>N. C.</given-names></name> <name><surname>Kociolek</surname> <given-names>J. P.</given-names></name></person-group> (<year>2019</year>). <article-title>Extensive chloroplast genome rearrangement amongst three closely related Halamphora spp. (Bacillariophyceae), and evidence for rapid evolution as compared to land plants.</article-title> <source><italic>PLoS One</italic></source> <volume>14</volume>:<issue>e0217824</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0217824</pub-id> <pub-id pub-id-type="pmid">31269054</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>Z.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Xu</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2022</year>). <article-title>Comparative analysis of pseudo-nitzschia chloroplast genomes revealed extensive inverted region variation and pseudo-nitzschia speciation.</article-title> <source><italic>Front. Mar. Sci.</italic></source> <volume>9</volume>:<issue>784579</issue>. <pub-id pub-id-type="doi">10.3389/fmars.2022.784579</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hemaiswarya</surname> <given-names>S.</given-names></name> <name><surname>Raja</surname> <given-names>R.</given-names></name> <name><surname>Kumar</surname> <given-names>R. R.</given-names></name> <name><surname>Ganesan</surname> <given-names>V.</given-names></name> <name><surname>Anbazhagan</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Microalgae: a sustainable feed source for aquaculture.</article-title> <source><italic>World J. Microbiol. Biotechnol.</italic></source> <volume>27</volume> <fpage>1737</fpage>&#x2013;<lpage>1746</lpage>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>J.-J.</given-names></name> <name><surname>Yu</surname> <given-names>W.-B.</given-names></name> <name><surname>Yang</surname> <given-names>J.-B.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>dePamphilis</surname> <given-names>C. W.</given-names></name> <name><surname>Yi</surname> <given-names>T.-S.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes.</article-title> <source><italic>Genome Biol.</italic></source> <volume>21</volume> <issue>241</issue>. <pub-id pub-id-type="doi">10.1186/s13059-020-02154-5</pub-id> <pub-id pub-id-type="pmid">32912315</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>D. T.</given-names></name> <name><surname>Taylor</surname> <given-names>W. R.</given-names></name> <name><surname>Thornton</surname> <given-names>J. M.</given-names></name></person-group> (<year>1992</year>). <article-title>The rapid generation of mutation data matrices from protein sequences.</article-title> <source><italic>Comput Appl Biosci</italic></source> <volume>8</volume> <fpage>275</fpage>&#x2013;<lpage>282</lpage>.</citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalyaanamoorthy</surname> <given-names>S.</given-names></name> <name><surname>Minh</surname> <given-names>B. Q.</given-names></name> <name><surname>Wong</surname> <given-names>T. K. F.</given-names></name> <name><surname>von Haeseler</surname> <given-names>A.</given-names></name> <name><surname>Jermiin</surname> <given-names>L. S.</given-names></name></person-group> (<year>2017</year>). <article-title>ModelFinder: fast model selection for accurate phylogenetic estimates.</article-title> <source><italic>Nat. Methods</italic></source> <volume>14</volume> <fpage>587</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.4285</pub-id> <pub-id pub-id-type="pmid">28481363</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Standley</surname> <given-names>D. M.</given-names></name></person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>30</volume> <fpage>772</fpage>&#x2013;<lpage>780</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id> <pub-id pub-id-type="pmid">23329690</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kiatmetha</surname> <given-names>P.</given-names></name> <name><surname>Siangdang</surname> <given-names>W.</given-names></name> <name><surname>Bunnag</surname> <given-names>B.</given-names></name> <name><surname>Senapin</surname> <given-names>S.</given-names></name> <name><surname>Withyachumnarnkul</surname> <given-names>B.</given-names></name></person-group> (<year>2011</year>). <article-title>Enhancement of survival and metamorphosis rates of Penaeus monodon larvae by feeding with the diatom Thalassiosira weissflogii.</article-title> <source><italic>Aquac. Int.</italic></source> <volume>19</volume> <fpage>599</fpage>&#x2013;<lpage>609</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kowallik</surname> <given-names>K. V.</given-names></name> <name><surname>Stoebe</surname> <given-names>B.</given-names></name> <name><surname>Schaffran</surname> <given-names>I.</given-names></name> <name><surname>Kroth-Pancic</surname> <given-names>P.</given-names></name> <name><surname>Freier</surname> <given-names>U.</given-names></name></person-group> (<year>1995</year>). <article-title>The chloroplast genome of a chlorophyll a+c-containing alga, Odontella sinensis.</article-title> <source><italic>Plant Mol. Biol. Rep.</italic></source> <volume>13</volume> <fpage>336</fpage>&#x2013;<lpage>342</lpage>. <pub-id pub-id-type="doi">10.1016/0014-5793(96)00658-8</pub-id> <pub-id pub-id-type="pmid">8706853</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krzywinski</surname> <given-names>M.</given-names></name> <name><surname>Schein</surname> <given-names>J.</given-names></name> <name><surname>Birol</surname> <given-names>I.</given-names></name> <name><surname>Connors</surname> <given-names>J.</given-names></name> <name><surname>Gascoyne</surname> <given-names>R.</given-names></name> <name><surname>Horsman</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Circos: an information aesthetic for comparative genomics.</article-title> <source><italic>Genome Res.</italic></source> <volume>19</volume> <fpage>1639</fpage>&#x2013;<lpage>1645</lpage>. <pub-id pub-id-type="doi">10.1101/gr.092759.109</pub-id> <pub-id pub-id-type="pmid">19541911</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Stecher</surname> <given-names>G.</given-names></name> <name><surname>Tamura</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>33</volume> <fpage>1870</fpage>&#x2013;<lpage>1874</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msw054</pub-id> <pub-id pub-id-type="pmid">27004904</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lang</surname> <given-names>B. F.</given-names></name> <name><surname>Nedelcu</surname> <given-names>A. M.</given-names></name></person-group> (<year>2012</year>). &#x201C;<article-title>Plastid genomes of algae</article-title>,&#x201D; in <source><italic>Advances in photosynthesis and respiration including bioenergy and related processes: genomics of chloroplasts and mitochondria</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Bock</surname> <given-names>R.</given-names></name> <name><surname>Knoop</surname> <given-names>V.</given-names></name></person-group> (<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>59</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1007/978-94-007-2920-9_3</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Fast and accurate short read alignment with burrows-wheeler transform.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1754</fpage>&#x2013;<lpage>1760</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id> <pub-id pub-id-type="pmid">19451168</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id> <pub-id pub-id-type="pmid">19505943</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Melton</surname> <given-names>J. T.</given-names></name></person-group> (<year>2021</year>). <article-title>Chloroplast genomes of the green-tide forming alga Ulva compressa: comparative chloroplast genomics in the genus Ulva (Ulvophyceae, Chlorophyta).</article-title> <source><italic>Front. Mar. Sci.</italic></source> <volume>8</volume> <issue>668542</issue>. <pub-id pub-id-type="doi">10.3389/fmars.2021.668542</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Qu</surname> <given-names>F.</given-names></name></person-group> (<year>2023</year>). <article-title>Novel insights into chloroplast genome evolution in the green macroalgal genus <italic>Ulva</italic> (Ulvophyceae, Chlorophyta).</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>14</volume>:<issue>1126175</issue>.</citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Jin</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Bi</surname> <given-names>Y.</given-names></name> <name><surname>Melton</surname> <given-names>J. T.</given-names></name></person-group> (<year>2017</year>). <article-title>Plastid genome of Dictyopteris divaricata (Dictyotales, Phaeophyceae): understanding the evolution of plastid genomes in brown algae.</article-title> <source><italic>Mar. Biotech.</italic></source> <volume>19</volume> <fpage>627</fpage>&#x2013;<lpage>637</lpage>. <pub-id pub-id-type="doi">10.1007/s10126-017-9781-5</pub-id> <pub-id pub-id-type="pmid">29164355</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Cui</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Xu</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2021a</year>). <article-title>Comparative analysis of chloroplast genomes of thalassiosira species.</article-title> <source><italic>Front. Mar. Sci.</italic></source> <volume>8</volume>:<issue>788307</issue>. <pub-id pub-id-type="doi">10.3389/fmars.2021.788307</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Xu</surname> <given-names>Q.</given-names></name> <name><surname>Liu</surname> <given-names>K.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2021b</year>). <article-title>Chloroplast genomes for five skeletonema species: comparative and phylogenetic analysis.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>12</volume>:<issue>2962</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2021.774617</pub-id> <pub-id pub-id-type="pmid">34966403</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lommer</surname> <given-names>M.</given-names></name> <name><surname>Roy</surname> <given-names>A.-S.</given-names></name> <name><surname>Schilhabel</surname> <given-names>M.</given-names></name> <name><surname>Schreiber</surname> <given-names>S.</given-names></name> <name><surname>Rosenstiel</surname> <given-names>P.</given-names></name> <name><surname>LaRoche</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>Recent transfer of an iron-regulated gene from the plastid to the nuclear genome in an oceanic diatom adapted to chronic iron limitation.</article-title> <source><italic>BMC Genomics</italic></source> <volume>11</volume>:<issue>718</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-718</pub-id> <pub-id pub-id-type="pmid">21171997</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mann</surname> <given-names>D. G.</given-names></name> <name><surname>Droop</surname> <given-names>S. J. M.</given-names></name></person-group> (<year>1996</year>). <article-title>Biodiversity, biogeography and conservation of diatoms.</article-title> <source><italic>Hydrobiologia</italic></source> <volume>336</volume> <fpage>19</fpage>&#x2013;<lpage>32</lpage>.</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mann</surname> <given-names>D. G.</given-names></name> <name><surname>Trobajo</surname> <given-names>R.</given-names></name> <name><surname>Sato</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <name><surname>Witkowski</surname> <given-names>A.</given-names></name> <name><surname>Rimet</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Ripe for reassessment: a synthesis of available molecular data for the speciose diatom family <italic>Bacillariaceae</italic>.</article-title> <source><italic>Mol. Phylogenetics Evol.</italic></source> <volume>158</volume> <issue>106985</issue>. <pub-id pub-id-type="doi">10.1016/j.ympev.2020.106985</pub-id> <pub-id pub-id-type="pmid">33059066</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matari</surname> <given-names>N. H.</given-names></name> <name><surname>Blair</surname> <given-names>J. E.</given-names></name></person-group> (<year>2014</year>). <article-title>A multilocus timescale for oomycete evolution estimated under three distinct molecular clock models.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>14</volume>:<issue>101</issue>. <pub-id pub-id-type="doi">10.1186/1471-2148-14-101</pub-id> <pub-id pub-id-type="pmid">24884411</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Medlin</surname> <given-names>L. K.</given-names></name></person-group> (<year>2015</year>). <article-title>A timescale for diatom evolution based on four molecular markers: reassessment of ghost lineages and major steps defining diatom evolution.</article-title> <source><italic>Vie Milieu-Life Environ.</italic></source> <volume>65</volume> <fpage>219</fpage>&#x2013;<lpage>238</lpage>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Medlin</surname> <given-names>L. K.</given-names></name> <name><surname>Desdevises</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Phylogenetic reconstruction of diatoms using a seven-gene dataset, multiple outgroups, and morphological data for a total evidence approach.</article-title> <source><italic>Phycologia</italic></source> <volume>59</volume> <fpage>422</fpage>&#x2013;<lpage>436</lpage>.</citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Medlin</surname> <given-names>L. K.</given-names></name> <name><surname>Kaczmarska</surname> <given-names>I.</given-names></name></person-group> (<year>2004</year>). <article-title>Evolution of the diatoms: V. Morphological and cytological support for the major clades and a taxonomic revision.</article-title> <source><italic>Phycologia</italic></source> <volume>43</volume> <fpage>245</fpage>&#x2013;<lpage>270</lpage>.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Minh</surname> <given-names>B. Q.</given-names></name> <name><surname>Nguyen</surname> <given-names>M. A. T.</given-names></name> <name><surname>von Haeseler</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Ultrafast approximation for phylogenetic bootstrap.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>30</volume> <fpage>1188</fpage>&#x2013;<lpage>1195</lpage>.</citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nurachman</surname> <given-names>Z.</given-names></name> <name><surname>Anita</surname> <given-names>S.</given-names></name> <name><surname>Anward</surname> <given-names>E. E.</given-names></name> <name><surname>Novirani</surname> <given-names>G.</given-names></name> <name><surname>Mangindaan</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Oil productivity of the tropical marine diatom Thalassiosira sp.</article-title> <source><italic>Bioresour. Technol.</italic></source> <volume>108</volume> <fpage>240</fpage>&#x2013;<lpage>244</lpage>.</citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oudot-Le Secq</surname> <given-names>M.-P.</given-names></name> <name><surname>Grimwood</surname> <given-names>J.</given-names></name> <name><surname>Shapiro</surname> <given-names>H.</given-names></name> <name><surname>Armbrust</surname> <given-names>E. V.</given-names></name> <name><surname>Bowler</surname> <given-names>C.</given-names></name> <name><surname>Green</surname> <given-names>B. R.</given-names></name></person-group> (<year>2007</year>). <article-title>Chloroplast genomes of the diatoms Phaeodactylum tricornutum and Thalassiosira pseudonana: comparison with other plastid genomes of the red lineage.</article-title> <source><italic>Mol. Genet. Genomics.</italic></source> <volume>277</volume> <fpage>427</fpage>&#x2013;<lpage>439</lpage>. <pub-id pub-id-type="doi">10.1007/s00438-006-0199-4</pub-id> <pub-id pub-id-type="pmid">17252281</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruck</surname> <given-names>E. C.</given-names></name> <name><surname>Linard</surname> <given-names>S. R.</given-names></name> <name><surname>Nakov</surname> <given-names>T.</given-names></name> <name><surname>Theriot</surname> <given-names>E. C.</given-names></name> <name><surname>Alverson</surname> <given-names>A. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Hoarding and horizontal transfer led to an expanded gene and intron repertoire in the plastid genome of the diatom, Toxarium undulatum (Bacillariophyta).</article-title> <source><italic>Curr. Genet.</italic></source> <volume>63</volume> <fpage>499</fpage>&#x2013;<lpage>507</lpage>. <pub-id pub-id-type="doi">10.1007/s00294-016-0652-9</pub-id> <pub-id pub-id-type="pmid">27655214</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruck</surname> <given-names>E. C.</given-names></name> <name><surname>Nakov</surname> <given-names>T.</given-names></name> <name><surname>Jansen</surname> <given-names>R. K.</given-names></name> <name><surname>Theriot</surname> <given-names>E. C.</given-names></name> <name><surname>Alverson</surname> <given-names>A. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Serial gene losses and foreign DNA underlie size and sequence variation in the plastid genomes of diatoms.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>6</volume> <fpage>644</fpage>&#x2013;<lpage>654</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evu039</pub-id> <pub-id pub-id-type="pmid">24567305</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabir</surname> <given-names>J. S. M.</given-names></name> <name><surname>Yu</surname> <given-names>M.</given-names></name> <name><surname>Ashworth</surname> <given-names>M. P.</given-names></name> <name><surname>Baeshen</surname> <given-names>N. A.</given-names></name> <name><surname>Baeshen</surname> <given-names>M. N.</given-names></name> <name><surname>Bahieldin</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Conserved gene order and expanded inverted repeats characterize plastid genomes of Thalassiosirales.</article-title> <source><italic>PLoS One</italic></source> <volume>9</volume>:<issue>e107854</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0107854</pub-id> <pub-id pub-id-type="pmid">25233465</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sims</surname> <given-names>P. A.</given-names></name> <name><surname>Mann</surname> <given-names>D. G.</given-names></name> <name><surname>Medlin</surname> <given-names>L. K.</given-names></name></person-group> (<year>2006</year>). <article-title>Evolution of the diatoms: insights from fossil, biological and molecular data.</article-title> <source><italic>Phycologia</italic></source> <volume>45</volume> <fpage>361</fpage>&#x2013;<lpage>402</lpage>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sinninghe-Damst&#x00E9;</surname> <given-names>J. S.</given-names></name> <name><surname>Muyzer</surname> <given-names>G.</given-names></name> <name><surname>Abbas</surname> <given-names>B.</given-names></name> <name><surname>Rampen</surname> <given-names>S. W.</given-names></name> <name><surname>Masse</surname> <given-names>G.</given-names></name> <name><surname>Allard</surname> <given-names>W. G.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>The rise of the rhizosolenid diatoms.</article-title> <source><italic>Science</italic></source> <volume>304</volume> <fpage>584</fpage>&#x2013;<lpage>587</lpage>. <pub-id pub-id-type="doi">10.1126/science.1096806</pub-id> <pub-id pub-id-type="pmid">15105500</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sorhannus</surname> <given-names>U.</given-names></name></person-group> (<year>2007</year>). <article-title>A nuclear-encoded small-subunit ribosomal RNA timescale for diatom evolution.</article-title> <source><italic>Marine Micropaleontol.</italic></source> <volume>65</volume> <fpage>1</fpage>&#x2013;<lpage>12</lpage>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sumper</surname> <given-names>M.</given-names></name> <name><surname>Brunner</surname> <given-names>E.</given-names></name></person-group> (<year>2008</year>). <article-title>Silica biomineralisation in diatoms: the model organism Thalassiosira pseudonana.</article-title> <source><italic>ChemBioChem</italic></source> <volume>9</volume> <fpage>1187</fpage>&#x2013;<lpage>1194</lpage>. <pub-id pub-id-type="doi">10.1002/cbic.200700764</pub-id> <pub-id pub-id-type="pmid">18381716</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tajima</surname> <given-names>N.</given-names></name> <name><surname>Saitoh</surname> <given-names>K.</given-names></name> <name><surname>Sato</surname> <given-names>S.</given-names></name> <name><surname>Maruyama</surname> <given-names>F.</given-names></name> <name><surname>Ichinomiya</surname> <given-names>M.</given-names></name> <name><surname>Yoshikawa</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Sequencing and analysis of the complete organellar genomes of parmales, a closely related group to Bacillariophyta (diatoms).</article-title> <source><italic>Curr. Genet.</italic></source> <volume>62</volume> <fpage>887</fpage>&#x2013;<lpage>896</lpage>. <pub-id pub-id-type="doi">10.1007/s00294-016-0598-y</pub-id> <pub-id pub-id-type="pmid">27091756</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanaka</surname> <given-names>T.</given-names></name> <name><surname>Fukuda</surname> <given-names>Y.</given-names></name> <name><surname>Yoshino</surname> <given-names>T.</given-names></name> <name><surname>Maeda</surname> <given-names>Y.</given-names></name> <name><surname>Muto</surname> <given-names>M.</given-names></name> <name><surname>Matsumoto</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>High-throughput pyrosequencing of the chloroplast genome of a highly neutral-lipid-producing marine pennate diatom, Fistulifera sp. strain JPCC DA0580.</article-title> <source><italic>Photosynth. Res.</italic></source> <volume>109</volume> <fpage>223</fpage>&#x2013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1007/s11120-011-9622-8</pub-id> <pub-id pub-id-type="pmid">21290260</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thompson</surname> <given-names>J. D.</given-names></name> <name><surname>Gibson</surname> <given-names>T. J.</given-names></name> <name><surname>Plewniak</surname> <given-names>F.</given-names></name> <name><surname>Jeanmougin</surname> <given-names>F.</given-names></name> <name><surname>Higgins</surname> <given-names>D. G.</given-names></name></person-group> (<year>1997</year>). <article-title>The ClustalX windows interface flexible strategies for multiple sequence alignment aided by quality analysis tools.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>25</volume> <fpage>4876</fpage>&#x2013;<lpage>4882</lpage>. <pub-id pub-id-type="doi">10.1093/nar/25.24.4876</pub-id> <pub-id pub-id-type="pmid">9396791</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thorvaldsdottir</surname> <given-names>H.</given-names></name> <name><surname>Robinson</surname> <given-names>J. T.</given-names></name> <name><surname>Mesirov</surname> <given-names>J. P.</given-names></name></person-group> (<year>2013</year>). <article-title>Integrative genomics viewer (IGV): high-performance genomics data visualization and exploration.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>14</volume> <fpage>178</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbs017</pub-id> <pub-id pub-id-type="pmid">22517427</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trifinopoulos</surname> <given-names>J.</given-names></name> <name><surname>Lam-Tung</surname> <given-names>N.</given-names></name> <name><surname>von Haeseler</surname> <given-names>A.</given-names></name> <name><surname>Minh</surname> <given-names>B. Q.</given-names></name></person-group> (<year>2016</year>). <article-title>W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>W232</fpage>&#x2013;<lpage>W235</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw256</pub-id> <pub-id pub-id-type="pmid">27084950</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turmel</surname> <given-names>M.</given-names></name> <name><surname>Lemieux</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>Evolution of the plastid genome in green algae.</article-title> <source><italic>Adv. Bot. Res.</italic></source> <volume>85</volume> <fpage>157</fpage>&#x2013;<lpage>193</lpage>.</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2022</year>). <article-title>Comparative analysis of Bacillariophyceae chloroplast genomes uncovers extensive genome rearrangements associated with speciation.</article-title> <source><italic>Int. J. Environ. Res. Public Health</italic></source> <volume>19</volume> <issue>10024</issue>. <pub-id pub-id-type="doi">10.3390/ijerph191610024</pub-id> <pub-id pub-id-type="pmid">36011659</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>Q.</given-names></name> <name><surname>Cui</surname> <given-names>Z.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2021</year>). <article-title>Comparative analysis of chloroplast genomes of seven chaetoceros species revealed variation hotspots and speciation time.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>12</volume>:<issue>742554</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.742554</pub-id> <pub-id pub-id-type="pmid">34803961</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name></person-group> (<year>2007</year>). <article-title>PAML 4: phylogenetic analysis by maximum likelihood.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>24</volume> <fpage>1586</fpage>&#x2013;<lpage>1591</lpage>.</citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>M.</given-names></name> <name><surname>Ashworth</surname> <given-names>M. P.</given-names></name> <name><surname>Hajrah</surname> <given-names>N. H.</given-names></name> <name><surname>Khiyami</surname> <given-names>M. A.</given-names></name> <name><surname>Sabir</surname> <given-names>M. J.</given-names></name> <name><surname>Alhebshi</surname> <given-names>A. M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Evolution of the plastid genomes in diatoms.</article-title> <source><italic>Adv. Bot. Res.</italic></source> <volume>85</volume> <fpage>129</fpage>&#x2013;<lpage>155</lpage>.</citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>D.</given-names></name> <name><surname>Gao</surname> <given-names>F.</given-names></name> <name><surname>Jakovlic</surname> <given-names>I.</given-names></name> <name><surname>Zou</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>W. X.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies.</article-title> <source><italic>Mol. Ecol. Resour.</italic></source> <volume>20</volume> <fpage>348</fpage>&#x2013;<lpage>355</lpage>. <pub-id pub-id-type="doi">10.1111/1755-0998.13096</pub-id> <pub-id pub-id-type="pmid">31599058</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>M.</given-names></name> <name><surname>Chen</surname> <given-names>N.</given-names></name></person-group> (<year>2022</year>). <article-title>Comparative analysis of Thalassionema chloroplast genomes revealed hidden biodiversity.</article-title> <source><italic>BMC Genomics</italic></source> <volume>23</volume>:<issue>327</issue>. <pub-id pub-id-type="doi">10.1186/s12864-022-08532-6</pub-id> <pub-id pub-id-type="pmid">35477350</pub-id></citation></ref>
</ref-list>
</back>
</article>
