<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2024.1241515</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Evolution of <italic>Whirly1</italic> in the angiosperms: sequence, splicing, and expression in a clade of early transitional mycoheterotrophic orchids</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Muti</surname>
<given-names>Rachel M.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2418192"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Barrett</surname>
<given-names>Craig F.</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1944214"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Sinn</surname>
<given-names>Brandon T.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff4">
<sup>4</sup>
</xref>
<xref ref-type="author-notes" rid="fn001">
<sup>*</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/2281715"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Biology and Earth Science, Otterbein University</institution>, <addr-line>Westerville, OH</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Hematology and Medical Oncology, Emory University</institution>, <addr-line>Atlanta, GA</addr-line>, <country>United States</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Department of Biology, West Virginia University</institution>, <addr-line>Morgantown, WV</addr-line>, <country>United States</country>
</aff>
<aff id="aff4">
<sup>4</sup>
<institution>Faculty of Biology, University of Latvia</institution>, <addr-line>Riga</addr-line>, <country>Latvia</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>Edited by: Oscar Alejandro P&#xe9;rez-Escobar, Royal Botanic Gardens, Kew, United Kingdom</p>
</fn>
<fn fn-type="edited-by">
<p>Reviewed by: Dewi Pramanik, National Research and Innovation Agency (BRIN), Indonesia</p>
<p>Katharina Nargar, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Australia</p>
</fn>
<fn fn-type="corresp" id="fn001">
<p>*Correspondence: Brandon T. Sinn, <email xlink:href="mailto:sinn1@otterbein.edu">sinn1@otterbein.edu</email>
</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>28</day>
<month>06</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2024</year>
</pub-date>
<volume>15</volume>
<elocation-id>1241515</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>06</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>07</day>
<month>06</month>
<year>2024</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2024 Muti, Barrett and Sinn</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Muti, Barrett and Sinn</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The plastid-targeted transcription factor <italic>Whirly1</italic> (<italic>WHY1</italic>) has been implicated in chloroplast biogenesis, plastid genome stability, and fungal defense response, which together represent characteristics of interest for the study of autotrophic losses across the angiosperms. While gene loss in the plastid and nuclear genomes has been well studied in mycoheterotrophic plants, the evolution of the molecular mechanisms impacting genome stability is completely unknown. Here, we characterize the evolution of <italic>WHY1</italic> in four early transitional mycoheterotrophic orchid species in the genus <italic>Corallorhiza</italic> by synthesizing the results of phylogenetic, transcriptomic, and comparative genomic analyses with <italic>WHY1</italic> genomic sequences sampled from 21 orders of angiosperms. We found an increased number of non-canonical <italic>WHY1</italic> isoforms assembled from all but the greenest <italic>Corallorhiza</italic> species, including intron retention in some isoforms. Within <italic>Corallorhiza</italic>, phylotranscriptomic analyses revealed the presence of tissue-specific differential expression of <italic>WHY1</italic> in only the most photosynthetically capable species and a coincident increase in the number of non-canonical <italic>WHY1</italic> isoforms assembled from fully mycoheterotrophic species. Gene- and codon-level tests of <italic>WHY1</italic> selective regimes did not infer significant signal of either relaxed selection or episodic diversifying selection in <italic>Corallorhiza</italic> but did so for relaxed selection in the late-stage full mycoheterotrophic orchids <italic>Epipogium aphyllum</italic> and <italic>Gastrodia elata</italic>. Additionally, nucleotide substitutions that most likely impact the function of <italic>WHY1</italic>, such as nonsense mutations, were only observed in late-stage mycoheterotrophs. We propose that our findings suggest that splicing and expression changes may precede the selective shifts we inferred for late-stage mycoheterotrophic species, which therefore does not support a primary role for <italic>WHY1</italic> in the transition to mycoheterotrophy in the Orchidaceae. Taken together, this study provides the most comprehensive view of <italic>WHY1</italic> evolution across the angiosperms to date.</p>
</abstract>
<kwd-group>
<kwd>mycoheterotrophy</kwd>
<kwd>
<italic>Corallorhiza</italic>
</kwd>
<kwd>orchid</kwd>
<kwd>genomic stability</kwd>
<kwd>intron retention</kwd>
<kwd>transcription factor</kwd>
<kwd>plastome evolution</kwd>
<kwd>
<italic>Whirly1</italic>
</kwd>
</kwd-group>
<contract-num rid="cn001">0830020</contract-num>
<contract-sponsor id="cn001">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content>
</contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="125"/>
<page-count count="17"/>
<word-count count="10363"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-in-acceptance</meta-name>
<meta-value>Plant Systematics and Evolution</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1" sec-type="intro">
<label>1</label>
<title>Introduction</title>
<p>The ability to photosynthesize has been lost dozens of times across the angiosperm Tree of Life, and at least 30 independent losses have occurred in the Orchidaceae (<xref ref-type="bibr" rid="B70">Merckx and Freudenstein, 2010</xref>; <xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>, <xref ref-type="bibr" rid="B6">Barrett et al, 2019</xref>). Mycoheterotrophy, the derivation of carbon nutrition from fungi (<xref ref-type="bibr" rid="B56">Leake, 1994</xref>), is common to all orchids during early development and is a nutritional requirement due to the lack of endosperm in their seeds (<xref ref-type="bibr" rid="B89">Rasmussen, 1995</xref>, <xref ref-type="bibr" rid="B90">Rasmussen, 2002</xref>). In lieu of stored nutrition, orchid seeds have evolved a complex symbiotic relationship with fungi, where orchid seeds germinate only in the presence of an appropriate fungal partner and the nutrition required for embryo development is derived exclusively via the degradation of fungal hyphae which penetrate the orchid cells (<xref ref-type="bibr" rid="B71">Merckx and Merckx, 2013</xref>; <xref ref-type="bibr" rid="B117">Zeng et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B115">Yuan et&#xa0;al., 2018</xref>). The duration of reliance upon their fungal partner for nutrition has been extended in some orchid species, which have evolved to parasitize fungi for the entirety of their lives. Independent shifts to a mycoheterotrophic condition throughout the angiosperms have independently led to plastid genome (plastome) degradation (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B111">Wicke et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B41">Graham et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B106">Timilsena et&#xa0;al., 2023</xref>), elevated rates of nucleotide substitution (<xref ref-type="bibr" rid="B59">Lemaire et&#xa0;al., 2010</xref>; <xref ref-type="bibr" rid="B111">Wicke et&#xa0;al., 2016</xref>), and oftentimes the loss of morphological structures such as leaves and roots (<xref ref-type="bibr" rid="B56">Leake, 1994</xref>).</p>
<p>
<italic>Corallorhiza</italic> is a North American, temperate genus comprising 12 species of morphologically reduced, mycoheterotrophic orchids for which varying states of plastome degradation and inferred photosynthetic ability have been characterized (<xref ref-type="bibr" rid="B2">Barrett and Freudenstein, 2008</xref>; <xref ref-type="bibr" rid="B125">Zimmer et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B13">Cameron et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B1">Barrett and Davis, 2012</xref>; <xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>, <xref ref-type="bibr" rid="B7">Barrett et al, 2018</xref>). The presence of relatively intact plastomes containing the expected repertoire of housekeeping genes in <italic>Corallorhiza</italic> species evidences the clade as a group of early transitional mycoheterotrophs, in contrast with late-stage mycoheterotrophic species which have highly degraded plastomes and lack many or all plastid housekeeping genes, such as <italic>Epipogium</italic> and <italic>Gastrodia</italic> species (<italic>sensu</italic> <xref ref-type="bibr" rid="B1">Barrett and Davis, 2012</xref>; see also <xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>). <italic>Corallorhiza</italic> species parasitize Basidiomycete fungi that are engaged in mycorrhizal relationships with nearby autotrophic plants, predominantly in the families Russulaceae and Thelephoraceae (<xref ref-type="bibr" rid="B102">Taylor and Bruns, 1997</xref>; <xref ref-type="bibr" rid="B3">Barrett et&#xa0;al., 2010</xref>; <xref ref-type="bibr" rid="B37">Freudenstein and Barrett, 2014</xref>; <xref ref-type="bibr" rid="B103">Taylor et&#xa0;al., 2022</xref>). In addition to their relationship with fungi, a conspicuous characteristic of <italic>Corallorhiza</italic> species is the complete loss of both leaf laminae and roots. A recurrent theme of morphological reduction has been documented across parasitic and mycoheterotrophic plant lineages (see <xref ref-type="bibr" rid="B56">Leake, 1994</xref>), and work has recently focused on genomic content and gene expression in mycoheterotrophic species in order to improve our understanding of the genomic precursors and consequences of this trophic transition (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B111">Wicke et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B41">Graham et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B119">Zhang et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B115">Yuan et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B11">Cai, 2023</xref>; <xref ref-type="bibr" rid="B106">Timilsena et&#xa0;al., 2023</xref>).</p>
<p>The integrity of the plastome of parasitic and mycoheterotrophic plants is of particular interest as reduction in gene content, increased number of pseudogenes, structural variation, and a reduction in overall genome length have been found to correlate with the degree of external carbon reliance among parasitic angiosperms. Generally, disruptions to the genome such as double-strand DNA breaks are harmful to the organism, and many mechanisms that help to protect against such occurrences have evolved throughout the Tree of Life (<xref ref-type="bibr" rid="B108">Waterworth et&#xa0;al., 2011</xref>). One such genome stabilizing mechanism that is increasingly recognized for its involvement in processes such as plastome double-strand break repair is the activity of the Whirly family of transcription factors (<italic>WHY</italic>; <xref ref-type="bibr" rid="B26">Desveaux et&#xa0;al., 2005</xref>).</p>
<p>Transcription factors are regulatory gene products that function by binding to DNA (<xref ref-type="bibr" rid="B55">Latchman, 1993</xref>). The Whirly family comprises <italic>WHY1</italic>, <italic>WHY2</italic>, and <italic>WHY3</italic>, which are three plant-specific, nuclear-encoded genes with DNA-binding domains, that are named for their whirligig-like structural conformation (<xref ref-type="bibr" rid="B24">Desveaux et&#xa0;al., 2002</xref>; <xref ref-type="bibr" rid="B14">Cappadocia et&#xa0;al., 2010</xref>). Crystal structures of the Whirly transcription factors have been determined as tetramers that have a single-stranded DNA-binding domain that spans two subunits (<xref ref-type="bibr" rid="B15">Cappadocia et&#xa0;al., 2013</xref>). Of particular interest is <italic>WHY1</italic>, the product of which has been implicated to play roles in several processes including mediation of abiotic stressors (<xref ref-type="bibr" rid="B123">Zhuang et&#xa0;al., 2019</xref>, <xref ref-type="bibr" rid="B124">Zhuang et al, 2020</xref>; <xref ref-type="bibr" rid="B96">Ruan et&#xa0;al., 2022</xref>), induction of double-strand DNA break repair (<xref ref-type="bibr" rid="B14">Cappadocia et&#xa0;al., 2010</xref>), and plastid biogenesis (<xref ref-type="bibr" rid="B86">Prikryl et&#xa0;al., 2008</xref>). Transcription factors (TF) are crucial to many regulatory and developmental processes, which is reflected in the massive expansions of many TF gene families in plants (<xref ref-type="bibr" rid="B58">Lehti-Shiu et&#xa0;al., 2017</xref>). Our present understanding of TF evolution has largely been informed through the investigations focused on understanding their roles in morphological or ecological diversification (see <xref ref-type="bibr" rid="B22">de Mendoza et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B53">Lai et&#xa0;al., 2020</xref>) rather than how they change in systems, which have undergone coincident extreme loss of morphological and genomic features.</p>
<p>
<italic>WHY1</italic> has been shown to dually localize to both plastids and the nucleus (<xref ref-type="bibr" rid="B52">Krause et&#xa0;al., 2005</xref>; <xref ref-type="bibr" rid="B40">Grabowski et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B86">Prikryl et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B45">Isemer et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B93">Ren et&#xa0;al., 2017</xref>). In chloroplasts, <italic>WHY1</italic> localizes to the boundary between the thylakoid and nucleoid membrane in chloroplasts and has been implicated in retrograde signaling regulating H<sub>2</sub>O<sub>2</sub> homeostasis and as a coordinator of photosynthetic gene expression (<xref ref-type="bibr" rid="B60">Lepage et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B36">Foyer et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B64">Lin et&#xa0;al., 2019</xref>). Species that have undergone the transition to heterotrophy experience elevated levels of oxidative stress compared to autotrophic relatives (<xref ref-type="bibr" rid="B100">Suetsugu et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B54">Lallemand et&#xa0;al., 2019</xref>). Additionally, <italic>WHY1</italic> proteins stabilize plastid genomes by non-specific binding to the genome, which protects against microhomology-mediated DNA rearrangements, including deletions and duplications of sequences (<xref ref-type="bibr" rid="B69">Mar&#xe9;chal et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B60">Lepage et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B116">Zampini et&#xa0;al., 2015</xref>).</p>
<p>
<italic>WHY1</italic> is involved in complex roles in both plant defense responses and genomic stabilization. For example, mutations which reduce the binding affinity of <italic>WHY1</italic> correlate with increased infection by some pathogens (<xref ref-type="bibr" rid="B27">Desveaux et&#xa0;al., 2004</xref>). <italic>WHY1</italic> has been shown to bind to specific DNA promoter regions that can induce transcription (<xref ref-type="bibr" rid="B124">Zhuang et&#xa0;al., 2020</xref>), aid in defense response signaling and accumulation of disease resistance (<xref ref-type="bibr" rid="B27">Desveaux et&#xa0;al., 2004</xref>, <xref ref-type="bibr" rid="B26">Desveaux et al, 2005</xref>), and also maintain telomere length in the nuclear genome (<xref ref-type="bibr" rid="B26">Desveaux et&#xa0;al., 2005</xref>). Recently, <italic>WHY1</italic> has even been shown to be capable of negatively regulating the RNA interference response to two geminiviruses (<xref ref-type="bibr" rid="B101">Sun et&#xa0;al., 2023</xref>). Taken together, the literature suggests that modulation of <italic>WHY1</italic> expression can result in tradeoffs between plant defense and genomic stability.</p>
<p>The roles that <italic>WHY1</italic> plays in stabilizing both the nuclear and plastid genomes (<xref ref-type="bibr" rid="B114">Yoo et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B116">Zampini et&#xa0;al., 2015</xref>), defense response (<xref ref-type="bibr" rid="B26">Desveaux et&#xa0;al., 2005</xref>), and chloroplast development (<xref ref-type="bibr" rid="B87">Qiu et&#xa0;al., 2022</xref>) are central to our choice to study the evolution of this transcription factor. In particular, experimental work demonstrating plastome destabilization (<xref ref-type="bibr" rid="B60">Lepage et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B116">Zampini et&#xa0;al., 2015</xref>), albinism, and variegation in <italic>WHY1</italic> mutants (<xref ref-type="bibr" rid="B86">Prikryl et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B93">Ren et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B87">Qiu et&#xa0;al., 2022</xref>) makes the gene a compelling target given the reduction in plastome content and length observed across parasitic angiosperms (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B111">Wicke et&#xa0;al., 2016</xref>). To date, work characterizing the sequence and expression diversity of <italic>WHY1</italic> has been restricted to model or agricultural systems (<xref ref-type="bibr" rid="B15">Cappadocia et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B96">Ruan et&#xa0;al., 2022</xref>; <xref ref-type="bibr" rid="B103">Taylor et&#xa0;al., 2022</xref>) and no phylo-comparative investigations of sequence evolution and selective regime have been conducted. The implication of <italic>WHY1</italic> in processes associated with the mycoheterotrophic condition frames a phylogenetically informed investigation of <italic>WHY1</italic> along a trophic gradient as an important step in improving our understanding of the evolution of plastid-targeted TFs in heterotrophic plant lineages.</p>
<p>We focus on four species, <italic>C</italic>. <italic>trifida</italic>, <italic>C. striata</italic>, <italic>C. wisteriana</italic>, and <italic>C. maculata</italic>, which together comprise an early transitional trophic gradient to full mycoheterotrophy with sister relationships among partial and full mycoheterotrophs (<xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>). The well-characterized phylogenetic relationships between these four <italic>Corallorhiza</italic> species (<xref ref-type="bibr" rid="B7">Barrett et&#xa0;al., 2018</xref>) provide a powerful framework upon which to investigate <italic>WHY1</italic> evolution during the early stages of transition to full mycoheterotrophy while accounting for phylogenetic non-independence (<italic>sensu</italic> <xref ref-type="bibr" rid="B34">Felsenstein, 1985</xref>). We consider <italic>C</italic>. <italic>trifida</italic> and <italic>C. wisteriana</italic> to be partial mycoheterotrophs, as their tissues contain measurable chlorophyll content and their plastomes are the most genetically intact plastid genomes in the genus, although photosynthesis has only been directly observed in <italic>C</italic>. <italic>trifida</italic> (<xref ref-type="bibr" rid="B125">Zimmer et&#xa0;al., 2008</xref>; <xref ref-type="bibr" rid="B13">Cameron et&#xa0;al., 2009</xref>; <xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>, <xref ref-type="bibr" rid="B7">Barrett et al, 2018</xref>). Conversely, we consider <italic>C. maculata</italic> and <italic>C. striata</italic> to be fully mycoheterotrophic, evidenced by their highly reduced chlorophyll content and degradation of many photosynthesis-related genes (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>, <xref ref-type="bibr" rid="B7">Barrett et al, 2018</xref>).</p>
<fig id="f1" position="float">
<label>Figure&#xa0;1</label>
<caption>
<p>Overview of <italic>Corallorhiza</italic> species included in this study, showing plastid and nuclear phylogenetic relationships, inflorescence, trophic status (fully vs. partially mycoheterotrophic), plastome size (bp), number of putatively functional plastid genes, and chlorophyll content (mean and standard deviation in nanograms of total chlorophylls per milligram of plant material). Phylogenetic, plastome, and chlorophyll content data are from Barrett et&#xa0;al. (2014).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g001.tif"/>
</fig>
<p>Here, we characterize the evolution of <italic>WHY1</italic> across a mycoheterotrophic gradient, framed by phylogenetic context provided by the broadest taxonomic sampling of the gene to date, including late-stage mycoheterotrophic orchid species from the genera <italic>Epipogium</italic> and <italic>Gastrodia</italic>. Our integrative work leverages a combination of novel and publicly available data generated from DNA sequencing, RNA sequencing (RNA-seq), and Oxford Nanopore sequencing from 110 angiosperm species representing 21 orders. Taken together, the results of our analyses of <italic>WHY1</italic> sequence, expression, splicing, and selective regime across both a trophic gradient and the angiosperms more broadly suggest that the gene may play a critical role in maintaining plastome stability after the transition to mycoheterotrophy.</p>
</sec>
<sec id="s2" sec-type="materials|methods">
<label>2</label>
<title>Materials and methods</title>
<sec id="s2_1">
<label>2.1</label>
<title>Publicly available sequences</title>
<p>Annotated <italic>WHY1</italic> sequences were obtained from the nucleotide and Ref-seq National Center for Biotechnology Information (NCBI) databases and Orchidstra, an orchid-specific database (<xref ref-type="bibr" rid="B17">Chao et&#xa0;al., 2017</xref>). Only <italic>WHY1</italic> sequences containing canonical <italic>WHY1</italic> ORFs (open reading frames) were retained. Additionally, sequences were excluded if they did not contain the ssDNA-binding region (KGKAAL; <italic>A. thaliana</italic> Q9M9S3) as reported by Cappadocia et&#xa0;al. (2013; PDB 4KOO). Identical sequences were excluded for taxa with multiple database accessions. Stop codons were trimmed from sequences, with the exception of premature stop codons in sequences from known mycoheterotrophs, which were changed to gap characters (-) for compatibility with downstream methods (e.g., HyPhy, see below). In total, 110 species were included in the angiosperm-wide alignment (see Alignment section below; <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S1</bold>
</xref>).</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>RNA-seq, <italic>de-novo</italic> assembly of transcripts, and <italic>in-silico</italic> differential expression</title>    <p>
<italic>Corallorhiza</italic> tissues used for RNA-seq are those referred to in <xref ref-type="bibr" rid="B99">Sinn and Barrett (2020)</xref>, where complete methodological details can be found. In brief, total RNAs were extracted from pooled tissues using the ZR Plant RNA MiniPrep Kit (Zymo Research, Irvine, California, USA), leveraging a DNA exclusion column and a DNase digestion step. RNA extractions were conducted in an area where both DNA and RNase contamination were actively managed, the later with both RNase Away and RNase Zap (Thermo Fisher Scientific, Waltham, Massachusetts, USA). Pooled tissues were categorized as either aboveground (combined stem, flower, and ovary tissues) or belowground (rhizome tissue, including fungal tissue), and three biological replicates of each tissue type were extracted for all four species. Extracted RNAs were quantified on a 2100 Bioanalyzer (Agilent, Santa Clara, California, USA) and a NanoDrop spectrophotometer (Thermo Fisher Scientific, Waltham, Massachusetts, USA). Library construction was conducted at the West Virginia University Genomics Core Facility using TruSeq Stranded mRNA kit (Illumina, San Diego, California, USA). Libraries were sequenced on the Illumina HiSeq 1500 platform at the Marshall University Genomics Core Facility, with the exception of aboveground <italic>C</italic>. <italic>maculata</italic> (two samples) and belowground <italic>C</italic>. <italic>striata</italic> (two samples) for which library preparation of limited material was not successful.</p>
<p>
<italic>De-novo</italic> assembly of transcripts from each <italic>Corallorhiza</italic> species was conducted using Trinity (version v2.13.2, <xref ref-type="bibr" rid="B39">Grabherr et&#xa0;al., 2011</xref>). Reads were trimmed using Trimmomatic (version 0.36, <xref ref-type="bibr" rid="B8">Bolger et&#xa0;al., 2014</xref>) with default settings to remove sequencing adapters and low-quality bases from the read ends. Trinity was provided with a samples file with biological replicate relationships, and strand-specific (SS) library type was set to reverse-forward (RF). Trimmed reads were mapped to each assembled transcript using the splice-aware read mapper BBMAP (version 38.96; <xref ref-type="bibr" rid="B10">Bushnell, 2022</xref>) with default settings, with mapped reads output to SAM format and converted to sorted BAM-formatted files using SAMtools (version 1.15; <xref ref-type="bibr" rid="B62">Li et&#xa0;al., 2009</xref>).</p>
<p>
<italic>In-silico</italic> analysis of differential expression was conducted using scripts provided as components of the Trinity RNA-seq pipeline (<xref ref-type="bibr" rid="B43">Haas et&#xa0;al., 2013</xref>). Transcript abundance was estimated using the alignment-free estimation method as implemented in Salmon (version 1.2.0, <xref ref-type="bibr" rid="B80">Patro et&#xa0;al., 2017</xref>). The strand-specific library type was set to RF. Salmon output files included transcript abundance estimates at both the transcript and gene level. Matrices were built using the abundance_estimates_to_matrix.pl script for both transcript counts and gene expression. Differential expression analysis was run using the R (<xref ref-type="bibr" rid="B91">R Core Team, 2019</xref>) package edgeR (Bioconductor version 3.10, <xref ref-type="bibr" rid="B94">Robinson et&#xa0;al., 2010</xref>) via the run_DE_analysis.pl script, which identified differential expression using biological replicates of tissues across the replicate conditions aboveground and belowground. TPM (transcripts per million) values were normalized using the TMM (trimmed mean of M-values; <xref ref-type="bibr" rid="B95">Robinson and Oshlack, 2010</xref>) approach in order to normalize expression values while maintaining comparability of expression between samples.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Identification of <italic>WHY1</italic> transcripts</title>
<p>We used the HMMER suite (version 3.3.2; <xref ref-type="bibr" rid="B31">Eddy, 2011</xref>) to create a <italic>WHY1</italic> hidden Markov model (HMM) profile. The HMM profile was built with hmmbuild, which used the complete angiosperm-wide <italic>WHY1</italic> nucleotide alignment. All assembled <italic>Corallorhiza</italic> transcripts were then searched against the HMM model using nhmmer (<xref ref-type="bibr" rid="B110">Wheeler and Eddy, 2013</xref>). Default parameters were used for both programs. All assembled isoforms of a transcript identified with the highest <italic>E</italic>-value were considered as potential splicing variants of <italic>WHY1</italic>.</p>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>PCR amplification</title>
<p>Primers for amplification of genomic <italic>WHY1</italic> sequence were designed using the Geneious Prime (version 2020.2.4, <ext-link ext-link-type="uri" xlink:href="https://www.geneious.com">https://www.geneious.com</ext-link>) plugin for Primer3 (version 2.3.7; <xref ref-type="bibr" rid="B107">Untergasser et&#xa0;al., 2012</xref>). Oligos were synthesized by Integrated DNA Technologies (Coralville, Iowa, USA). Genomic sequences presented here were amplified using a forward primer (WHY1_upstreamF: TTC AAA TCG AAG AGT AAA CTA ACC) whose 3&#x2019; end binds five nucleotides upstream of exon 1 and a reverse primer (WHY1_exon2R: TTT GGC TCA ACT GAT AGA GC), which binds in the downstream portion of exon 2. PCR amplification of <italic>WHY1</italic> for each <italic>Corallorhiza</italic> species was performed on CTAB extractions (<xref ref-type="bibr" rid="B29">Doyle and Doyle, 1987</xref>) of total DNA. PCR was conducted in 25 &#x3bc;l volumes, comprising 12.5 &#x3bc;l of Apex Taq RED Master Mix (Genesee Scientific; Morrisville, North Carolina, USA), 1.25 &#x3bc;l of each forward and reverse primer, 9 &#x3bc;l of water, and 1 &#x3bc;l of template DNA. PCR was conducted in a Bio-Rad T100 Thermocycler (Hercules, California, USA) using the following program: initial template denaturation at 95&#xb0;C for 3 min, followed by 30 cycles of denaturation at 95&#xb0;C for 30 s, primer annealing at 52&#xb0;C for 30 s, and template extension at 72&#xb0;C for 30 s. The program ended with a final extension at 72&#xb0;C for 5 min and was held at 4&#xb0;C until retrieval. PCR cleanup was performed with Agencourt AMPure XP PCR Purification beads (Beckman Coulter Life Sciences; Indianapolis, Indiana, USA). Purified DNA samples were quantified using a NanoDrop One<sup>C</sup> spectrophotometer (Thermo Fisher Scientific) and a Qubit fluorometer (Thermo Fisher Scientific) with the dsDNA BR Assay Kit.</p>
</sec>
<sec id="s2_5">
<label>2.5</label>
<title>Nanopore sequencing</title>
<p>Purified DNA samples were diluted to equimolar concentrations and libraries for long-read sequencing were prepared according to the Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) End-Prep protocol (SQK-LSK109). The library of each <italic>Corallorhiza</italic> species received a unique barcode for Nanopore sequencing using the Native Barcoding Expansion 1&#x2013;12, PCR-free kit (EXP-NBD104). The MinION SpotON flow cell (R9.4.1 FLO-MIN 106; ONT) was used for sequencing. Base calling was performed using the high-accuracy base calling algorithm as implemented in the GPU version of Guppy (version 6.2.1 + 6588110a6; ONT) on an NVIDIA GeForce RTX 2060 graphics card. Nanopore reads were mapped to the <italic>Dendrobium catenatum</italic> (RefSeq ID: GCF_001605985.2; <xref ref-type="bibr" rid="B120">Zhang et&#xa0;al., 2016</xref>) <italic>WHY1</italic> genomic sequence in Geneious Prime using Minimap2 (version 2.17, <xref ref-type="bibr" rid="B61">Li, 2018</xref>) with a K-Mer length set to 15.</p>
</sec>
<sec id="s2_6">
<label>2.6</label>
<title>Alignment</title>
<p>We aligned all <italic>Corallorhiza WHY1</italic> isoforms using MAFFT (version 7.3.10; <xref ref-type="bibr" rid="B49">Katoh and Standley, 2013</xref>), and the E-INS-i algorithm (<xref ref-type="bibr" rid="B48">Katoh et&#xa0;al., 2002</xref>) and the 1PAM scoring matrix, as a Geneious Prime plugin. Visualization and structural annotation against the canonical <italic>WHY1</italic> sequence of <italic>Arabidopsis thaliana</italic> (NCBI Q9M9S3) was also conducted using Geneious Prime. Consensus was determined as majority consensus with a 0% threshold, meaning no minimum frequency was required for a consensus character if the character was shared by most sequences. All <italic>Corallorhiza WHY1</italic> isoforms were translated in Geneious Prime to amino acid sequence and manually trimmed to the correct ORF. ORF-trimmed <italic>WHY1</italic> translations were aligned with the ORF-trimmed <italic>WHY1</italic> sequence of <italic>Arabidopsis thaliana</italic> (NCBI NM101308) to verify the presence of the expected canonical reading frame.</p>
<p>A translation-aware alignment of the canonical form of <italic>WHY1</italic> representing lineages across the angiosperms was generated using a two-step process. Nucleotides were first aligned and translated using the frameshift-aware aligner MASCE2 (version 2.0.6; <xref ref-type="bibr" rid="B88">Ranwez et&#xa0;al., 2011</xref>) with default parameters, which inserts gap characters necessary to maintain codon-based statements of homology across the alignment. This approach was necessary due to the presence of frameshift mutations in sequences from <italic>Gastrodia elata</italic>, <italic>Epipogium aphyllum</italic>, and <italic>C</italic>. <italic>striata</italic>. The MACSE2-processed nucleotide and amino acid alignments were then refined using MAFFT and the E-INS-i algorithm with a BLOSUM 80 substitution matrix.</p>
<p>An alignment including the complete genomic sequences of <italic>WHY1</italic> from <italic>Phalaenopsis equestris</italic> (ASM126359v1) and <italic>Dendrobium catenatum</italic> (ASM160598v2) was also generated. <italic>Phalaenopsis equestris</italic> (<xref ref-type="bibr" rid="B12">Cai et&#xa0;al., 2015</xref>) and <italic>D</italic>. <italic>catenatum</italic> (<xref ref-type="bibr" rid="B120">Zhang et&#xa0;al., 2016</xref>) are the closest relatives of <italic>Corallorhiza</italic> with sequenced genomes (<xref ref-type="bibr" rid="B18">Chen et&#xa0;al., 2022</xref>). This DNA alignment was generated to identify the introns of <italic>WHY1</italic> and to evaluate the exonic content of assembled transcripts. All <italic>Corallorhiza</italic> isoforms and <italic>Corallorhiza</italic> Nanopore consensus sequences were aligned with the sequences of <italic>P</italic>. <italic>equestris</italic> and <italic>D</italic>. <italic>catenatum</italic> using MAFFT (version 7.3.10; <xref ref-type="bibr" rid="B49">Katoh and Standley, 2013</xref>), as a Geneious Prime plugin, and the E-INS-i algorithm (<xref ref-type="bibr" rid="B48">Katoh et&#xa0;al., 2002</xref>).</p>
</sec>
<sec id="s2_7">
<label>2.7</label>
<title>Visual screening for amino acid substitutions</title>
<p>Nonsynonymous substitutions within the angiosperm <italic>WHY1</italic> alignment were surveyed visually in our amino acid alignments via Geneious Prime. Substitutions of interest included those present exclusively in <italic>Corallorhiza</italic> species, mycoheterotrophs, and the Orchidaceae. The codons of <italic>A</italic>. <italic>thaliana</italic> (Q9M9S3) and <italic>P</italic>. <italic>equestris</italic> that corresponded to the sites of nonsynonymous substitutions in <italic>WHY1</italic> of interest were cross-referenced with the annotated <italic>A</italic>. <italic>thaliana WHY1</italic> sequence for structure and the <italic>P</italic>. <italic>equestris WHY1</italic> genomic sequence as included in the DNA alignment for exon location.</p>
</sec>
<sec id="s2_8">
<label>2.8</label>
<title>Phylogenetic inference</title>
<p>IQ-TREE (version 1.6.12; <xref ref-type="bibr" rid="B77">Nguyen et&#xa0;al., 2015</xref>) was used to infer phylogenetic relationships among the recovered <italic>WHY1</italic> sequences using automated model choice (<xref ref-type="bibr" rid="B47">Kalyaanamoorthy et&#xa0;al., 2017</xref>), optimal partitioning assessment (<xref ref-type="bibr" rid="B20">Chernomor et al., 2016</xref>), and nearest neighbor interchange search enabled. Node support was estimated using 1,000 ultrafast bootstrap approximation replicates (<xref ref-type="bibr" rid="B44">Hoang et&#xa0;al., 2018</xref>). Two partitions were defined, which comprise the signal peptide and the highly variable 5&#x2019; portion of the chain (positions 1&#x2013;516) and the highly conserved chain region (517&#x2013;1,122), identified using functional annotations on <italic>WHY1</italic> sequence per <italic>A. thaliana</italic> (Q9M9S3). The <italic>Amborella trichopoda WHY1</italic> sequence was used for phylogram rooting.</p>
</sec>
<sec id="s2_9">
<label>2.9</label>
<title>Selection analyses</title>
<p>The <italic>WHY1</italic> nucleotide alignment was tested for statistically significant changes in selection regime using four methods implemented in the command-line, multithreaded version of the Hypothesis Testing using Phylogenies suite (HyPhy; version 2.5.39; <xref ref-type="bibr" rid="B84">Pond and Muse, 2005</xref>). We used the Genetic Algorithm for Recombination Detection (GARD; <xref ref-type="bibr" rid="B85">Pond et&#xa0;al., 2006</xref>), with default parameters, to test for signal of recombination breakpoints within <italic>WHY1</italic>. We tested for significant change of selective regime using five test branch sets against null reference branch sets comprising all other species in the phylogram: (1) <italic>Corallorhiza</italic> species; (2) <italic>C</italic>. <italic>maculata</italic> + <italic>C</italic>. <italic>striata</italic>; (3) <italic>Corallorhiza</italic> + <italic>Epipogium aphyllum + Gastrodia elata</italic>; (4) <italic>C</italic>. <italic>maculata</italic> + <italic>C</italic>. <italic>striata</italic> + <italic>E</italic>. <italic>aphyllum + G</italic>. <italic>elata</italic>; (5) <italic>E</italic>. <italic>aphyllum</italic> + <italic>G</italic>. <italic>elata</italic>. The same <italic>Amborella trichopoda</italic>-rooted maximum likelihood <italic>WHY1</italic> topology inferred with IQ-tree was used for all analyses.</p>
<p>RELAX (<xref ref-type="bibr" rid="B109">Wertheim et&#xa0;al., 2015</xref>) was used to test for evidence of relaxed selection. RELAX breaks each codon into its three component sites, each with an assigned omega class. Values for omega are calculated as Dn/Ds ratios, using the calculation for the reference branches as the null hypothesis. The value <italic>k</italic> is the selection intensity parameter and is an exponent value on omega. The alternative model fits a value for k that changes the rate to fit with the test branches. Evidence for intensified selection strength along test branches is indicated by a significant result where the value of <italic>k</italic> is greater than 1 (<italic>k</italic> &gt; 1, <italic>P</italic> &lt; 0.05). Evidence for relaxed selection along test branches is indicated by a significant value of <italic>k</italic> less than 1 (<italic>k</italic> &lt; 1, <italic>P</italic> &lt; 0.05). Strength of selection was assessed simultaneously for all species using Fast Unconstrained Bayesian Approximation (FUBAR, <xref ref-type="bibr" rid="B74">Murrell et&#xa0;al., 2013</xref>). FUBAR can detect weak, yet pervasive, purifying or diversifying selection at the codon level (posterior probability &gt;0.90) without the use of test and reference branch sets.</p>
<p>The Branch-Site Unrestricted Statistical Test for Episodic Diversification (BUSTED) was used to test for evidence of gene-wide positive selection (<xref ref-type="bibr" rid="B75">Murrell et&#xa0;al., 2015</xref>). BUSTED uses three omega classes defined as &#x3c9;1 &#x2264; &#x3c9; 2 &#x2264; 1 &#x2264; &#x3c9;3. The &#x3c9;1 class is the proportion of sites with a very low Dn/Ds ratio. The &#x3c9;2 class is the proportion of sites just below 1, and &#x3c9;3 are sites above 1. A value of 1 suggests selective neutrality, and therefore defines the null, or constrained, model. BUSTED then calculates the log-likelihood of the data for each of the null and alternative models. These ratios are calculated for each site and are called evidence ratios (ERs). They are used as a threshold (&#x3c7;<sup>2</sup> distribution, <italic>P</italic> &lt; 0.01) but are not a valid test for site-specific likelihood. The null model is rejected if at least one site on a test branch experienced positive selection. Evidence for positive, or diversifying, selection in a gene of a test branch is indicated by a rejection of the null model.</p>
<p>The adaptive branch-site random effects likelihood (aBSREL) test was used to test for signal of episodic positive selection. aBSREL infers &#x3c9; values at both the level of sites and branches and can account for rate heterogeneity inherent to complex evolutionary scenarios by partitioning these values into multiple rate classes per branch. aBSREL was conducted in exploratory mode, where all branches were tested and <italic>p</italic>-values were Holm&#x2013;Bonferroni corrected, and in the more sensitive <italic>a priori</italic> mode to test for episodic positive selection in each test branch set.</p>
</sec>
</sec>
<sec id="s3" sec-type="results">
<label>3</label>
<title>Results</title>
<sec id="s3_1">
<label>3.1</label>
<title>
<italic>De-novo</italic> assembly and identification of <italic>WHY1</italic>
</title>
<p>The hmmer suite revealed a single, Trinity-identified gene model and its isoforms as representing <italic>WHY1</italic> from <italic>de-novo</italic> assembled transcripts for each <italic>Corallorhiza</italic> species. Trinity assembled a single isoform representing the expected canonical CDS (coding DNA sequence) of <italic>WHY1</italic> for all but <italic>C. striata</italic>, the latest stage fully mycoheterotrophic species of <italic>Corallorhiza</italic> sampled. However, mapping reads to each isoform with BBMAP revealed that a five nucleotide indel, the absence of which results in a premature stop codon, was differentially present or absent in the reads of each <italic>Corallorhiza</italic> species (NCBI BioProject PRJNA984634). Trinity differentially incorporated that indel, hereafter referred to as the GTGAA indel, into the isoform pool of each <italic>Corallorhiza</italic> species. The absence of the GTGAA indel in Trinity-assembled isoforms of <italic>C</italic>. <italic>striata</italic> precluded the recovery of the canonical ORF, but read mapping supports the presence of the five nucleotides necessary to recover the expected ORF in <italic>C</italic>. <italic>striata</italic>, at a rate of 67.2% of reads in isoform 2 and 76.1% of reads in isoform 4. Those data support that the canonical variant of <italic>WHY1</italic> is also expressed. We analyzed the <italic>C</italic>. <italic>striata</italic> isoforms as assembled, rather than manually modifying the <italic>C</italic>. <italic>striata</italic> transcripts to conform to a hypothesized canonical sequence. The inclusion of those five nucleotides would not alter any results presented here, aside from whether a canonical <italic>WHY1</italic> isoform is transcribed in <italic>C</italic>. <italic>striata</italic>.</p>
<p>The canonical <italic>WHY1</italic> ORF as assembled in <italic>C</italic>. <italic>trifida</italic>, <italic>C. wisteriana</italic>, and <italic>C. maculata</italic> is 795 nucleotides in length and comprises 265 amino acids, two fewer amino acids than that of <italic>A</italic>. <italic>thaliana</italic>. Non-canonical splicing variants were found in the isoform pools of each <italic>Corallorhiza</italic> species. A splicing variant containing a 79-nucleotide long sequence of 96.6% mean pairwise identity was recovered from <italic>C</italic>. <italic>wisteriana</italic>, <italic>C. maculata</italic>, and <italic>C. striata</italic>, at nucleotide position 232. The second major variant is the GTGAA indel discussed earlier, which is variously present at nucleotide position 514 in all four species. A third variant identified from the isoform pool of <italic>C</italic>. <italic>striata</italic> represents a modification of the 3&#x2019; end of the ORF, immediately downstream of the previously discussed five-nucleotide indel that was variously present in our read pools throughout <italic>Corallorhiza</italic>.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Alignment</title>
<p>MAFFT alignment of all raw Trinity-assembled <italic>Corallorhiza WHY1</italic> isoforms resulted in a matrix of 1,302 positions. Pairwise percent identity across isoforms was 81.4% and 88.2% and gaps comprised 11.7% and 8.1% of character states for all isoforms and with the exclusion of the early terminating isoform 1 of <italic>C</italic>. <italic>striata</italic>, respectively. Alignment of ORF-trimmed transcripts resulted in a matrix of 874 positions (<xref ref-type="fig" rid="f2">
<bold>Figure&#xa0;2</bold>
</xref>). Percent pairwise identity among isoforms within a species was highest for <italic>C</italic>. <italic>trifida</italic> (99.5%) and lowest for <italic>C</italic>. <italic>striata</italic> (70.6%). A 79-nucleotide-long indel of 96.6% pairwise identity was identified in at least one isoform assembled from all <italic>Corallorhiza</italic> species except <italic>C</italic>. <italic>trifida</italic>. Translation-aware alignment of ORF-trimmed canonical <italic>Corallorhiza WHY1</italic> sequences, with the differentially present GTGAA indel manually inserted into the otherwise canonical sequence of <italic>C</italic>. <italic>striata</italic>, resulted in an alignment of 795 characters of 98.0% pairwise identity and no gaps. The highest pairwise percent identity was 99.37% between <italic>C</italic>. <italic>wisteriana</italic> and <italic>C. maculata</italic> and the lowest was 96.98% between <italic>C</italic>. <italic>trifida</italic> and <italic>C. striata</italic>.</p>
<fig id="f2" position="float">
<label>Figure&#xa0;2</label>
<caption>
<p>Wrapped view of open reading frame-trimmed <italic>Corallorhiza WHY1</italic> isoform alignment. Disagreements with the consensus sequence are highlighted. An <sup>*</sup> denotes a canonical isoform, while <sup>&#x3c8;</sup> denotes an isoform with a retained intron.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g002.tif"/>
</fig>
<p>The <italic>WHY1</italic> genomic DNA alignment of <italic>Corallorhiza</italic> isoforms, <italic>Corallorhiza</italic> Nanopore sequences, and <italic>P. equestris</italic> and <italic>D. catenatum</italic> genomic sequences had a total length of 12,000 nucleotides (<xref ref-type="fig" rid="f3">
<bold>Figure&#xa0;3</bold>
</xref>), in which the assembled canonical isoforms from each <italic>Corallorhiza</italic> species contained the expected exons of <italic>WHY1</italic>. One non-canonical isoform of <italic>C</italic>. <italic>wisteriana</italic>, <italic>C</italic>. <italic>maculata</italic>, and <italic>C</italic>. <italic>striata</italic> each contained intron 1, for which pairwise percent identity was 96.6% across those species. Percent pairwise identity of intron 1 between that of <italic>D. catenatum</italic> and <italic>P</italic>. <italic>equestris</italic> was 82.3%, and similarity between retained introns and that of <italic>P</italic>. <italic>equestris</italic> ranged from 78.5% to 82.3% for <italic>C</italic>. <italic>maculata</italic> and <italic>C. striata</italic>, respectively. The Nanopore-sequenced and Trinity-assembled <italic>WHY1</italic> intronic sequences of <italic>C</italic>. <italic>striata</italic> were identical, while those of <italic>C. maculata</italic> were 94.9% similar, due to the relative lack of two thiamine nucleotides in the Nanopore sequence.</p>
<fig id="f3" position="float">
<label>Figure&#xa0;3</label>
<caption>
<p>DNA alignment of canonical isoforms and Nanopore-sequenced genomic sequence from each <italic>Corallorhiza</italic> species with annotated WHY1 sequence from the <italic>Phalaenopsis equestris</italic> and <italic>Dendrobium catenatum</italic> reference genomes. Coding sequences for each reference genome are depicted by yellow annotations. Disagreements with the consensus sequence are shown in black. Percent identity of aligned sites is depicted as a histogram. An * denotes a canonical isoform, while &#x3c8; denotes an isoform with a retained intron.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g003.tif"/>
</fig>
<p>The angiosperm-scale <italic>WHY1</italic> nucleotide alignment contained <italic>WHY1</italic> sequences from 110 species, including 22 orchid species. The total length of the alignment was 1,122 positions, of which 150 were of identical states across all species. Two partitions roughly corresponding to the transit peptide and chain regions as annotated in <italic>A. thaliana</italic> (Q9M9S3) were conspicuously visible in the consensus sequence of the alignment. The first was a highly variable, lineage-specific portion of sequence ranging from positions 1&#x2013;516 in the alignment. The second was a highly conserved portion spanning positions 517&#x2013;1,122. The mean pairwise percent identity of the transit region of Orchidaceae was 51.3%, while that of <italic>Apostasia shenzhenica</italic> compared to either <italic>G</italic>. <italic>elata</italic> or <italic>E</italic>. <italic>aphyllum</italic> was 21.5% and 31.1%, respectively. The mean pairwise percent identity of the transit region of <italic>Corallorhiza</italic> was 94.4%. The mean pairwise percent identity of the four <italic>Corallorhiza</italic> species in the alignment was 97.7% and 81.0% across the Orchidaceae. The grasses had the lowest mean percent pairwise similarity of any clade, which was 49.36% when each sequence was compared to each non-grass species in the alignment.</p>
<p>The angiosperm-wide <italic>WHY1</italic> amino acid alignment comprised 376 positions, of which 48 were of identical states across all species. The two partitions in the nucleotide alignment corresponding to functional annotation in <italic>A. thaliana</italic> (Q9M9S3) were more conspicuous in the amino acid alignment. Positions 1&#x2013;115 and 116&#x2013;376 corresponded to the transit and chain regions of <italic>A</italic>. <italic>thaliana</italic> (Q9M9S3), respectively. The pairwise percent identity of the four <italic>Corallorhiza</italic> species in the alignment was 97.4% for <italic>WHY1</italic>, and across the Orchidaceae it was 77.8%.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Nanopore sequencing</title>
<p>Nanopore sequencing of genomic <italic>WHY1</italic> sequence from three of the four <italic>Corallorhiza</italic> species confirmed that the unique sequence in the non-canonical transcripts of <italic>C. wisteriana</italic>, <italic>C. striata</italic>, and <italic>C. maculata</italic> represented retention of <italic>WHY1</italic> intron 1. Sequencing of <italic>WHY1</italic> amplicons generated read pools ranging from 25,546 to 74,261 reads in <italic>C</italic>. <italic>trifida</italic> and <italic>C</italic>. <italic>maculata</italic>, respectively. Mapping of Nanopore reads against the <italic>D</italic>. <italic>catenatum</italic> genomic <italic>WHY1</italic> sequence resulted in a mean coverage depth for the exon 1&#x2013;2 region ranging from 4,034.9 to 10,617.6 in <italic>C</italic>. <italic>trifida</italic> and <italic>C</italic>. <italic>striata</italic>, respectively (<xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S2</bold>
</xref>). Library preparation for Nanopore sequencing of <italic>C</italic>. <italic>wisteriana WHY1</italic> amplicons was deemed unsuccessful since the resulting sequence pool contained presumably off target reads that precluded confident assembly of a <italic>WHY1</italic> consensus sequence from that sample. Alignment of the Nanopore-sequenced <italic>WHY1</italic> amplicon consensus reads with the <italic>Corallorhiza</italic> transcript isoforms and the full <italic>WHY1</italic> genomic sequences of <italic>P</italic>. <italic>equestris</italic> and <italic>D</italic>. <italic>catenatum</italic> provided evidence that the non-canonical isoforms of <italic>WHY1</italic> in <italic>Corallorhiza</italic> were a result of alternative splicing (<xref ref-type="fig" rid="f4">
<bold>Figure&#xa0;4</bold>
</xref>). At least one non-canonical isoform from all but <italic>C</italic>. <italic>trifida</italic> contains intron 1 of <italic>WHY1</italic>.</p>
<fig id="f4" position="float">
<label>Figure&#xa0;4</label>
<caption>
<p>DNA alignment of <italic>Corallorhiza</italic> isoforms assembled from each <italic>Corallorhiza species</italic> and Nanopore sequences for <italic>C</italic>. <italic>trifida</italic>, <italic>C. maculata</italic>, and <italic>C. striata</italic> with annotated <italic>WHY1</italic> sequence from the <italic>Phalaenopsis equestris</italic> and <italic>Dendrobium catenatum</italic> reference genomes. <italic>WHY1</italic> exons 1 and 2 are annotated in yellow and the locus ID tags for each reference genome are provided in the coding sequence (yellow) annotations. Disagreements with the consensus sequence are highlighted. Percent identity of aligned sites is depicted as a histogram. An <sup>*</sup> denotes a canonical isoform, while <sup>&#x3c8;</sup> denotes an isoform with a retained intron.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g004.tif"/>
</fig>
<p>The sequences obtained from <italic>Corallorhiza</italic> species via Nanopore and RNA-seq shared a high degree of similarity with each other and the <italic>WHY1</italic> sequence in the previously published <italic>P. equestris</italic> and <italic>D. catenatum</italic> genomes. Intron 1 of <italic>WHY1</italic> was found to have a pairwise percent similarity of 82.3% between <italic>P. equestris</italic> and <italic>D. catenatum</italic>. <italic>Corallorhiza WHY1</italic> intron 1 sequences obtained via Nanopore sequencing and RNA-seq had a mean pairwise percent similarity of 96.9%, while that of <italic>C</italic>. <italic>striata</italic> obtained via both RNA-seq and Nanopore sequencing had a pairwise percent similarity of 82.3% with <italic>P. equestris</italic>.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Select amino acid substitutions of interest</title>
<p>Substitutions exclusive to mycoheterotrophic species were evident and, in some cases, were exclusive to fully mycoheterotrophic species. The transit regions of both <italic>E</italic>. <italic>aphyllum</italic> and <italic>G</italic>. <italic>elata</italic> contained some substitutions that were unique within the Orchidaceae. A phenylalanine-glycine residue at positions 76&#x2013;77 was exclusive to <italic>C</italic>. <italic>trifida</italic>, while a leucine-arginine residue was present for the remainder of Orchidaceae at those positions. While pairwise percent identity of positions 76&#x2013;77 was only 13.3% throughout angiosperms, the leucine-arginine residue in the Orchidaceae had a pairwise percent identity of 90.9% when including the residue from <italic>C</italic>. <italic>trifida</italic> and was identical when that taxon was excluded. Amino acid substitutions found in the transit regions of both <italic>C</italic>. <italic>wisteriana</italic> (positions 46 and 95) and <italic>C</italic>. <italic>trifida</italic> (position 113) were not identified in other orchids but were found in some non-orchid autotrophs.</p>
<p>At least one substitution that could impact <italic>WHY1</italic> conformation was identified in <italic>Corallorhiza</italic>. The terminal codon of an alpha helix annotated in the structure of <italic>A</italic>. <italic>thaliana</italic> (9M9S3) <italic>WHY1</italic> (position 336) has been substituted to an isoleucine in <italic>C</italic>. <italic>trifida</italic> and to a phenylalanine in <italic>C</italic>. <italic>wisteriana</italic>, <italic>C. maculata</italic>, and <italic>C. striata</italic> (<xref ref-type="fig" rid="f5">
<bold>Figure&#xa0;5</bold>
</xref>). We infer that the ancestral state of position 336 is leucine, on the basis of that codon state for 89.1% of sampled taxa, including <italic>Amborella trichopoda</italic>. The only other substitutions at that position were to isoleucine, which was identified in eight autotrophic angiosperms outside the Orchidaceae, and in <italic>C</italic>. <italic>trifida</italic>. Additionally, a substitution from alanine to valine at position 359 was exclusive to <italic>C</italic>. <italic>striata</italic>, which represents the only occurrence of that state for this position for 109 other angiosperm species, and the only subgeneric polymorphism observed in that position.</p>
<fig id="f5" position="float">
<label>Figure&#xa0;5</label>
<caption>
<p>Detailed view of amino acid alignment of <italic>WHY1</italic> sequences of orchid species. Note the phenylalanine substitutions unique to <italic>Corallorhiza wisteriana</italic>, <italic>C. maculata</italic>, and <italic>C. striata</italic>, relative to the other species sampled. The amino acid substitutions of interest are located at the 3&#x2019; portion of a region inferred to conform into an alpha helix structure (pink annotation) in <italic>Arabidopsis thaliana WHY1</italic> (alignment row two).</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g005.tif"/>
</fig>
<p>A five-nucleotide indel resulting in a frameshift (amino acid alignment positions 269 and 270) was found in <italic>C</italic>. <italic>striata</italic>, but the pairwise percent identity of downstream sequence with that of <italic>C</italic>. <italic>maculata</italic> was high (97.8%) and reads containing the corresponding five nucleotides were identified in the RNA-seq read pool. In fact, RNA-seq reads containing the indel were found in the read pools of all <italic>Corallorhiza</italic> species, suggesting that isoform diversity was conservatively interpreted by our methods.</p>
<p>We found that the chain region of <italic>E</italic>. <italic>aphyllum</italic> contained a glycine to arginine substitution in the characteristic <italic>WHY1</italic> ssDNA-binding motif (positions 188&#x2013;193), a site that was otherwise conserved throughout the remainder of samples. Additionally, we found that the <italic>E</italic>. <italic>aphyllum</italic> sequence contained a residue comprising seven amino acids (positions 294&#x2013;300), the last of which was a premature stop codon. MAFFT resolved those seven amino acids as an insertion with no homology to other angiosperm sequences. Six substitutions downstream of that premature stop codon are exclusive to <italic>E</italic>. <italic>aphyllum</italic>.</p>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>Phylogenetic inference</title>
<p>Both the Eudicots and Monocots were recovered as monophyletic (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>; BS = 100%). Orchidaceae was recovered as a monophyletic group (BS = 94%), with <italic>D. catenatum</italic>, <italic>G. elata</italic> and <italic>E. aphyllum</italic> as an early diverging paraphyletic grade within the Epidendroids (BS = 98%). However, the positions of <italic>D. catenatum</italic> (BS = 79%) and <italic>G. elata</italic> (BS = 76%) were not robustly supported. <italic>Corallorhiza</italic> species were recovered as a monophyletic group (BS = 100%) with <italic>C</italic>. <italic>trifida</italic> sister to <italic>C</italic>. <italic>striata</italic> (BS = 100%) + (<italic>C</italic>. <italic>wisteriana</italic>, <italic>C</italic>. <italic>maculata</italic>; BS = 100%), rather than those inferred in previous, genomic-scale work depicted in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>.</p>
<fig id="f6" position="float">
<label>Figure&#xa0;6</label>
<caption>
<p>Maximum likelihood inferred topology of <italic>WHY1</italic> evolution with support values derived from 1,000 non-parametric bootstrap replicates. Lineages highlighted in red represent those for which statistically significant signal of episodic diversifying selection of <italic>WHY1</italic> was detected by the adaptive branch-site random effects likelihood test.</p>
</caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-15-1241515-g006.tif"/>
</fig>
</sec>
<sec id="s3_6">
<label>3.6</label>
<title>Differential expression</title>
<p>Canonical isoforms of <italic>WHY1</italic> were most highly expressed in all four species (see <xref ref-type="table" rid="T1">
<bold>Table&#xa0;1</bold>
</xref>), but statistically significant elevated expression in aboveground tissues relative to those belowground was only detected in <italic>C</italic>. <italic>trifida</italic>. Gene-level expression of <italic>WHY1</italic> across both tissues was highest for <italic>C</italic>. <italic>wisteriana</italic> (244.5) and <italic>C. trifida</italic> (175.4) and lowest for <italic>C</italic>. <italic>maculata</italic> (131.9) and <italic>C. striata</italic> (95.3). Expression of the canonical isoform of <italic>WHY1</italic> in <italic>C. trifida</italic> ranged from 4.15 to 9.64 TMM in belowground tissues (median = 7.17 TMM) and from 31.353 to 56.389 TMM in the aboveground tissues (median = 43.273 TMM). The log fold-change value was 2.56 between belowground and aboveground tissues for the canonical isoform of <italic>C. trifida</italic>, with a <italic>P</italic>-value of 0.00018 and a false discovery rate (FDR) of 0.00374. Expression of the <italic>WHY1</italic> nearly canonical isoform in <italic>C. striata</italic> ranged from 5.547 to 16.859 TMM in belowground tissues (median = 12.31 TMM) and 17.28 and 30.538 TMM in aboveground tissues (median = 23.909 TMM). The <italic>C. striata</italic> canonical <italic>WHY1</italic> isoform had a log fold-change value of 0.396 between belowground and aboveground tissues, with a <italic>P</italic>-value of 0.69 and FDR of 0.91. Expression of the canonical isoform of <italic>WHY1</italic> in <italic>C. wisteriana</italic> ranged from 9.256 to 23.023 TMM in belowground tissues (median = 15.663 TMM) and 34.792 to 74.955 TMM in aboveground tissues (median = 60.80 TMM). The <italic>C. wisteriana</italic> canonical isoform had a log fold-change value of 1.96 between belowground and aboveground tissues, with <italic>P</italic>-value of 0.02 and FDR of 0.28. The expression of the canonical isoform in <italic>C. maculata</italic> had values of 11.064 and 13.947 TMM in belowground tissues (median = 12.51 TMM) and ranged from 26.179 to 45.999 TMM in aboveground tissues (median = 33.01 TMM). The <italic>C. maculata</italic> canonical isoform had a log fold-change value of 1.17 between belowground and aboveground tissues, with <italic>P</italic>-value of 0.16 and FDR of 0.62.</p>
<table-wrap id="T1" position="float">
<label>Table&#xa0;1</label>
<caption>
<p>
<italic>In silico</italic> expression of <italic>WHY1</italic> across species, tissues, and biological replicates.</p>
</caption>
<table frame="hsides">
<thead>
<tr>
<th valign="middle" align="left">Species<break/>gene or isoform</th>
<th valign="middle" colspan="3" align="left">Belowground tissue<break/>(biological replicate #)</th>
<th valign="middle" colspan="3" align="left">Aboveground tissue<break/>(biological replicate #)</th>
<th valign="middle" align="left">Log<sub>FC</sub>
</th>
<th valign="middle" align="left">
<italic>P</italic>-value<break/>(<italic>Q</italic>-value)</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="middle" align="left">
<bold>
<italic>C</italic>. <italic>trifida</italic>
</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" colspan="2" align="center"/>
</tr>
<tr>
<td valign="middle" align="left">Gene</td>
<td valign="middle" align="center">12.7</td>
<td valign="middle" align="center">11.6</td>
<td valign="middle" align="center">4.7</td>
<td valign="middle" align="center">31.4</td>
<td valign="middle" align="center">71.0</td>
<td valign="middle" align="center">43.7</td>
<td valign="middle" align="center">2.12</td>
<td valign="middle" align="center">0.004<break/>(0.057)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 1</td>
<td valign="middle" align="center">5.0</td>
<td valign="middle" align="center">1.8</td>
<td valign="middle" align="center">1.0</td>
<td valign="middle" align="center">1.4</td>
<td valign="middle" align="center">2.9</td>
<td valign="middle" align="center">4.3</td>
<td valign="middle" align="center">0.12</td>
<td valign="middle" align="center">0.915 (0.998)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 2<sup>*</sup>
</td>
<td valign="middle" align="center">7.7</td>
<td valign="middle" align="center">9.6</td>
<td valign="middle" align="center">4.1</td>
<td valign="middle" align="center">31.3</td>
<td valign="middle" align="center">56.3</td>
<td valign="middle" align="center">42.0</td>
<td valign="middle" align="center">2.56</td>
<td valign="middle" align="center">0.0001<break/>(0.003)</td>
</tr>
<tr>
<td valign="middle" align="left">
<bold>
<italic>C</italic>. <italic>wisteriana</italic>
</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" colspan="2" align="center"/>
</tr>
<tr>
<td valign="middle" align="left">Gene</td>
<td valign="middle" align="center">14.2</td>
<td valign="middle" align="center">23.0</td>
<td valign="middle" align="center">8.3</td>
<td valign="middle" align="center">84.3</td>
<td valign="middle" align="center">76.7</td>
<td valign="middle" align="center">37.8</td>
<td valign="middle" align="center">1.91</td>
<td valign="middle" align="center">0.019<break/>(0.176)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 1<sup>&#x3c8;</sup>
</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center">0.8</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center">1.0</td>
<td valign="middle" align="center">1</td>
<td valign="middle" align="center">2.0</td>
<td valign="middle" align="center">2.36</td>
<td valign="middle" align="center">0.313<break/>(1)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 2<sup>*</sup>
</td>
<td valign="middle" align="center">14.7</td>
<td valign="middle" align="center">23.0</td>
<td valign="middle" align="center">9.2</td>
<td valign="middle" align="center">74.9</td>
<td valign="middle" align="center">72.6</td>
<td valign="middle" align="center">34.7</td>
<td valign="middle" align="center">1.96</td>
<td valign="middle" align="center">0.021<break/>(0.283)</td>
</tr>
<tr>
<td valign="middle" align="left">
<bold>
<italic>C</italic>. <italic>maculata</italic>
</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>NA</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" colspan="2" align="center"/>
</tr>
<tr>
<td valign="middle" align="left">Gene</td>
<td valign="middle" align="center">16.6</td>
<td valign="middle" align="center">12.7</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">47.0</td>
<td valign="middle" align="center">25.3</td>
<td valign="middle" align="center">30.0</td>
<td valign="middle" align="center">1.17</td>
<td valign="middle" align="center">0.162<break/>(0.624)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 1<sup>&#x3c8;</sup>
</td>
<td valign="middle" align="center">2.0</td>
<td valign="middle" align="center">1.8</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">0.5</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center">3.5</td>
<td valign="middle" align="center">&#x2212;0.51</td>
<td valign="middle" align="center">0.746<break/>(0.970)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 2<sup>*</sup>
</td>
<td valign="middle" align="center">13.9</td>
<td valign="middle" align="center">11.0</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">45.9</td>
<td valign="middle" align="center">26.1</td>
<td valign="middle" align="center">26.8</td>
<td valign="middle" align="center">1.33</td>
<td valign="middle" align="center">0.119 (0.387)</td>
</tr>
<tr>
<td valign="middle" align="left">
<bold>
<italic>C</italic>. <italic>striata</italic>
</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>3</bold>
</td>
<td valign="middle" align="center">
<bold>1</bold>
</td>
<td valign="middle" align="center">
<bold>2</bold>
</td>
<td valign="middle" align="center">
<bold>NA</bold>
</td>
<td valign="middle" colspan="2" align="center"/>
</tr>
<tr>
<td valign="middle" align="left">Gene</td>
<td valign="middle" align="center">21.3</td>
<td valign="middle" align="center">16.4</td>
<td valign="middle" align="center">7.8</td>
<td valign="middle" align="center">32.1</td>
<td valign="middle" align="center">17.3</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">0.62</td>
<td valign="middle" align="center">0.525<break/>(0.804)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 1</td>
<td valign="middle" align="center">0.9</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center">2.3</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">NA</td>
<td valign="middle" align="center">NA</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 2</td>
<td valign="middle" align="center">16.8</td>
<td valign="middle" align="center">14.5</td>
<td valign="middle" align="center">5.5</td>
<td valign="middle" align="center">30.5</td>
<td valign="middle" align="center">17.2</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">0.39</td>
<td valign="middle" align="center">0.695<break/>(0.918)</td>
</tr>
<tr>
<td valign="middle" align="left">Isoform 3<sup>&#x3c8;</sup>
</td>
<td valign="middle" align="center">4.4</td>
<td valign="middle" align="center">0.7</td>
<td valign="middle" align="center">0.5</td>
<td valign="middle" align="center">0.8</td>
<td valign="middle" align="center">0</td>
<td valign="middle" align="center"/>
<td valign="middle" align="center">&#x2212;2.88</td>
<td valign="middle" align="center">0.254<break/>(0.536)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The first row for a species contains values for gene-level expression, while subsequent rows contain values for a specific isoform. Trinity isoform and biological replicate identifiers are provided in <xref ref-type="supplementary-material" rid="SM1">
<bold>Supplementary Table S3</bold>
</xref>. Expression values are trimmed-mean-of-means-transformed (TMM) transcripts per million (TPM) in order to maintain comparability among biological replicates. An <sup>*</sup> denotes a canonical isoform, while <sup>&#x3c8;</sup> denotes an isoform with a retained intron. Values shown have been truncated, TMM to the tenth, LogFC to the hundredth, and <italic>P</italic>- and <italic>Q</italic>-values to the thousandth.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>The expression of non-canonical isoforms assembled from the read pools of all four species varied across tissues and samples, with some tissues or individuals not expressing splicing variants, and expression was not statistically different between the two tissue types in any of the four species. The <italic>C</italic>. <italic>trifida</italic> non-canonical isoform was expressed at relatively low levels across all aboveground and belowground tissues, the TMM of which ranged from 1.02 belowground to 4.374 aboveground. Contrastingly, the TMM of the <italic>C. wisteriana</italic> non-canonical isoform ranged from 0 to 2.017 and was not detected in two of the belowground replicates for this species. The noncanonical <italic>C</italic>. <italic>maculata</italic> and <italic>C. striata</italic> isoforms were sporadically expressed across tissue types, with the TMM values of one isoform of <italic>C</italic>. <italic>maculata</italic> ranging from 0 to 3.564 and those of <italic>C</italic>. <italic>striata</italic> ranging from 0 to 4.471. Even in the case of the <italic>C</italic>. <italic>striata</italic> isoform for which the TMM value was 4.471 in one belowground sample, the value was either 0 or less than 1 in the remainder of samples.</p>
</sec>
<sec id="s3_7">
<label>3.7</label>
<title>Selection analyses</title>
<p>GARD evaluated 2,393 models and inferred a single potential recombination breakpoint, separating the signal peptide and chain portions of <italic>WHY1</italic>, but AIC<sub>c</sub> was not significantly improved for the partitioned analysis (75,130.4) versus unpartitioned (75,111.5). The RELAX test inferred statistically significant signal for relaxation of selection pressure for the following test branch sets: <italic>Corallorhiza</italic> + <italic>E</italic>. <italic>aphyllum</italic> + <italic>G</italic>. <italic>elata</italic> (<italic>p</italic> = 0.001); <italic>C</italic>. <italic>maculata</italic> + <italic>C</italic>. <italic>striata</italic> + <italic>E</italic>. <italic>aphyllum</italic> + <italic>G</italic>. <italic>elata</italic> (<italic>p</italic> = 0.002); and <italic>E</italic>. <italic>aphyllum</italic> + <italic>G</italic>. <italic>elata</italic> (<italic>p</italic> = 0.001). However, runs of RELAX analyzing test branch sets comprising only <italic>Corallorhiza</italic> (<italic>p</italic> = 0.30) or <italic>C</italic>. <italic>maculata</italic> + <italic>C</italic>. <italic>striata</italic> (<italic>p</italic> = 0.62) did not infer significant signal for selection relaxation.</p>
<p>The FUBAR analysis inferred that 254 of 373 codons were under pervasive purifying selection (posterior probability threshold &#x2265;0.9) and that no codons were under pervasive diversifying selection. Likewise, the BUSTED analyses did not infer statistically significant signal of gene-wide episodic diversifying selection for any test branch set, where <italic>p</italic>-values ranged from <italic>p</italic> = 0.15 for <italic>E</italic>. <italic>aphyllum</italic> + <italic>G</italic>. <italic>elata</italic> to <italic>p</italic> = 0.50 for <italic>Corallorhiza</italic>.</p>
<p>The aBSREL analysis recovered signal of episodic diversifying selection in seven of 217 branches in the <italic>WHY1</italic> tree (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>) but did not infer statistically significant signal in any test branch set during analyses conducted in <italic>a priori</italic> mode. The terminal node leading to <italic>Sorghum bicolor</italic> (<italic>p</italic> = 2.76 &#xd7; 10<sup>&#x2212;6</sup>) was the only branch in the monocots that showed significant signal of episodic diversifying selection. Within the eudicots, the branches leading to <italic>Prunus avium</italic> (<italic>p</italic> = 2.5 &#xd7; 10<sup>&#x2212;5</sup>), the <italic>Populus</italic> clade (<italic>p</italic> = 1.17 &#xd7; 10<sup>&#x2212;7</sup>), the <italic>Quercus</italic> clade (2.35 &#xd7; 10<sup>&#x2212;3</sup>), the <italic>Nicotiana</italic> clade (<italic>p</italic> = 5.42 &#xd7; 10<sup>&#x2212;3</sup>), the <italic>Coffea</italic> clade (6.96 &#xd7; 10<sup>&#x2212;5</sup>), and the asterid clade (<italic>p</italic> = 3.7 &#xd7; 10<sup>&#x2212;4</sup>) were inferred to contain signal of episodic diversifying selection.</p>
</sec>
</sec>
<sec id="s4" sec-type="discussion">
<label>4</label>
<title>Discussion</title>
<p>Our work represents the largest scale investigation of <italic>WHY1</italic> evolution to date and reveals strong phylogenetic signal for this gene across multiple taxonomic levels in the angiosperms. Phylogenetic methods have emerged as the gold-standard for the inference of gene orthology (<xref ref-type="bibr" rid="B73">M&#xfc;nster et&#xa0;al., 1997</xref>; <xref ref-type="bibr" rid="B32">Emms and Kelly, 2015</xref>, <xref ref-type="bibr" rid="B33">Emms and Kelly, 2019</xref>) by which hundreds of genes have been determined to not only be highly conserved across plant lineages but to exist as low- or single-copy (<xref ref-type="bibr" rid="B112">Wu et&#xa0;al., 2006</xref>; <xref ref-type="bibr" rid="B30">Duarte et&#xa0;al., 2010</xref>; <xref ref-type="bibr" rid="B23">De Smet et&#xa0;al., 2013</xref>). The inferred evolutionary relationships of <italic>WHY1</italic> (<xref ref-type="fig" rid="f6">
<bold>Figure&#xa0;6</bold>
</xref>) were largely congruent with our contemporary understanding of angiosperm phylogenetic relationships and recent inferences based on genomic-scale datasets (<xref ref-type="bibr" rid="B104">The Angiosperm Phylogeny Group, 2016</xref>; <xref ref-type="bibr" rid="B42">Guo et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B122">Zhao et&#xa0;al., 2021</xref>). Likewise, the inferred relationships within the Orchidaceae were largely consistent with those inferred in other studies (e.g., <xref ref-type="bibr" rid="B38">Givnish et&#xa0;al., 2016</xref>; <xref ref-type="bibr" rid="B63">Li et al. 2019</xref>; <xref ref-type="bibr" rid="B83">P&#xe9;rez-Escobar et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B98">Serna-S&#xe1;nchez et&#xa0;al., 2021</xref>; <xref ref-type="bibr" rid="B118">Zhang et&#xa0;al., 2023</xref>; <xref ref-type="bibr" rid="B5">Barrett et&#xa0;al., 2024</xref>; <xref ref-type="bibr" rid="B82">P&#xe9;rez-Escobar et&#xa0;al., 2024</xref>), apart from <italic>D</italic>. <italic>catenatum</italic>, <italic>G</italic>. <italic>elata</italic>, and <italic>E</italic>. <italic>aphyllum</italic> forming an early diverging, paraphyletic grade within the Epidendroids. However, our recovery of <italic>C</italic>. <italic>trifida</italic> as sister to the remainder of the species sampled from <italic>Corallorhiza</italic>, rather than <italic>C</italic>. <italic>striata</italic>, was not congruent with relationships inferred previously on the basis of other loci and even genomic-scale datasets (<xref ref-type="bibr" rid="B2">Barrett and Freudenstein, 2008</xref>; <xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>, <xref ref-type="bibr" rid="B7">Barrett et al, 2018</xref>). We are not surprised to infer discordance between the evolutionary history of <italic>WHY1</italic> and that of the phylogenetic history of <italic>Corallorhiza</italic>, since it is accepted that the evolutionary history of any gene can differ from that of the genome within which it is found (<xref ref-type="bibr" rid="B78">Pamilo and Nei, 1988</xref>; <xref ref-type="bibr" rid="B68">Maddison, 1997</xref>). That said, the recovery of the <italic>Corallorhiza</italic> species with the most intact plastome as sister to those with more degraded plastomes may indicate that the gene-species tree discordance we infer is due to functional convergence in <italic>WHY1</italic> sequence. For example, the phenylalanine substitution at position 336 shared by all <italic>Corallorhiza</italic> species aside from <italic>C</italic>. <italic>trifida</italic> may affect protein conformation and therefore <italic>WHY1</italic> function. Our findings of general congruence between the evolutionary history of <italic>WHY1</italic> across angiosperms supports the gene as single- or low-copy across the 110 angiosperm taxa sampled.</p>
<p>Differences in the degree of nucleotide conservation were evident among <italic>WHY1</italic> exons, with the transit peptide region consistently the most divergent throughout the lineages sampled. The diversity and evolution of transit peptides, and the apparent discrepancy between our perception of their functional importance and high-sequence divergence, have long been of interest (<xref ref-type="bibr" rid="B9">Bruce, 2001</xref>; <xref ref-type="bibr" rid="B81">Patron and Waller, 2007</xref>; <xref ref-type="bibr" rid="B21">Christian et&#xa0;al., 2020</xref>). While the functional nature of a transit peptide might lead to an expectation of conservatism, low-sequence similarity and patterns of mutation that we describe in <italic>WHY1</italic> are emerging as generalizable properties of plant transit peptides. For example, <xref ref-type="bibr" rid="B21">Christian et&#xa0;al. (2020)</xref> found that the transit peptides in the genomes of 15 genera sampled throughout the angiosperms had a mean pairwise percent identity of just 37.9% and that random indels drive transit peptide evolution. Our results provide evidence that the transit peptide of <italic>WHY1</italic> evolves similarly, with a pairwise percent identity of 32.5% across the angiosperms, and with evident substitutions and indels downstream of a homologous start codon being the most likely drivers of divergence in the gene region. In contrast, the portion of the chain encoding the ssDNA binding motif was the most conserved with a mean percent pairwise similarity of 72.1%. The only sizable indel observed in the gene region was a seven-codon insertion in <italic>E</italic>. <italic>aphyllum</italic>, a late-stage mycoheterotrophic orchid. It is likely that <italic>E</italic>. <italic>aphyllum WHY1</italic> results in a non-functional product, given that it encodes a premature stop codon that would result in a protein 70 amino acids shorter than that of any other sampled species. Interestingly, <italic>WHY1</italic> was more conserved overall at the nucleotide level among <italic>Corallorhiza</italic> species than were either expression patterns across tissues or exon inclusion in sequenced mRNAs across the trophic gradient. Among autotrophic species, nucleotide divergence of <italic>WHY1</italic> is particularly pronounced in the grasses, with the mean pairwise percent identity among all members of that clade versus the remainder of samples being more than 5% lower than that of the same comparison made for either of the late-stage mycoheterotrophic species sampled. The high substitution rate we inferred for <italic>WHY1</italic> in the grasses is interesting because the plastomes of many species in the clade are known to contain inversions and structural heteroplasmy within individual plants, of which the latter has only recently been described from their relatives the Cyperaceae (<xref ref-type="bibr" rid="B28">Doyle et&#xa0;al., 1992</xref>; <xref ref-type="bibr" rid="B57">Lee et&#xa0;al., 2020</xref>). The contrasting levels of nucleotide conservation in the transit and chain portions of <italic>WHY1</italic> across lineages suggest that lineage-specific functions of the transit peptide sequence could be a fruitful line of investigation, especially given that <xref ref-type="bibr" rid="B21">Christian et&#xa0;al. (2020)</xref> identified bias in amino acid usage between the plastid transit peptide sequences of monocot and eudicot lineages.</p>
<p>Alignment of <italic>WHY1</italic> revealed lineage-specific substitutions at sites inferred to be involved in structural conformation in both poorly and highly conserved gene regions. For example, the transit regions of both late-stage mycoheterotrophic species contained indels not found in other samples, and the sequence of <italic>E</italic>. <italic>aphyllum</italic> contained a premature stop codon in the typically highly conserved chain region followed downstream by four autapomorphic amino substitutions in an eight amino acid span. We identified substitutions that could underlie functional change in each <italic>Corallorhiza</italic> species, the most interesting of which was a substitution involving an alpha helix (position 336) inferred in previous work conducted in <italic>A</italic>. <italic>thaliana</italic> (<xref ref-type="bibr" rid="B15">Cappadocia et&#xa0;al., 2013</xref>). Our phylogenetic framework supports the plesiomorphic codon state of position 336 as a leucine, which we infer was substituted to a phenylalanine in <italic>Corallorhiza</italic> and then to an isoleucine in <italic>C</italic>. <italic>trifida</italic>, on the basis of the relationships depicted in <xref ref-type="fig" rid="f1">
<bold>Figure&#xa0;1</bold>
</xref>. We additionally infer that the phenylalanine substitution present in <italic>Corallorhiza</italic> species aside from <italic>C</italic>. <italic>trifida</italic> would result in the introduction of a benzene ring in which there were previously only aliphatic hydrocarbons. However, comprehensive taxonomic sampling of <italic>WHY1</italic> across <italic>Corallorhiza</italic> is needed to determine sequence diversity and substitution patterns. While our work revealed many nonsynonymous substitutions throughout <italic>Corallorhiza</italic> sequences, we did not identify nucleotide substitutions that obviously result in loss of function in these transitional mycoheterotrophic species.</p>
<p>Statistically significant shifts of selective regime were detected for late-stage mycoheterotrophic and some non-orchid autotrophic lineages. Our inference that 68.1% of <italic>WHY1</italic> codons are under pervasive purifying selection and none are under pervasive positive selection, despite the relatively low levels of nucleotide conservatism we documented, are congruent with the critical function of the gene across the angiosperms as has been documented in model and agricultural species in previous work (<xref ref-type="bibr" rid="B15">Cappadocia et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B60">Lepage et&#xa0;al., 2013</xref>; <xref ref-type="bibr" rid="B116">Zampini et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B93">Ren et&#xa0;al., 2017</xref>; <xref ref-type="bibr" rid="B87">Qiu et&#xa0;al., 2022</xref>; <xref ref-type="bibr" rid="B96">Ruan et&#xa0;al., 2022</xref>; <xref ref-type="bibr" rid="B103">Taylor et&#xa0;al., 2022</xref>; <xref ref-type="bibr" rid="B101">Sun et&#xa0;al., 2023</xref>). While we inferred disproportionately high substitution rates for <italic>WHY1</italic> in the grasses, we did not find evidence that divergence of those sequences was associated with diversifying selection. Despite the lack of pervasive positive selection in <italic>WHY1</italic> across the angiosperms, our identification of significant episodic diversifying selection in seven of 100 autotrophic lineages sampled suggests that a lineage-specific adaptive role of <italic>WHY1</italic> may be relatively common, though similar signal was not identified in sampled mycoheterotrophs. The lack of diversifying signal in any of our mycoheterotrophic species does not support neofunctionalization of the gene, which we find interesting given the multiple functions of <italic>WHY1</italic> and the diversifying signal identified in lineages across the angiosperms. Our findings together suggest that relaxation of <italic>WHY1</italic> selective constraint occurs after the transition to full mycoheterotrophy, as significant signal of relaxed selection was only detected in branch sets containing <italic>E</italic>. <italic>aphyllum</italic> and <italic>G</italic>. <italic>elata</italic>, two late-stage fully mycoheterotrophic orchids.</p>
<p>Our analyses suggest that the expression of <italic>WHY1</italic> in <italic>Corallorhiza</italic> may differ by both tissue type and across the mycoheterotrophic gradient. Multiple studies have induced and characterized the effects of differential expression of <italic>WHY1</italic> by exposure to biotic and abiotic stimuli, implicating roles for the gene ranging from mediating drought stress (<xref ref-type="bibr" rid="B121">Zhao et al., 2018</xref>; <xref ref-type="bibr" rid="B96">Ruan et&#xa0;al., 2022</xref>) to pathogen response (<xref ref-type="bibr" rid="B25">Desveaux et&#xa0;al., 2000</xref>; <xref ref-type="bibr" rid="B101">Sun et&#xa0;al., 2023</xref>). A minimum level of <italic>WHY1</italic> expression could be expected due to roles of <italic>WHY1</italic> that are not involved in photosynthesis, such as the maintenance of telomere length of nuclear chromosomes (<xref ref-type="bibr" rid="B114">Yoo et&#xa0;al., 2007</xref>). Our analyses are the first to characterize <italic>WHY1</italic> expression in non-model or non-cultivated plant species, and therefore baseline expectations for tissue-specific expression levels for wild species have not yet been established. However, the estimations of gene expression we inferred for <italic>Corallorhiza</italic> species are within the expression ranges reported in studies of <italic>A</italic>. <italic>thaliana</italic> (4&#x2013;76 TPM; <xref ref-type="bibr" rid="B66">Liu et&#xa0;al., 2012</xref>; <xref ref-type="bibr" rid="B72">Mergner et&#xa0;al., 2020</xref>) and <italic>Solanum tuberosum</italic> (7&#x2013;50 TPM; <xref ref-type="bibr" rid="B105">The Potato Genome Sequencing Consortium, 2011</xref>). Additionally, the Klepikova <italic>Arabidopsis</italic> Atlas (<xref ref-type="bibr" rid="B50">Klepikova et&#xa0;al., 2016</xref>) and 1,122 tissue-specific samples available via the <italic>Arabidopsis</italic> RNA-seq Database (<ext-link ext-link-type="uri" xlink:href="http://ipf.sustech.edu.cn/pub/athrdb/">http://ipf.sustech.edu.cn/pub/athrdb/</ext-link> accessed 8 March 2023) evidence that <italic>WHY1</italic> expression should be expected to be lower in roots or rhizomes than in leaves, which is congruent with expression patterns in <italic>C</italic>. <italic>trifida</italic>. Previous studies reporting <italic>WHY1</italic> expression have been conducted in species with larger individuals with typical, non-reduced morphologies allowing for finer scale investigations of tissue-specific expression than can be conducted in <italic>Corallorhiza</italic>, due to a lack of leaf laminae and roots in the latter. However, the patterns of expression we inferred across <italic>Corallorhiza</italic> tissues are congruent with those known for <italic>WHY1</italic>. While gene-level expression inferred across tissues was highest for <italic>C</italic>. <italic>wisteriana</italic> and <italic>C. trifida</italic>, tissue-specific expression was only statistically significant in <italic>C</italic>. <italic>trifida</italic>, after correcting for repeated testing. Our inferences of <italic>WHY1</italic> expression among the belowground tissues of <italic>Corallorhiza</italic> provide for a hypothesized minimum expression level of canonical <italic>WHY1</italic>, which is similar to aboveground levels of expression in <italic>Corallorhiza</italic> species aside from <italic>C</italic>. <italic>trifida.</italic> Taken together, our results provide evidence for a trajectory beginning with differential expression of <italic>WHY1</italic> between aboveground and belowground tissues of the most photosynthetically capable <italic>Corallorhiza</italic> species to similar expression levels between above and belowground tissues of the latest stage mycoheterotrophic members of the genus. However, future work leveraging qPCR-derived estimates of <italic>WHY1</italic> expression across species and tissues is needed to corroborate the trends in expression patterns we characterize here. Taken together, our results suggest that alteration in expression or splicing of <italic>WHY1</italic> is unlikely to underlie the transition to mycoheterotrophy since the tissue-level expression patterns of the gene in <italic>C</italic>. <italic>trifida</italic> are similar to those described from autotrophic plants.</p>
<p>Our work is the first to provide evidence for alternative splicing of, and intron retention in, <italic>WHY1.</italic> Approximately 70% of plant genes with multiple exons can be expected to be alternatively spliced (<xref ref-type="bibr" rid="B92">Reddy et al., 2013</xref>; <xref ref-type="bibr" rid="B16">Chamala et&#xa0;al., 2015</xref>), and intron retention is a common form of alternative splicing in plants (<xref ref-type="bibr" rid="B76">Ner-Gaon et&#xa0;al., 2004</xref>). Our finding of intron retention in one isoform from each <italic>Corallorhiza</italic> species aside from <italic>C</italic>. <italic>trifida</italic> is the first such event described for the gene. It has long been recognized that intron retention is most common in transcripts of genes like <italic>WHY1</italic> which serve roles related to photosynthesis and stress response (<xref ref-type="bibr" rid="B76">Ner-Gaon et&#xa0;al., 2004</xref>), a finding also supported by work investigating the effects of plant stressors on levels of alternative splicing (<xref ref-type="bibr" rid="B35">Filichkin et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B46">Jabre et&#xa0;al., 2019</xref>). In fact, tissue-specific differential intron retention has been shown to be an inducible stress response in <italic>Populus trichocarpa</italic> (<xref ref-type="bibr" rid="B35">Filichkin et&#xa0;al., 2018</xref>). Modification of <italic>WHY1</italic>, including inserted sequence, has long been used to study the effects of mutations on the function of the gene and to induce knockouts (<xref ref-type="bibr" rid="B27">Desveaux et&#xa0;al., 2004</xref>; <xref ref-type="bibr" rid="B114">Yoo et&#xa0;al., 2007</xref>; <xref ref-type="bibr" rid="B69">Mar&#xe9;chal et&#xa0;al., 2009</xref>), work which helped identify the many pathways that <italic>WHY1</italic> is involved in. For example, <xref ref-type="bibr" rid="B114">Yoo et&#xa0;al. (2007)</xref> and <xref ref-type="bibr" rid="B69">Mar&#xe9;chal et&#xa0;al. (2009)</xref> leveraged knockouts caused by T-DNA insertions into <italic>WHY1</italic> to reveal the critical role the gene plays in maintaining telomere length and plastome stability in <italic>Arabidopsis thaliana</italic>, respectively. Similarly, <xref ref-type="bibr" rid="B86">Prikryl et&#xa0;al. (2008)</xref> and <xref ref-type="bibr" rid="B87">Qiu et&#xa0;al. (2022)</xref> found that double knockout <italic>WHY1</italic> mutants were characterized by a lethal albino phenotype after the development of a few leaves, with <xref ref-type="bibr" rid="B87">Qiu et&#xa0;al. (2022)</xref> also characterizing divergent splicing and mRNA editing of plastid genes in <italic>WHY1</italic> mutants. The assembly of intron-retaining transcripts from all <italic>Corallorhiza</italic> species aside from <italic>C</italic>. <italic>trifida</italic>, and more transcriptional isoforms in <italic>Corallorhiza</italic> species in later stages of mycoheterotrophy, is suggestive of a negative correlation between increased splicing variation and both plastome stability and chlorophyll concentration in tissues (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>). However, future long-read sequencing of isoforms is needed to definitively verify assembled isoform variants and the presence of retained introns. We propose that it is unlikely that the intron-containing isoforms result in functional products, since a premature stop codon results from intron retention. The recovery of non-canonical and intron-retaining <italic>WHY1</italic> isoforms across individuals and tissues of <italic>Corallorhiza</italic> could be signal of idiosyncratic spliceosome regulation in mycoheterotrophic species, epitranscriptomic differences (<xref ref-type="bibr" rid="B46">Jabre et&#xa0;al., 2019</xref>), differential responses to stress in sampled tissues (<xref ref-type="bibr" rid="B35">Filichkin et&#xa0;al., 2018</xref>), or the expression of multiple, divergent copies of the gene. However, we hypothesize that reduced fidelity in the spliceosome of a mycoheterotrophic plant is the most likely cause of the observed splicing variation, given the phylogenetic and sequencing data at hand. We predict that changes in the expression and splicing of <italic>WHY1</italic> across <italic>Corallorhiza</italic> would likely be due to the alteration of one or more pathways involved in gene regulation, since we did find nucleotide-level changes that could be responsible.</p>
<p>Our work is the first to characterize the evolution of a transcription factor that could impact the genetic and phenotypic changes that occur along the path to full mycoheterotrophy. The previously characterized roles that <italic>WHY1</italic> plays in plastome stability (<xref ref-type="bibr" rid="B79">Parent et&#xa0;al., 2011</xref>; <xref ref-type="bibr" rid="B60">Lepage et al, 2013</xref>), defense responses (<xref ref-type="bibr" rid="B65">Lin et&#xa0;al., 2020</xref>), and leaf senescence (<xref ref-type="bibr" rid="B64">Lin et&#xa0;al., 2019</xref>), together position the gene as a worthwhile target for the study of the molecular underpinnings of the transition to mycoheterotrophy. Heterotrophy in plants is associated with genomic restructuring, where a trend of plastome contraction and nuclear genome expansion via rampant repetitive element accumulation has commonly been observed (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>; <xref ref-type="bibr" rid="B67">Lyko and Wicke, 2021</xref>). Dramatic reductions of plastome length and gene content of mycoheterotrophic plants have been documented, ranging from minimal degradation in early transitional orchids such as <italic>Corallorhiza</italic> (<xref ref-type="bibr" rid="B1">Barrett and Davis, 2012</xref>) to pronounced degradation in late-transitional orchids such as <italic>Epipogium</italic> and <italic>Pogoniopsis</italic> (<xref ref-type="bibr" rid="B97">Schelkunov et&#xa0;al., 2015</xref>; <xref ref-type="bibr" rid="B51">Klimpert et&#xa0;al., 2022</xref>). Our finding of putatively functional <italic>WHY1</italic>, but the putative presence of increasingly alternatively spliced <italic>WHY1</italic> isoforms across the <italic>Corallorhiza</italic> trophic gradient is not necessarily surprising, given the minimally destabilized plastomes of the group. For example, <xref ref-type="bibr" rid="B1">Barrett and Davis (2012)</xref> found that the plastome of <italic>C</italic>. <italic>striata</italic>, the most destabilized of the <italic>Corallorhiza</italic> species included here, is only about 6% reduced relative to that of a leafy, autotrophic relative. Despite their relatively intact states, <italic>Corallorhiza</italic> plastomes are in various stages of degradation (<xref ref-type="bibr" rid="B4">Barrett et&#xa0;al., 2014</xref>), and our work here together suggests a negative correlation between both increased putatively aberrant splicing and nucleotide-level divergence of <italic>WHY1</italic> with plastome stability across the sampled species. Likewise, the plastomes of <italic>Gastrodia elata</italic> and <italic>Epipogium aphyllum</italic>, both late-stage mycoheterotrophs for which our analyses show that <italic>WHY1</italic> contains premature stop codons and significant signal of relaxed selection, are both extremely reduced and syntenically disrupted (<xref ref-type="bibr" rid="B115">Yuan et&#xa0;al., 2018</xref>; <xref ref-type="bibr" rid="B19">Chen et&#xa0;al., 2020</xref>; <xref ref-type="bibr" rid="B113">Xu et&#xa0;al., 2021</xref>). Taken together, our findings provide the first evidence of a potential negative correlation between increased divergence in sequence, splicing of <italic>WHY1</italic>, and plastome stability in early to late-stage mycoheterotrophic orchids.</p>
</sec>
<sec id="s5" sec-type="conclusions">
<label>5</label>
<title>Conclusions</title>
<p>Our work showcases the rich opportunities afforded by mycoheterotrophic plants not just for the study of the evolution of <italic>WHY1</italic> but for any gene of which homozygous knockouts can result in a fatal phenotype in autotrophic plants. Continued investigation of non-autotrophic plant lineages promises to fill gaps in our understanding of the precursors and consequences of genomic instability, and even the minimum gene space of land plants. Here we presented findings of non-synonymous nucleotide substitutions in functionally annotated regions in <italic>Corallorhiza WHY1</italic> sequence, and a high degree of divergence in <italic>WHY1</italic> in late-stage fully mycoheterotrophic orchids. However, our results together suggest that changes to the expression and splicing of <italic>WHY1</italic> may occur prior to the establishment of obviously deleterious genomic substitutions that would render the TF non-functional in late stage mycoheterotrophic orchids. In sum, our work characterizes <italic>WHY1</italic> variation and evolution throughout the angiosperms and serves as the first evidence of a potential correlation between decreased expression and increased alternative splicing of <italic>WHY1</italic> concomitant with plastome degradation in a group of early transitional mycoheterotrophic orchids. However, our results do not implicate divergent <italic>WHY1</italic> function as a primary factor in the transition from partial to full mycoheterotrophy. Future work documenting differential non-canonical splicing and tissue-level expression of <italic>WHY1 in vivo</italic> are necessary to fully confirm the results of our transcriptomic analyses.</p>
</sec>
<sec id="s6" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: Bioproject accession number: PRJNA984634.</p>
</sec>
<sec id="s7" sec-type="author-contributions">
<title>Author contributions</title>
<p>BS and CB contributed to the conception and design of the study. CB collected <italic>Corallorhiza</italic> samples for RNA-seq. RM and BS collected all publicly available samples, conducted wet-lab work, Nanopore sequencing, and bioinformatic analyses. RM wrote the first manuscript draft. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="s8" sec-type="funding-information">
<title>Funding</title>
<p>The author(s) declare financial support was received for the research, authorship, and/or publication of this article. Funding was provided by the Undergraduate Student Research Award and the Bert and Jane Horn Endowed Student Research Award from Otterbein University to RM; US National Science Foundation (DEB#0830020), the California State University Program for Research and Education in Biotechnology, and the West Virginia University Program to Stimulate Competitive Research to CB, and a Faculty Scholarship and Development Fund Award from Otterbein University to BS.</p>
</sec>
<ack>
<title>Acknowledgments</title>
<p>We would like to acknowledge John V. Freudenstein for supplying DNAs used for Nanopore sequencing, the West Virginia University Genomics Core Facility for their services and technical expertise. We also thank the USDA Forest Service and California State Parks for permission to collect material.</p>
</ack>
<sec id="s9" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s10" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2024.1241515/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpls.2024.1241515/full#supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table_1.docx" id="ST1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
<supplementary-material xlink:href="DataSheet_1.fasta" id="SM1" mimetype="text/x-fasta"/>
<supplementary-material xlink:href="DataSheet_2.fasta" id="SM2" mimetype="text/x-fasta"/>
<supplementary-material xlink:href="DataSheet_3.fasta" id="SM3" mimetype="text/x-fasta"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Davis</surname> <given-names>J. I.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>The plastid genome of the mycoheterotrophic <italic>Corallorhiza striata</italic> (Orchidaceae) is in the relatively early stages of degradation</article-title>. <source>Am. J. Bot.</source> <volume>99</volume>, <fpage>1513</fpage>&#x2013;<lpage>1523</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3732/ajb.1200256</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Molecular evolution of <italic>rbcL</italic> in the mycoheterotrophic coralroot orchids (<italic>Corallorhiza</italic> Gagnebin, Orchidaceae)</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>47</volume>, <fpage>665</fpage>&#x2013;<lpage>679</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ympev.2008.02.014</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
<name>
<surname>Lee Taylor</surname> <given-names>D.</given-names>
</name>
<name>
<surname>K&#xf5;ljalg</surname> <given-names>U.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Rangewide analysis of fungal associations in the fully mycoheterotrophic <italic>Corallorhiza striata</italic> complex (Orchidaceae) reveals extreme specificity on ectomycorrhizal <italic>Tomentella</italic> (Thelephoraceae) across North America</article-title>. <source>Am. J. Bot.</source> <volume>97</volume>, <fpage>628</fpage>&#x2013;<lpage>643</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3732/ajb.0900230</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Mayfield-Jones</surname> <given-names>D. R.</given-names>
</name>
<name>
<surname>Perez</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Pires</surname> <given-names>J. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2014</year>). <article-title>Investigating the path of plastid genome degradation in an early transitional clade of heterotrophic orchids, and implications for heterotrophic angiosperms</article-title>. <source>Mol. Biol. Evol.</source> <volume>31</volume>, <fpage>3095</fpage>&#x2013;<lpage>3112</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msu252</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Pace</surname> <given-names>M. C.</given-names>
</name>
<name>
<surname>Corbett</surname> <given-names>C. W.</given-names>
</name>
<name>
<surname>Kennedy</surname> <given-names>A. H.</given-names>
</name>
<name>
<surname>Thixton-Nolan</surname> <given-names>H. L.</given-names>
</name>
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Organellar phylogenomics at the epidendroid orchid base, with a focus on the mycoheterotrophic <italic>Wullschlaegelia</italic>
</article-title>. <source>Ann. Bot</source>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/aob/mcae084</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Sinn</surname> <given-names>B. T.</given-names>
</name>
<name>
<surname>Kennedy</surname> <given-names>A. H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Unprecedented parallel photosynthetic losses in a heterotrophic orchid genus</article-title>. <source>Mol. Biol. Evol.</source> <volume>36</volume>, <fpage>1884</fpage>&#x2013;<lpage>1902</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msz111</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Wicke</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Sass</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Dense infraspecific sampling reveals rapid and independent trajectories of plastome degradation in a heterotrophic orchid complex</article-title>. <source>New Phytol.</source> <volume>218</volume>, <fpage>1192</fpage>&#x2013;<lpage>1204</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/nph.15072</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bolger</surname> <given-names>A. M.</given-names>
</name>
<name>
<surname>Lohse</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Usadel</surname> <given-names>B.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Trimmomatic: a flexible trimmer for Illuminasequence data</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2114</fpage>&#x2013;<lpage>2120</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btu170</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bruce</surname> <given-names>B. D.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>The paradox of plastid transit peptides: conservation of function despitedivergence in primary structure</article-title>. <source>Biochim. Biophys. Acta</source> <volume>1541</volume>, <fpage>2</fpage>&#x2013;<lpage>21</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S0167-4889(01)00149-5</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Bushnell</surname> <given-names>B. B.</given-names>
</name>
</person-group> (<year>2022</year>) <source>bbtools</source>. Available online at: <uri xlink:href="https://sourceforge.net/projects/bbmap/files">https://sourceforge.net/projects/bbmap/files</uri>.</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cai</surname> <given-names>L.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>Rethinking convergence in plant parasitism through the lens of molecular and population genetic processes</article-title>. <source>Am. J. Bot</source>. <volume>110</volume> (<issue>5</issue>), <elocation-id>e16174</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/ajb2.16174</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cai</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Vanneste</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Proost</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Tsai</surname> <given-names>W.-C.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>K.-W.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>The genome sequence of the orchid <italic>Phalaenopsis equestris</italic>
</article-title>. <source>Nat. Genet.</source> <volume>47</volume>, <fpage>65</fpage>&#x2013;<lpage>72</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/ng.3149</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cameron</surname> <given-names>D. D.</given-names>
</name>
<name>
<surname>Preiss</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Gebauer</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Read</surname> <given-names>D. J.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>The chlorophyll-containingorchid <italic>Corallorhiza trifida</italic> derives little carbon through photosynthesis</article-title>. <source>New Phytol.</source> <volume>183</volume>, <fpage>358</fpage>&#x2013;<lpage>364</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.2009.02853.x</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cappadocia</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Mar&#xe9;chal</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Parent</surname> <given-names>J.-S.</given-names>
</name>
<name>
<surname>Lepage</surname> <given-names>&#xc9;.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Crystal structures of DNA Whirly complexes and their role in <italic>Arabidopsis</italic> organelle genome repair</article-title>. <source>Plant Cell</source> <volume>22</volume>, <fpage>1849</fpage>&#x2013;<lpage>1867</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1105/tpc.109.071399</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cappadocia</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Parent</surname> <given-names>J.-S.</given-names>
</name>
<name>
<surname>Sygusch</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>A family portrait: structural comparison of the Whirly proteins from <italic>Arabidopsis thaliana</italic> and <italic>Solanum tuberosum</italic>
</article-title>. <source>Acta Crystallographica Section</source> <volume>F69</volume>, <fpage>1207</fpage>&#x2013;<lpage>1211</lpage>. doi:&#xa0;10.1107%2FS1744309113028698
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chamala</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Feng</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Chavarro</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Barbazuk</surname> <given-names>W. B.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Genome-wide identification of evolutionarily conserved alternative splicing events in flowering plants</article-title>. <source>Front. Bioengineering Biotechnol.</source> <volume>3</volume>, <elocation-id>33.</elocation-id> doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fbioe.2015.00033</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chao</surname> <given-names>Y.-T.</given-names>
</name>
<name>
<surname>Yen</surname> <given-names>S.-H.</given-names>
</name>
<name>
<surname>Yeh</surname> <given-names>J.-H.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>W.-C.</given-names>
</name>
<name>
<surname>Shih</surname> <given-names>M.-C.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Orchidstra 2.0 &#x2013; a transcriptomics resource for the orchid family</article-title>. <source>Plant Cell Physiol.</source> <volume>58</volume>, <elocation-id>e9</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/pcp/pcw220</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>Y.-Y.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>C.-I.</given-names>
</name>
<name>
<surname>Hsiao</surname> <given-names>Y.-Y.</given-names>
</name>
<name>
<surname>Ho</surname> <given-names>S.-Y.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>Z.-B.</given-names>
</name>
<name>
<surname>Liao</surname> <given-names>C.-C.</given-names>
</name>
<etal/>
</person-group>. (<year>2022</year>). <article-title>OrchidBase 5.0: updates of the orchid genome knowledgebase</article-title>. <source>BMC Plant Biol.</source> <volume>22</volume>, <fpage>557</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s12870-022-03955-5</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Song</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>X.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Improved <italic>de novo</italic> assembly of the achlorophyllous orchid <italic>Gastrodia elata</italic>
</article-title>. <source>Front. Genet.</source> <volume>11</volume>, <elocation-id>580568.</elocation-id> doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fgene.2020.580568</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chernomor</surname> <given-names>O.</given-names>
</name>
<name>
<surname>von Haeseler</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Minh</surname> <given-names>B. Q.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Terrace aware data structure for phylogenomic inference from supermatrices</article-title>. <source>Systematic Biol.</source> <volume>65</volume>, <fpage>997</fpage>&#x2013;<lpage>1008</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/sysbio/syw037</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Christian</surname> <given-names>R. W.</given-names>
</name>
<name>
<surname>Hewitt</surname> <given-names>S. L.</given-names>
</name>
<name>
<surname>Nelson</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Roalson</surname> <given-names>E. H.</given-names>
</name>
<name>
<surname>Dhingra</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Plastid transit peptides&#x2014;where do they come from and where do they all belong? Multi-genome and pan-genomic assessment of chloroplast transit peptide evolution</article-title>. <source>PeerJ</source> <volume>8</volume>, <elocation-id>e9772</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.7717/peerj.9772</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>de Mendoza</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Seb&#xe9;-Pedr&#xf3;s</surname> <given-names>A.</given-names>
</name>
<name>
<surname>&#x15c;estak</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Matej&#x10d;i&#x107;</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Torruella</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Domazet-Lo&#x161;o</surname> <given-names>T.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>Transcription factor evolution in eukaryotes and the assembly of the regulatory toolkit in multicellular lineages</article-title>. <source>Proc. Natl. Acad. Sci. United States America</source> <volume>110</volume>, <fpage>E4858</fpage>&#x2013;<lpage>E4866</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.1311818110</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>De Smet</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Adams</surname> <given-names>K. L.</given-names>
</name>
<name>
<surname>Vandepoele</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Van Montagu</surname> <given-names>M. C. E.</given-names>
</name>
<name>
<surname>Maere</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Van de Peer</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants</article-title>. <source>Proc. Acad. Sci. United States America</source> <volume>110</volume>, <fpage>2898</fpage>&#x2013;<lpage>2903</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.1300127110</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desveaux</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Allard</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Sygusch</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>A new family of plant transcriptionfactors displays a novel ssDNA-binding surface</article-title>. <source>Nat. Struct. Mol. Biol.</source> <volume>9</volume>, <fpage>512</fpage>&#x2013;<lpage>517</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nsb814</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desveaux</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Despr&#xe9;s</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Joyeux</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Subramaniam</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>PBF-2 is a novel single-stranded DNA binding factor implicated in <italic>PR-10a</italic> gene activation in potato</article-title>. <source>Plant Cell</source> <volume>12</volume>, <fpage>1477</fpage>&#x2013;<lpage>1489</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1105/tpc.12.8.1477</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desveaux</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Mar&#xe9;chal</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Whirly transcription favors: defense gene regulation and beyond</article-title>. <source>Trends Plant Sci.</source> <volume>10</volume>, <fpage>1360</fpage>&#x2013;<lpage>1385</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.tplants.2004.12.008</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Desveaux</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Subramaniam</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Despr&#xe9;s</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Mess</surname> <given-names>J.-N.</given-names>
</name>
<name>
<surname>L&#xe9;vesque</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Fobert</surname> <given-names>P. R.</given-names>
</name>
<etal/>
</person-group>. (<year>2004</year>). <article-title>A &#x201c;whirly&#x201d; transcription factor is required for salicylic acid dependent disease resistance in <italic>Arabidopsis</italic>
</article-title>. <source>Dev. Cell</source> <volume>6</volume>, <fpage>229</fpage>&#x2013;<lpage>240</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/S1534-5807(04)00028-0</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doyle</surname> <given-names>J. J.</given-names>
</name>
<name>
<surname>Davis</surname> <given-names>J. I.</given-names>
</name>
<name>
<surname>Soreng</surname> <given-names>R. J.</given-names>
</name>
<name>
<surname>Garvin</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Anderson</surname> <given-names>M. J.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Chloroplast DNA inversions and the origin of the grass family</article-title>. <source>Proc. Natl. Acad. Sci. United States America</source> <volume>89</volume>, <fpage>7722</fpage>&#x2013;<lpage>7726</lpage>. doi:&#xa0;10.1073%2Fpnas.89.16.7722
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doyle</surname> <given-names>J. J.</given-names>
</name>
<name>
<surname>Doyle</surname> <given-names>J. L.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>A rapid DNA isolation procedure for small quantities of fresh leaf tissue</article-title>. <source>Phytochemical Bull.</source> <volume>19</volume>, <fpage>11</fpage>&#x2013;<lpage>15</lpage>.</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Duarte</surname> <given-names>J. M.</given-names>
</name>
<name>
<surname>Wall</surname> <given-names>P. K.</given-names>
</name>
<name>
<surname>Edger</surname> <given-names>P. P.</given-names>
</name>
<name>
<surname>Landherr</surname> <given-names>L. L.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Pires</surname> <given-names>J. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2010</year>). <article-title>Identification of shared single copy nuclear genes in Arabidopsis, Populus, <italic>Vitis</italic> and <italic>Oryza</italic> and their phylogenetic utility across various taxonomic levels</article-title>. <source>BMC Evolutionary Biol.</source> <volume>10</volume>, <fpage>61</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/1471-2148-10-61</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eddy</surname> <given-names>S. R.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Accelerated profile HMM searches</article-title>. <source>PloS Comput. Biol.</source> <volume>7</volume>, <elocation-id>e1002195</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pcbi.1002195</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emms</surname> <given-names>D. M.</given-names>
</name>
<name>
<surname>Kelly</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy</article-title>. <source>Genome Biol.</source> <volume>16</volume>, <fpage>157</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s13059-015-0721-2</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Emms</surname> <given-names>D. M.</given-names>
</name>
<name>
<surname>Kelly</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>OrthoFinder: phylogenetic orthology inference for comparative genomics</article-title>. <source>Genome Biol.</source> <volume>20</volume>, <fpage>238</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s13059-019-1832-y</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Felsenstein</surname> <given-names>J.</given-names>
</name>
</person-group> (<year>1985</year>). <article-title>Phylogenies and the comparative method</article-title>. <source>Am. Nat.</source> <volume>125</volume>, <fpage>1</fpage>&#x2013;<lpage>15</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1086/284325</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Filichkin</surname> <given-names>S. A.</given-names>
</name>
<name>
<surname>Hamilton</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Dharmawardhana</surname> <given-names>P. D.</given-names>
</name>
<name>
<surname>Singh</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Sullivan</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Ben-Hur</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>Abiotic stresses modulate landscape of <italic>Popular</italic> transcriptome via alternative splicing, differential intron retention, and isoform ratio switching</article-title>. <source>Front. Plant Sci.</source> <volume>9</volume>, <elocation-id>5.</elocation-id> doi:&#xa0;<pub-id pub-id-type="doi">10.3389/fpls.2018.00005</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Foyer</surname> <given-names>C. H.</given-names>
</name>
<name>
<surname>Karpinska</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Krupinska</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>The functions of WHIRLY1 and REDOXRESPONSIVE TRANSCRIPTION FACTOR 1 in cross tolerance responses in plants: a hypothesis</article-title>. <source>Philos. Trans. R. Soc. B</source> <volume>369</volume>, <fpage>20130226</fpage>. doi:&#xa0;10.1098%2Frstb.2013.0226
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Fungal host utilization helps circumscribe leafless Coralroot orchid species: an integrative analysis of Corallorhiza odontorhiza and C. wisteriana</article-title>. <source>Taxon</source> <volume>63</volume> (<issue>4</issue>), <fpage>759</fpage>&#x2013;<lpage>772</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.12705/634.3</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Givnish</surname> <given-names>T. J.</given-names>
</name>
<name>
<surname>Spali nk</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Ames</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Lyon</surname> <given-names>S. P.</given-names>
</name>
<name>
<surname>Hunter</surname> <given-names>S. J.</given-names>
</name>
<name>
<surname>Zuluaga</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>Orchid historical biogeography, diversification, Antarctica and the paradox of orchid dispersal</article-title>. <source>J. Biogeography</source> <volume>43</volume>, <fpage>1905</fpage>&#x2013;<lpage>1916</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/jbi.12854</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grabherr</surname> <given-names>M. G.</given-names>
</name>
<name>
<surname>Haas</surname> <given-names>B. J.</given-names>
</name>
<name>
<surname>Yassour</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Levin</surname> <given-names>J. Z.</given-names>
</name>
<name>
<surname>Thompson</surname> <given-names>D. A.</given-names>
</name>
<name>
<surname>Amit</surname> <given-names>I.</given-names>
</name>
<etal/>
</person-group>. (<year>2011</year>). <article-title>Full-length transcriptome assembly from RNA-Seq data without a reference genome</article-title>. <source>Nat. Biotechnol.</source> <volume>29</volume>, <fpage>644</fpage>&#x2013;<lpage>652</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nbt.1883</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grabowski</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Miao</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Mulisch</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Krupinska</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Single-stranded DNA-binding protein Whirly1 in barley leaves is located in plastids and the nucleus of the same cell</article-title>. <source>Plant Physiol.</source> <volume>147</volume>, <fpage>1800</fpage>&#x2013;<lpage>1804</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1104/pp.108.122796</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Graham</surname> <given-names>S. W.</given-names>
</name>
<name>
<surname>Lam</surname> <given-names>V. K. Y.</given-names>
</name>
<name>
<surname>Merckx</surname> <given-names>V. S. F. T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Plastomes on the edge: the evolutionary breakdown of mycoheterotroph plastid genomes</article-title>. <source>New Phytol.</source> <volume>214</volume>, <fpage>48</fpage>&#x2013;<lpage>55</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/nph.14398</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Fang</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Sahu</surname> <given-names>S. K.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Guang</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Folk</surname> <given-names>R.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>
<italic>Chloranthus</italic> genome provides insights into the early diversification of angiosperms</article-title>. <source>Nat. Commun.</source> <volume>12</volume>, <fpage>6930</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41467-021-26922-4</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haas</surname> <given-names>B. J.</given-names>
</name>
<name>
<surname>Papanicolaou</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Yassour</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Grabherr</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Blood</surname> <given-names>P. D.</given-names>
</name>
<name>
<surname>Bowden</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>
<italic>De novo</italic> transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis</article-title>. <source>Nat. Protoc.</source> <volume>8</volume>, <fpage>1494</fpage>&#x2013;<lpage>1512</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nprot.2013.084</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoang</surname> <given-names>D. T.</given-names>
</name>
<name>
<surname>Chernomor</surname> <given-names>O.</given-names>
</name>
<name>
<surname>von Haeseler</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Minh</surname> <given-names>B. Q.</given-names>
</name>
<name>
<surname>Vinh</surname> <given-names>L. S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>UFBoot2: improving the ultrafast bootstrap approximation</article-title>. <source>Mol. Biol. Evol.</source> <volume>35</volume>, <fpage>518</fpage>&#x2013;<lpage>522</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msx281</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Isemer</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Mulisch</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Sch&#xe4;fer</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Kirchner</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Koop</surname> <given-names>H.-U.</given-names>
</name>
<name>
<surname>Krupinksa</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Recombinant Whirly1 translocates from transplastomic chloroplasts to the nucleus</article-title>. <source>Fed. Eur. Biochem. Societies Lett.</source> <volume>586</volume>, <fpage>85</fpage>&#x2013;<lpage>88</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.febslet.2011.11.029</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jabre</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Reddy</surname> <given-names>A. S. N.</given-names>
</name>
<name>
<surname>Kalyna</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Chaudhary</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Khokhar</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Byrne</surname> <given-names>L. J.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Does co-transcriptional regulation of alternative splicing mediate plant stress responses</article-title>? <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>2716</fpage>&#x2013;<lpage>2726</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkz121</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kalyaanamoorthy</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Minh</surname> <given-names>B. Q.</given-names>
</name>
<name>
<surname>Wong</surname> <given-names>T. K. F.</given-names>
</name>
<name>
<surname>von Haeseler</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Jermiin</surname> <given-names>L. S.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>ModelFinder: fast model selection for accurate phylogenetic estimates</article-title>. <source>Nat. Methods</source> <volume>14</volume>, <fpage>587</fpage>&#x2013;<lpage>589</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nmeth.4285</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Katoh</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Misawa</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kuma</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Miyata</surname> <given-names>T.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform</article-title>. <source>Nucleic Acids Res.</source> <volume>30</volume>, <fpage>3059</fpage>&#x2013;<lpage>3066</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkf436</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Katoh</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Standley</surname> <given-names>D. M.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability</article-title>. <source>Mol. Biol. Evol.</source> <volume>30</volume>, <fpage>772</fpage>&#x2013;<lpage>780</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klepikova</surname> <given-names>A. V.</given-names>
</name>
<name>
<surname>Kasianov</surname> <given-names>A. S.</given-names>
</name>
<name>
<surname>Gerasimov</surname> <given-names>E. S.</given-names>
</name>
<name>
<surname>Logacheva</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Penin</surname> <given-names>A. A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>A high resolution map of the <italic>Arabidopsis thaliana</italic> developmental transcriptome based on RNA-seq profiling</article-title>. <source>Plant J.</source> <volume>88</volume>, <fpage>1058</fpage>&#x2013;<lpage>1070</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/tpj.13312</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Klimpert</surname> <given-names>N. J.</given-names>
</name>
<name>
<surname>Mayer</surname> <given-names>J. L. S.</given-names>
</name>
<name>
<surname>Sarzi</surname> <given-names>D. S.</given-names>
</name>
<name>
<surname>Prosdocimi</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Pinheiro</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Graham</surname> <given-names>S. W.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Phylogenomics and plastome evolution of a Brazilian mycoheterotrophic orchid, <italic>Pogoniopsis schenckii</italic>
</article-title>. <source>Am. J. Bot.</source> <volume>109</volume>, <fpage>2030</fpage>&#x2013;<lpage>2050</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/ajb2.16084</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krause</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kilbienski</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Mulisch</surname> <given-names>M.</given-names>
</name>
<name>
<surname>R&#xf6;diger</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Sch&#xe4;fer</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Krupinksa</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>DNA-binding proteins of the Whirly family in <italic>Arabidiopsis thaliana</italic> are targeted to the organelles</article-title>. <source>Fed. Eur. Biochem. Societies Lett.</source> <volume>579</volume>, <fpage>3707</fpage>&#x2013;<lpage>3712</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.febslet.2005.05.059</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lai</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Chahtane</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Martin-Arevalillo</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Zubita</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Parcy</surname> <given-names>F.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Contrasted evolutionary trajectories of plant transcription factors</article-title>. <source>Curr. Opin. Plant Biol.</source> <volume>54</volume>, <fpage>101</fpage>&#x2013;<lpage>107</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.pbi.2020.03.002</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lallemand</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Martin-Magniette</surname> <given-names>M.-L.</given-names>
</name>
<name>
<surname>Gilard</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Gaki&#xe8;re</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Launay-Avon</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Delannoy</surname> <given-names>&#xc9;.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>
<italic>In situ</italic> transcriptomic and metabolomic study of the loss of photosynthesis in the leaves of mixotrophic plants exploiting fungi</article-title>. <source>Plant J.</source> <volume>98</volume>, <fpage>826</fpage>&#x2013;<lpage>841</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/tpj.14276</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Latchman</surname> <given-names>D. S.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Transcription factors: an overview</article-title>. <source>Int. J. Exp. Pathol.</source> <volume>74</volume>, <fpage>417</fpage>&#x2013;<lpage>422</lpage>.</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Leake</surname> <given-names>J. R.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>The biology of myco-heterotrophic (&#x201c;saprophytic&#x201d;) plants</article-title>. <source>New Phytol.</source> <volume>127</volume>, <fpage>171</fpage>&#x2013;<lpage>216</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.1994.tb04272.x</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Ruhlman</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Jansen</surname> <given-names>R. K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Unprecedented intraindividual structural heteroplasmy in <italic>Eleocharis</italic> (Cyperaceae, Poales) Plastomes</article-title>. <source>Genome Biol. Evol.</source> <volume>12</volume>, <fpage>641</fpage>&#x2013;<lpage>655</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/gbe/evaa076</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lehti-Shiu</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Panchy</surname> <given-names>N.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Uygun</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Shiu</surname> <given-names>S.-H.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Diversity, expansion, and evolutionary novelty of plant DNA-binding transcription factor families</article-title>. <source>Biochim. Biophys. Acta</source> <volume>1860</volume>, <fpage>3</fpage>&#x2013;<lpage>20</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.bbagrm.2016.08.005</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lemaire</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Huysmans</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Smets</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Merckx</surname> <given-names>V. S. F. T.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Rate accelerations in nuclear 18S rDNA of mycoheterotrophic and parasitic angiosperms</article-title>. <source>J. Plant Res.</source> <volume>124</volume>, <fpage>561</fpage>&#x2013;<lpage>576</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s10265-010-0395-5</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lepage</surname> <given-names>&#xc9;.</given-names>
</name>
<name>
<surname>Zampini</surname> <given-names>&#xc9;.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Plastid genome instability leads to reactive oxygen species production and plastid-to-nucleus retrograde signaling in <italic>Arabidopsis</italic>
</article-title>. <source>Plant Physiol.</source> <volume>163</volume>, <fpage>867</fpage>&#x2013;<lpage>881</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1104/pp.113.223560</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>H.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Minimap2: pairwise alignment for nucleotide sequences</article-title>. <source>Bioinformatics</source> <volume>34</volume>, <fpage>3094</fpage>&#x2013;<lpage>3100</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/bty191</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Handsaker</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Wysoker</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Fennell</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Ruan</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Homer</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2009</year>). <article-title>The sequence alignment/map (SAM) format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname> <given-names>Y. X.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Z. H.</given-names>
</name>
<name>
<surname>Schuiteman</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Chase</surname> <given-names>M. W.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>J.-W.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>W. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Phylogenomics of Orchidaceae based on plastid and mitochondrial genomes</article-title>. <source>Mol. Phylogenet. Evol.</source> <volume>139</volume>, <fpage>106540</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.ympev.2019.106540</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Shi</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Deng</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Ren</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Lin</surname> <given-names>W.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>H<sub>2</sub>O<sub>2</sub> as a feedback signal on dual-located WHIRLY1 associates with leaf senescence in <italic>Arabidopsis</italic>
</article-title>. <source>Cells</source> <volume>8</volume> (<issue>12</issue>), <fpage>1585</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.3390/cells8121585</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Schenke</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Cai</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>B.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Dual-localized WHIRLY1 affects alicyclic acid biosynthesis via coordination of ISOCHORISMATE SYNTHASE1, PHENYLALANINE AMMONIA LYASE1, and S-ADENOSYL-L-METHIONINE-DEPENDENT METHYLTRANSFERASE1</article-title>. <source>Plant Physiol.</source> <volume>1840</volume>, <fpage>1884</fpage>&#x2013;<lpage>1899</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1104/pp.20.00964</pub-id>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
</name>
<name>
<surname>Jung</surname> <given-names>J. C.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Deng</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Bernad</surname> <given-names>L.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in <italic>Arabidopsis</italic>
</article-title>. <source>Plant Cell</source> <volume>24</volume>, <fpage>4333</fpage>&#x2013;<lpage>4345</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1105/tpc.112.102855</pub-id>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lyko</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Wicke</surname> <given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Genomic reconfiguration in parasitic plants involves considerable gene losses alongside global genome size inflation and gene births</article-title>. <source>Plant Physiol.</source> <volume>186</volume>, <fpage>1412</fpage>&#x2013;<lpage>1423</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/plphys/kiab192</pub-id>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maddison</surname> <given-names>W. P.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Gene trees in species trees</article-title>. <source>Systematic Biol.</source> <volume>46</volume>, <fpage>523</fpage>&#x2013;<lpage>536</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/sysbio/46.3.523</pub-id>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mar&#xe9;chal</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Parent</surname> <given-names>J. S.</given-names>
</name>
<name>
<surname>V&#xe9;ronneau-Lafortune</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Joyeux</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Lang</surname> <given-names>B. F.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Whirly proteins maintain plastid genome stability in Arabidopsis</article-title>. <source>Proc. Natl. Acad. Sci. United States America</source> <volume>106</volume>, <fpage>14693</fpage>&#x2013;<lpage>14698</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.0901710106</pub-id>
</citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Merckx</surname> <given-names>V. S. F. T.</given-names>
</name>
<name>
<surname>Freudenstein</surname> <given-names>J. V.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Evolution of mycoheterotrophy in plants: a phylogenetic perspective</article-title>. <source>New Phytol.</source> <volume>185</volume>, <fpage>605</fpage>&#x2013;<lpage>609</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.2009.03155.x</pub-id>
</citation>
</ref>
<ref id="B71">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Merckx</surname> <given-names>V. S. F. T.</given-names>
</name>
<name>
<surname>Merckx</surname> <given-names>V. S. F. T.</given-names>
</name>
</person-group> (<year>2013</year>). &#x201c;<article-title>Mycoheterotrophy: an introduction</article-title>,&#x201d; in <source>Mycoheterotrophy: the biology of plants living on fungi</source>, <edition>1st ed</edition> (<publisher-name>Springer</publisher-name>, <publisher-loc>NY</publisher-loc>). doi:&#xa0;<pub-id pub-id-type="doi">10.1007/978-1-4614-5209-6</pub-id>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mergner</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Fresno</surname> <given-names>M.</given-names>
</name>
<name>
<surname>List</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Papacek</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Chaudhary</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>Mass-spectrometry-based draft of the <italic>Arabidopsis</italic> proteome</article-title>. <source>Nature</source> <volume>579</volume>, <fpage>409</fpage>&#x2013;<lpage>414</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41586-020-2094-2</pub-id>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>M&#xfc;nster</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Pahnke</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Di Rosa</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Kim</surname> <given-names>J. T.</given-names>
</name>
<name>
<surname>Martin</surname> <given-names>W.</given-names>
</name>
<name>
<surname>Saedler</surname> <given-names>H.</given-names>
</name>
<etal/>
</person-group>. (<year>1997</year>). <article-title>Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants</article-title>. <source>Proc. Natl. Acad. United States America</source> <volume>94</volume>, <fpage>3415</fpage>&#x2013;<lpage>2420</lpage>. doi:&#xa0;10.1073%2Fpnas.94.6.2415
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Murrell</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Moola</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Mabona</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Weighill</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Sheward</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Pond</surname> <given-names>S. L. K.</given-names>
</name>
<etal/>
</person-group>. (<year>2013</year>). <article-title>FUBAR: A Fast, Unconstrained Bayesian AppRoximation for inferring selection</article-title>. <source>Mol. Biol. Evol.</source> <volume>30)</volume>, <fpage>1196</fpage>&#x2013;<lpage>1205</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/mst030</pub-id>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Murrell</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Weaver</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Wertheim</surname> <given-names>J. O.</given-names>
</name>
<name>
<surname>Murrell</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Aylward</surname> <given-names>A.</given-names>
</name>
<etal/>
</person-group>. (<year>2015</year>). <article-title>Gene-wide identification of episodic selection</article-title>. <source>Mol. Biol. Evol.</source> <volume>32</volume>, <fpage>1365</fpage>&#x2013;<lpage>1371</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msv035</pub-id>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ner-Gaon</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Halachmi</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Savaldi-Goldstein</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Rubin</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Ophir</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Fluhr</surname> <given-names>R.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Intronretention is a major phenomenon in alternative splicing in <italic>Arabidopsis</italic>
</article-title>. <source>Plant J.</source> <volume>39</volume>, <fpage>877</fpage>&#x2013;<lpage>885</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1365-313x.2004.02172.x</pub-id>
</citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nguyen</surname> <given-names>L.-T.</given-names>
</name>
<name>
<surname>Schmidt</surname> <given-names>H. A.</given-names>
</name>
<name>
<surname>von Haeseler</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Minh</surname> <given-names>B. Q.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies</article-title>. <source>Mol. Biol. Evol.</source> <volume>32</volume>, <fpage>268</fpage>&#x2013;<lpage>274</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msu300</pub-id>
</citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pamilo</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Nei</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>Relationships between gene trees and species trees</article-title>. <source>Mol. Biol. Evol.</source> <volume>5</volume>, <fpage>568</fpage>&#x2013;<lpage>583</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a040517</pub-id>
</citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Parent</surname> <given-names>J.-S.</given-names>
</name>
<name>
<surname>Lepage</surname> <given-names>E.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Divergent roles for the two polI-like organelle DNA polymerases of <italic>Arabidopsis</italic>
</article-title>. <source>Plant Physiol.</source> <volume>156</volume>, <fpage>254</fpage>&#x2013;<lpage>262</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1104/pp.111.173849</pub-id>
</citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Patro</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Duggal</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Love</surname> <given-names>M. I.</given-names>
</name>
<name>
<surname>Irizarry</surname> <given-names>R. A.</given-names>
</name>
<name>
<surname>Kingsford</surname> <given-names>C.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Salmon provides fast and bias-aware quantification of transcript expression</article-title>. <source>Nat. Methods</source> <volume>14</volume>, <fpage>417</fpage>&#x2013;<lpage>419</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nmeth.4197</pub-id>
</citation>
</ref>
<ref id="B81">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Patron</surname> <given-names>N. J.</given-names>
</name>
<name>
<surname>Waller</surname> <given-names>R. F.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Transit peptide diversity and divergence: a global analysis of plastid targeting signals</article-title>. <source>BioEssays</source> <volume>29</volume>, <fpage>1048</fpage>&#x2013;<lpage>1058</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/bies.20638</pub-id>
</citation>
</ref>
<ref id="B82">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>P&#xe9;rez-Escobar</surname> <given-names>O. A.</given-names>
</name>
<name>
<surname>Bogar&#xed;n</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Przelomska</surname> <given-names>N. A. S.</given-names>
</name>
<name>
<surname>Ackerman</surname> <given-names>J. D.</given-names>
</name>
<name>
<surname>Balbuena</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Bellot</surname> <given-names>S.</given-names>
</name>
<etal/>
</person-group>. (<year>2024</year>). <article-title>The origin and speciation of orchids</article-title>. <source>New Phytol.</source> <volume>242</volume>, <fpage>700</fpage>&#x2013;<lpage>716</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/nph.19580</pub-id>
</citation>
</ref>
<ref id="B83">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>P&#xe9;rez-Escobar</surname> <given-names>O. A.</given-names>
</name>
<name>
<surname>Dodsworth</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Bogar&#xed;n</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Bellot</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Balbuena</surname> <given-names>J. A.</given-names>
</name>
<name>
<surname>Schely</surname> <given-names>R. J.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Hundreds of nuclear and plastid loci yield novel insights into orchid relationships</article-title>. <source>Am. J. Bot.</source> <volume>108</volume>, <fpage>1166</fpage>&#x2013;<lpage>1180</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/ajb2.1702</pub-id>
</citation>
</ref>
<ref id="B84">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Pond</surname> <given-names>S. L. K.</given-names>
</name>
<name>
<surname>Muse</surname> <given-names>S. V.</given-names>
</name>
</person-group> (<year>2005</year>). &#x201c;<article-title>HyPhy: hypothesis testing using phylogenies</article-title>,&#x201d; in <source>Statistical methods in molecular evolution</source>, <edition>1st ed</edition>. Ed. <person-group person-group-type="editor">
<name>
<surname>Nielsen</surname> <given-names>R.</given-names>
</name>
</person-group> (<publisher-name>Springer</publisher-name>, <publisher-loc>NY</publisher-loc>).</citation>
</ref>
<ref id="B85">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pond</surname> <given-names>S. L. K.</given-names>
</name>
<name>
<surname>Posada</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Gravenor</surname> <given-names>M. B.</given-names>
</name>
<name>
<surname>Woelk</surname> <given-names>C. H.</given-names>
</name>
<name>
<surname>Frost</surname> <given-names>S. D. W.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>GARD: a genetic algorithm for recombination detection</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>3096</fpage>&#x2013;<lpage>3098</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btl474</pub-id>
</citation>
</ref>
<ref id="B86">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prikryl</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Watkins</surname> <given-names>K. P.</given-names>
</name>
<name>
<surname>Friso</surname> <given-names>G.</given-names>
</name>
<name>
<surname>van Wijk</surname> <given-names>K. J.</given-names>
</name>
<name>
<surname>Barkan</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>A member of the Whirly family is a multifunctional RNA- and DNA-binding protein that is essential for chloroplast biogenesis</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>, <fpage>5152</fpage>&#x2013;<lpage>5165</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gkn492</pub-id>
</citation>
</ref>
<ref id="B87">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qiu</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Teng</surname> <given-names>L.</given-names>
</name>
<name>
<surname>Guan.</surname> <given-names>P.</given-names>
</name>
<name>
<surname>Yu</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>P.</given-names>
</name>
<etal/>
</person-group>. (<year>2022</year>). <article-title>OsWHY1 interacts with OsTRXz and is essential for early chloroplast development in rice</article-title>. <source>Rice</source> <volume>15</volume>, <fpage>50</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s12284-022-00596-y</pub-id>
</citation>
</ref>
<ref id="B88">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ranwez</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Harispe</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Delsuc</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Douzery</surname> <given-names>E. J. P.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>MACSE: multiple alignment ofcoding sequences accounting for frameshifts and stop codons</article-title>. <source>PloS One</source> <volume>6</volume>, <elocation-id>e22594</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.pone.0022594</pub-id>
</citation>
</ref>
<ref id="B89">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Rasmussen</surname> <given-names>H. N.</given-names>
</name>
</person-group> (<year>1995</year>). <source>Terrestrial orchids: from seed to mycotrophic plant</source> (<publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>). doi:&#xa0;<pub-id pub-id-type="doi">10.1017/CBO9780511525452</pub-id>
</citation>
</ref>
<ref id="B90">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rasmussen</surname> <given-names>H. N.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Recent developments in the study of orchid mycorrhiza</article-title>. <source>Plant Soil</source> <volume>244</volume>, <fpage>149</fpage>&#x2013;<lpage>163</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1023/A:1020246715436</pub-id>
</citation>
</ref>
<ref id="B91">
<citation citation-type="book">
<person-group person-group-type="author">
<collab>R Core Team</collab>
</person-group> (<year>2019</year>). <source>R: A language and environment for statistical computing</source> (<publisher-loc>Vienna, Austria</publisher-loc>: <publisher-name>The R Foundation</publisher-name>). Available at: <uri xlink:href="https://www.R-project.org/">https://www.R-project.org/</uri>.</citation>
</ref>
<ref id="B92">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reddy</surname> <given-names>A. S.</given-names>
</name>
<name>
<surname>Marquez</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Kalyna</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Barta</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Complexity of the alternative splicing landscape in plants</article-title>. <source>Plant Cell</source> <volume>25</volume>, <fpage>3657</fpage>&#x2013;<lpage>3683</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1105/tpc.113.117523</pub-id>
</citation>
</ref>
<ref id="B93">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ren</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Jiang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Wu</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Miao</surname> <given-names>Y.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Phosphorylation of WHIRLY1 by CIPK14 shifts its localization and dual functions in <italic>Arabidopsis</italic>
</article-title>. <source>Mol. Plant</source> <volume>10</volume>, <fpage>749</fpage>&#x2013;<lpage>763</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1016/j.molp.2017.03.011</pub-id>
</citation>
</ref>
<ref id="B94">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robinson</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>McCarthy</surname> <given-names>D. J.</given-names>
</name>
<name>
<surname>Smyth</surname> <given-names>G. K.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>edgeR: a Bioconductor package for differential expression analysis of digital gene expression data</article-title>. <source>Bioinformatics</source> <volume>26</volume>, <fpage>139</fpage>&#x2013;<lpage>140</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btp616</pub-id>
</citation>
</ref>
<ref id="B95">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Robinson</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Oshlack</surname> <given-names>A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>A scaling normalization method for differential expression analysis of RNA-seq data</article-title>. <source>Genome Biol.</source> <volume>11</volume>, <fpage>R25</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/gb-2010-11-3-r25</pub-id>
</citation>
</ref>
<ref id="B96">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruan</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Zhu</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Wei</surname> <given-names>B.</given-names>
</name>
<etal/>
</person-group>. (<year>2022</year>). <article-title>Genome-wide identification, phylogenetic, and expression analysis under abiotic stress conditions of Whirly (WHY) gene family in <italic>Medicago sativa</italic> L</article-title>. <source>Sci. Rep.</source> <volume>12</volume>, <fpage>18676</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-022-22658-3</pub-id>
</citation>
</ref>
<ref id="B97">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schelkunov</surname> <given-names>M. I.</given-names>
</name>
<name>
<surname>Shtratnikova</surname> <given-names>V. Y.</given-names>
</name>
<name>
<surname>Nuraliev</surname> <given-names>M. S.</given-names>
</name>
<name>
<surname>Selosse</surname> <given-names>M.-A.</given-names>
</name>
<name>
<surname>Penin</surname> <given-names>A. A.</given-names>
</name>
<name>
<surname>D.Logacheva</surname> <given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Exploring the limits for reduction of plastid genomes: a case study of the mycoheterotrophic orchids <italic>Epipogium aphyllum</italic> and <italic>Epipogium roseum</italic>
</article-title>. <source>Genome Biol. Evol.</source> <volume>7</volume>, <fpage>1179</fpage>&#x2013;<lpage>1191</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/gbe/evv019</pub-id>
</citation>
</ref>
<ref id="B98">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Serna-S&#xe1;nchez</surname> <given-names>M. A.</given-names>
</name>
<name>
<surname>P&#xe9;rez-Escobar</surname> <given-names>O. A.</given-names>
</name>
<name>
<surname>Bogar&#x131;n</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Torres-Jimenez</surname> <given-names>F. F.</given-names>
</name>
<name>
<surname>Alvarez-Yela</surname> <given-names>A. C.</given-names>
</name>
<name>
<surname>Arcila-Galvis</surname> <given-names>J. E.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Plastid phylogenomics resolves ambiguous relationships within the orchid family and provides a solid timeframe for biogeography and macroevolution</article-title>. <source>Sci. Rep.</source> <volume>11</volume>, <fpage>6858</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41598-021-83664-5</pub-id>
</citation>
</ref>
<ref id="B99">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sinn</surname> <given-names>B. T.</given-names>
</name>
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Ancient mitochondrial gene transfer between fungi and theorchids</article-title>. <source>Mol. Biol. Evol.</source> <volume>37</volume>, <fpage>44</fpage>&#x2013;<lpage>57</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msz198</pub-id>
</citation>
</ref>
<ref id="B100">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Suetsugu</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Yamato</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Miura</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yamaguchi</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Takahashi</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Ida</surname> <given-names>Y.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Comparison of green and albino individuals of the partially mycoheterotrophic orchid <italic>Epipactis helleborine</italic> on molecular identities of mycorrhizal fungi, nutritional modes and gene expression in mycorrhizal roots</article-title>. <source>Mol. Ecol.</source> <volume>26</volume>, <fpage>1652</fpage>&#x2013;<lpage>1669</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/mec.14021</pub-id>
</citation>
</ref>
<ref id="B101">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>X.</given-names>
</name>
</person-group> (<year>2023</year>). <article-title>WRKY1 represses the WHIRLY1 transcription factor to positively regulate plant defense against geminivirus infection</article-title>. <source>PloS Pathog.</source> <volume>19</volume>, <elocation-id>e1011319</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1371/journal.ppat.1011319</pub-id>
</citation>
</ref>
<ref id="B102">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taylor</surname> <given-names>D. L.</given-names>
</name>
<name>
<surname>Bruns</surname> <given-names>T. D.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Independent, specialized invasions of ectomycorrhizal mutualism by two nonphotosynthetic orchids</article-title>. <source>Proc. Natl. Acad. United States America.</source> <volume>94</volume>, <fpage>4510</fpage>&#x2013;<lpage>4515</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.94.9.4510</pub-id>
</citation>
</ref>
<ref id="B103">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taylor</surname> <given-names>R. E.</given-names>
</name>
<name>
<surname>West</surname> <given-names>C. E.</given-names>
</name>
<name>
<surname>Foyer</surname> <given-names>C. H.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>WHIRLY protein functions in plants</article-title>. <source>Food Energy Secur.</source> <volume>00</volume>, <elocation-id>e379</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1002/fes3.379</pub-id>
</citation>
</ref>
<ref id="B104">
<citation citation-type="journal">
<person-group person-group-type="author">
<collab>The Angiosperm Phylogeny Group</collab>
<name>
<surname>Chase</surname> <given-names>M. W.</given-names>
</name>
<name>
<surname>Christenhusz</surname> <given-names>M. J. M.</given-names>
</name>
<name>
<surname>Fay</surname> <given-names>M. F.</given-names>
</name>
<name>
<surname>Byng</surname> <given-names>J. W.</given-names>
</name>
<name>
<surname>Judd</surname> <given-names>W. S.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>An update of the Angiosperm Phylogeny Classification for the orders and families of flowering plants: APG IV</article-title>. <source>Botanical J. Linn. Society.</source> <volume>181</volume>, <fpage>1</fpage>&#x2013;<lpage>20</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/boj.12385</pub-id>
</citation>
</ref>
<ref id="B105">
<citation citation-type="journal">
<person-group person-group-type="author">
<collab>The Potato Genome Sequencing Consortium</collab>
</person-group> (<year>2011</year>). <article-title>Genome sequence and analysis of the tuber crop potato</article-title>. <source>Nature</source> <volume>475</volume>, <fpage>189</fpage>&#x2013;<lpage>195</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nature10158</pub-id>
</citation>
</ref>
<ref id="B106">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Timilsena</surname> <given-names>P. R.</given-names>
</name>
<name>
<surname>Barrett</surname> <given-names>C. F.</given-names>
</name>
<name>
<surname>Pi&#xf1;eyro-Nelson</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Wafula</surname> <given-names>E. K.</given-names>
</name>
<name>
<surname>Ayyampalayam</surname> <given-names>S.</given-names>
</name>
<name>
<surname>McNeal</surname> <given-names>J. R.</given-names>
</name>
<etal/>
</person-group>. (<year>2023</year>). <article-title>Phylotranscriptomic analyses of mycoheterotrophic monocots show a continuum of convergent evolutionary changes in expressed nuclear genes from three independent nonphotosynthetic lineages</article-title>. <source>Genome Biol. Evol.</source> <volume>15</volume>, <elocation-id>evac183</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/gbe/evac183</pub-id>
</citation>
</ref>
<ref id="B107">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Untergasser</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Cutcutache</surname> <given-names>I.</given-names>
</name>
<name>
<surname>Koressaar</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Ye</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Faircloth</surname> <given-names>B. C.</given-names>
</name>
<name>
<surname>Remm</surname> <given-names>M.</given-names>
</name>
<etal/>
</person-group>. (<year>2012</year>). <article-title>Primer3-new capabilities and interfaces</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <elocation-id>e115</elocation-id>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/nar/gks596</pub-id>
</citation>
</ref>
<ref id="B108">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Waterworth</surname> <given-names>W. M.</given-names>
</name>
<name>
<surname>Drury</surname> <given-names>G. E.</given-names>
</name>
<name>
<surname>Bray</surname> <given-names>C. M.</given-names>
</name>
<name>
<surname>West</surname> <given-names>C. E.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Repairing breaks in the plant genome: the importance of keeping it together</article-title>. <source>New Phytol.</source> <volume>192</volume>, <fpage>805</fpage>&#x2013;<lpage>822</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.2011.03926.x</pub-id>
</citation>
</ref>
<ref id="B109">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wertheim</surname> <given-names>J. O.</given-names>
</name>
<name>
<surname>Murrell</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Smith</surname> <given-names>M. D.</given-names>
</name>
<name>
<surname>Pond</surname> <given-names>S. L. K.</given-names>
</name>
<name>
<surname>Scheffler</surname> <given-names>K.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>RELAX:detecting relaxed selection in a phylogenetic framework</article-title>. <source>Mol. Biol. Evol.</source> <volume>32)</volume>, <fpage>820</fpage>&#x2013;<lpage>832</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/molbev/msu400</pub-id>
</citation>
</ref>
<ref id="B110">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wheeler</surname> <given-names>T. J.</given-names>
</name>
<name>
<surname>Eddy</surname> <given-names>S. R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>nhmmer: DNA homology search with profile HMMs</article-title>. <source>Bioinformatics</source> <volume>29</volume> (<issue>19</issue>), <fpage>2487</fpage>&#x2013;<lpage>2489</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/bioinformatics/btt403</pub-id>
</citation>
</ref>
<ref id="B111">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wicke</surname> <given-names>S.</given-names>
</name>
<name>
<surname>M&#xfc;ller</surname> <given-names>K. F.</given-names>
</name>
<name>
<surname>dePamphilis</surname> <given-names>C. W.</given-names>
</name>
<name>
<surname>Quandt</surname> <given-names>D.</given-names>
</name>
<name>
<surname>Bellot</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Schneeweiss</surname> <given-names>G. M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Mechanistic model of evolutionary rate variation en route to a nonphotosynthetic lifestyle in plants</article-title>. <source>Proc. Natl. Acad. Sci. United States America</source> <volume>113</volume>, <fpage>9045</fpage>&#x2013;<lpage>9050</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1073/pnas.1607576113</pub-id>
</citation>
</ref>
<ref id="B112">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
</name>
<name>
<surname>Mueller</surname> <given-names>F. L. A.</given-names>
</name>
<name>
<surname>Crouzillat</surname> <given-names>D.</given-names>
</name>
<name>
<surname>P&#xe9;tiard</surname> <given-names>V.</given-names>
</name>
<name>
<surname>Tanksley</surname> <given-names>S. D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Combining bioinformatics and phylogenetics to identify large sets of single-copy orthologous genes (COSII) for comparative, evolutionary, and systematic studies: a test case in the euasterid plant clade</article-title>. <source>Genetics</source> <volume>174</volume>, <fpage>1407</fpage>&#x2013;<lpage>1420</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1534/genetics.106.062455</pub-id>
</citation>
</ref>
<ref id="B113">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Lei</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Su</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Shen</surname> <given-names>G.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>A chromosome-scale <italic>Gastrodia elata</italic> genome and large-scale comparative genomic analysis indicate convergent evolution by gene loss in mycoheterotrophic and parasitic plants</article-title>. <source>Plant J.</source> <volume>108</volume>, <fpage>1609</fpage>&#x2013;<lpage>1623</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/tpj.15528</pub-id>
</citation>
</ref>
<ref id="B114">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yoo</surname> <given-names>H. H.</given-names>
</name>
<name>
<surname>Kwon</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Lee</surname> <given-names>M. M.</given-names>
</name>
<name>
<surname>Chung</surname> <given-names>I. K.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Single-stranded DNA binding factor AtWHY1 modulates telomere length homeostasis in <italic>Arabidopsis</italic>
</article-title>. <source>Plant J.</source> <volume>49</volume>, <fpage>442</fpage>&#x2013;<lpage>451</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1365-313X.2006.02974.x</pub-id>
</citation>
</ref>
<ref id="B115">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Jin</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Zhou</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>X.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>The <italic>Gastrodia elata</italic> genome provides insights into plant adaptation to heterotrophy</article-title>. <source>Nat. Commun.</source> <volume>9</volume>, <fpage>1615</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41467-018-03423-5</pub-id>
</citation>
</ref>
<ref id="B116">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zampini</surname> <given-names>&#xc9;.</given-names>
</name>
<name>
<surname>Lepage</surname> <given-names>&#xc9;.</given-names>
</name>
<name>
<surname>Tremblay-Belzile</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Truche</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Brisson</surname> <given-names>N.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Organelle DNA rearrangement mapping reveals U-turn-like inversions as a major source of genomic instability in <italic>Arabidopsis</italic> and humans</article-title>. <source>Genome Res.</source> <volume>25</volume>, <fpage>645</fpage>&#x2013;<lpage>654</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1101/gr.188573.114</pub-id>
</citation>
</ref>
<ref id="B117">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zeng</surname> <given-names>X.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Ling</surname> <given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>J.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>Transcriptomic analyses reveal clathrin-mediated endocytosis involved in symbiotic seed germination of <italic>Gastrodia elata</italic>
</article-title>. <source>Botanical Stud.</source> <volume>58</volume>, <fpage>31</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1186/s40529-017-0185-7</pub-id>
</citation>
</ref>
<ref id="B118">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>G.</given-names>
</name>
<name>
<surname>Hu</surname> <given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>M.-Z.</given-names>
</name>
<name>
<surname>Huang</surname> <given-names>W.-C.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>D.-K.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>D.</given-names>
</name>
<etal/>
</person-group>. (<year>2023</year>). <article-title>Comprehensive phylogenetic analyses of Orchidaceae using nuclear genes and evolutionary insights into epiphytism</article-title>. <source>J. Integr. Plant Biol.</source> <volume>65</volume>, <fpage>1204</fpage>&#x2013;<lpage>1225</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/jipb.13462</pub-id>
</citation>
</ref>
<ref id="B119">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>G.-Q.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>K.-W.</given-names>
</name>
<name>
<surname>Lohaus</surname> <given-names>R.</given-names>
</name>
<name>
<surname>Hsiao</surname> <given-names>Y.-Y.</given-names>
</name>
<name>
<surname>Niu</surname> <given-names>S.-C.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.-Y.</given-names>
</name>
<etal/>
</person-group>. (<year>2017</year>). <article-title>The <italic>Apostasia</italic> genome and the evolution of orchids</article-title>. <source>Nature</source> <volume>549</volume> (<issue>7672</issue>), <fpage>379</fpage>&#x2013;<lpage>383</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/nature23897</pub-id>
</citation>
</ref>
<ref id="B120">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname> <given-names>G.-Q.</given-names>
</name>
<name>
<surname>Xu</surname> <given-names>Q.</given-names>
</name>
<name>
<surname>Bian</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Tsai</surname> <given-names>W.-C.</given-names>
</name>
<name>
<surname>Yeh</surname> <given-names>C.-M.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>K.-W.</given-names>
</name>
<etal/>
</person-group>. (<year>2016</year>). <article-title>The <italic>Dendrobium catenatum</italic> Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution</article-title>. <source>Sci. Rep.</source> <volume>6</volume>, <fpage>19029</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/srep19029</pub-id>
</citation>
</ref>
<ref id="B121">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>S. Y.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>G. D.</given-names>
</name>
<name>
<surname>Zhao</surname> <given-names>W. Y.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Kong</surname> <given-names>F. Y.</given-names>
</name>
<name>
<surname>Dong</surname> <given-names>X. C.</given-names>
</name>
<etal/>
</person-group>. (<year>2018</year>). <article-title>Overexpression of tomato WHIRLY protein enhances tolerance to drought stress and resistance to <italic>Pseudomonas solanacearum</italic> in transgenic tobacco</article-title>. <source>Biol. Plantarum</source> <volume>62</volume>, <fpage>55</fpage>&#x2013;<lpage>68</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1007/s10535-017-0714-y</pub-id>
</citation>
</ref>
<ref id="B122">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname> <given-names>T.</given-names>
</name>
<name>
<surname>Zwaenepoel</surname> <given-names>A.</given-names>
</name>
<name>
<surname>Xue</surname> <given-names>J.-Y.</given-names>
</name>
<name>
<surname>Kao</surname> <given-names>S.-M.</given-names>
</name>
<name>
<surname>Li</surname> <given-names>Z.</given-names>
</name>
<name>
<surname>Schranz</surname> <given-names>M. E.</given-names>
</name>
<etal/>
</person-group>. (<year>2021</year>). <article-title>Whole-genome microsynteny-based phylogeny of angiosperms</article-title>. <source>Nat. Commun.</source> <volume>12</volume>, <fpage>3498</fpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1038/s41467-021-23665-0</pub-id>
</citation>
</ref>
<ref id="B123">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuang</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Kong</surname> <given-names>F.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>S.</given-names>
</name>
<name>
<surname>Meng</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Yang</surname> <given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname> <given-names>Z.</given-names>
</name>
<etal/>
</person-group>. (<year>2019</year>). <article-title>Whirly1 enhances tolerance to chilling stress in tomato via protection of photosystem II and regulation of starch degradation</article-title>. <source>New Phytol.</source> <volume>221</volume>, <fpage>1998</fpage>&#x2013;<lpage>2012</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/nph.15532</pub-id>
</citation>
</ref>
<ref id="B124">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuang</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Wang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Jiao</surname> <given-names>B.</given-names>
</name>
<name>
<surname>Chen</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname> <given-names>J.</given-names>
</name>
<name>
<surname>Ma</surname> <given-names>N.</given-names>
</name>
<etal/>
</person-group>. (<year>2020</year>). <article-title>WHIRLY1 maintains leaf photosynthetic capacity in tomato by regulating the expression of <italic>RbcS1</italic> under chilling stress</article-title>. <source>J. Exp. Bot.</source> <volume>12</volume>, <fpage>3653</fpage>&#x2013;<lpage>3663</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1093/jxb/eraa145</pub-id>
</citation>
</ref>
<ref id="B125">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zimmer</surname> <given-names>K.</given-names>
</name>
<name>
<surname>Meyer</surname> <given-names>C.</given-names>
</name>
<name>
<surname>Gebauer</surname> <given-names>G.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>The ectomycorrhizal specialist orchidCorallorhiza trifida is a partial myco-heterotroph</article-title>. <source>New Phytol.</source> <volume>178</volume>, <fpage>395</fpage>&#x2013;<lpage>400</lpage>. doi:&#xa0;<pub-id pub-id-type="doi">10.1111/j.1469-8137.2007.02362.x</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>