<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2021.771983</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Hypothesis and Theory</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Poorly Conserved P15 Proteins of Cileviruses Retain Elements of Common Ancestry and Putative Functionality: A Theoretical Assessment on the Evolution of Cilevirus Genomes</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>Pedro L.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/204483/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pons</surname> <given-names>Tirso</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1270550/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Chabi-Jesus</surname> <given-names>Camila</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/390843/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Arena</surname> <given-names>Gabriella Dias</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/362745/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Freitas-Astua</surname> <given-names>Juliana</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/362723/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Laborat&#x00F3;rio de Biologia Molecular Aplicada, Instituto Biol&#x00F3;gico de S&#x00E3;o Paulo</institution>, <addr-line>S&#x00E3;o Paulo</addr-line>, <country>Brazil</country></aff>
<aff id="aff2"><sup>2</sup><institution>National Centre for Biotechnology (CNB-CSIC)</institution>, <addr-line>Madrid</addr-line>, <country>Spain</country></aff>
<aff id="aff3"><sup>3</sup><institution>Escola Superior de Agricultura Luiz de Queiroz (ESALQ), Universidade de S&#x00E3;o Paulo</institution>, <addr-line>Piracicaba</addr-line>, <country>Brazil</country></aff>
<aff id="aff4"><sup>4</sup><institution>Embrapa Mandioca e Fruticultura</institution>, <addr-line>Cruz das Almas</addr-line>, <country>Brazil</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Ken Komatsu, Tokyo University of Agriculture and Technology, Japan</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Ken-Taro Sekine, University of the Ryukyus, Japan; Alejandro Olmedo-Velarde, University of Hawaii, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Pedro L. Ramos-Gonz&#x00E1;lez, <email>plrg1970@gmail.com</email></corresp>
<corresp id="c002">Juliana Freitas-Astua, <email>juliana.astua@embrapa.br</email></corresp>
<fn fn-type="equal" id="fn002"><p><sup>&#x2020;</sup>These authors have contributed equally to this work</p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Plant Pathogen Interactions, a section of the journal Frontiers in Plant Science</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>11</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>771983</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>10</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Ramos-Gonz&#x00E1;lez, Pons, Chabi-Jesus, Arena and Freitas-Astua.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Ramos-Gonz&#x00E1;lez, Pons, Chabi-Jesus, Arena and Freitas-Astua</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The genus <italic>Cilevirus</italic> groups enveloped single-stranded (+) RNA virus members of the family <italic>Kitaviridae</italic>, order <italic>Martellivirales</italic>. Proteins P15, scarcely conserved polypeptides encoded by cileviruses, have no apparent homologs in public databases. Accordingly, the open reading frames (ORFs) <italic>p15</italic>, located at the 5&#x2032;-end of the viral RNA2 molecules, are considered orphan genes (ORFans). In this study, we have delved into ORFs <italic>p15</italic> and the relatively poorly understood biochemical properties of the proteins P15 to posit their importance for viruses across the genus and theorize on their origin. We detected that the ORFs <italic>p15</italic> are under purifying selection and that, in some viral strains, the use of synonymous codons is biased, which might be a sign of adaptation to their plant hosts. Despite the high amino acid sequence divergence, proteins P15 show the conserved motif [FY]-L-x(3)-[FL]-H-x-x-[LIV]-S-C-x-C-x(2)-C-x-G-x-C, which occurs exclusively in members of this protein family. Proteins P15 also show a common predicted 3D structure that resembles the helical scaffold of the protein ORF49 encoded by radinoviruses and the phosphoprotein C-terminal domain of mononegavirids. Based on the 3D structural similarities of P15, we suggest elements of common ancestry, conserved functionality, and relevant amino acid residues. We conclude by postulating a plausible evolutionary trajectory of ORFans <italic>p15</italic> and the 5&#x2032;-end of the RNA2 of cileviruses considering both protein fold superpositions and comparative genomic analyses with the closest kitaviruses, negeviruses, nege/kita-like viruses, and unrelated viruses that share the ecological niches of cileviruses.</p>
</abstract>
<kwd-group>
<kwd><italic>Kitaviridae</italic></kwd>
<kwd>orphan ORF</kwd>
<kwd>structure-based phylogenetic analysis</kwd>
<kwd>synonymous codon usage bias</kwd>
<kwd>horizontal gene transfer</kwd>
<kwd><italic>Dichorhavirus</italic></kwd>
<kwd>miniproteins</kwd>
<kwd>small ORF</kwd>
</kwd-group>
<contract-num rid="cn001">PNPD20132154 - 33141010001P4 - PNPD &#x201C;&#x201D; IBSP</contract-num>
<contract-num rid="cn003">2017/50222-0</contract-num>
<contract-num rid="cn003">2019/25078-9</contract-num>
<contract-num rid="cn003">2019/02137-0</contract-num>
<contract-sponsor id="cn001">Coordena&#x00E7;&#x00E3;o de Aperfei&#x00E7;oamento de Pessoal de N&#x00ED;vel Superior<named-content content-type="fundref-id">10.13039/501100002322</named-content></contract-sponsor><contract-sponsor id="cn002">Conselho Nacional de Desenvolvimento Cient&#x00ED;fico e Tecnol&#x00F3;gico<named-content content-type="fundref-id">10.13039/501100003593</named-content></contract-sponsor>
<contract-sponsor id="cn003">Funda&#x00E7;&#x00E3;o de Amparo &#x00E0; Pesquisa do Estado de S&#x00E3;o Paulo<named-content content-type="fundref-id">10.13039/501100001807</named-content></contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="113"/>
<page-count count="17"/>
<word-count count="13791"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="S1">
<title>Introduction</title>
<p>Orphan genes (ORFans) code for proteins with unrecognized homologs in any other species (<xref ref-type="bibr" rid="B33">Fischer and Eisenberg, 1999</xref>; <xref ref-type="bibr" rid="B5">Arendsee et al., 2014</xref>). They are present in a variety of organisms, including prokaryotes, eukaryotes, and viruses, reaching up to one-third of the genome of some microorganisms and 50% to two-thirds in mimiviruses (<xref ref-type="bibr" rid="B33">Fischer and Eisenberg, 1999</xref>; <xref ref-type="bibr" rid="B49">Kaur Saini and Fischer, 2007</xref>; <xref ref-type="bibr" rid="B111">Yin and Fischer, 2008</xref>; <xref ref-type="bibr" rid="B103">Tautz and Domazet-Lo&#x0161;o, 2011</xref>; <xref ref-type="bibr" rid="B18">Colson et al., 2017</xref>). The quick origin of ORFans may provide adaptive mechanisms conferring rapid fitness to changing environments (<xref ref-type="bibr" rid="B31">Entwistle et al., 2019</xref>). Most of the known proteins encoded by ORFans interact with conserved proteins like transcription factors or receptors, acting as toxins, or as modulators of metabolic or regulatory networks (<xref ref-type="bibr" rid="B101">Singh and Wurtele, 2020</xref>).</p>
<p><italic>Kitaviridae</italic>, order <italic>Martellivirales</italic>, is a family of heterogeneous plant-infecting viruses displaying two, three, or four segments of single-stranded positive-sense RNA molecules as genomes, which have been assigned into the genera <italic>Cilevirus</italic>, <italic>Higrevirus</italic>, and <italic>Blunervirus</italic>, respectively (<xref ref-type="bibr" rid="B87">Quito-Avila et al., 2021</xref>). Taken together, kitavirids have been detected in a relatively narrow range of natural hosts, where they normally produce local infections mainly characterized by chlorotic and/or necrotic lesions that in some cases resemble the outcome of a hypersensitive-like response (<xref ref-type="bibr" rid="B3">Arena et al., 2016</xref>, <xref ref-type="bibr" rid="B2">2020</xref>). However, regardless of their failure to consummate the systemic movement through the plants, kitavirids pose serious threats to major crops such as citrus (<xref ref-type="bibr" rid="B88">Ramos-Gonz&#x00E1;lez et al., 2018</xref>). Likely most kitaviruses are transmitted by mites, but an effective transmission by <italic>Brevipalpus</italic> mites has been only confirmed for cileviruses (<xref ref-type="bibr" rid="B51">Kitajima and Alberti, 2014</xref>; <xref ref-type="bibr" rid="B87">Quito-Avila et al., 2021</xref>).</p>
<p>With several strains and isolates belonging to three species already described at the biological and molecular level, <italic>Cilevirus</italic> is the best-studied genus of kitavirids (<xref ref-type="bibr" rid="B35">Freitas-Ast&#x00FA;a et al., 2018</xref>; <xref ref-type="bibr" rid="B87">Quito-Avila et al., 2021</xref>). Virions of citrus leprosis virus C (CiLV-C), citrus leprosis virus C2 (CiLV-C2), and passion fruit green spot virus (PfGSV) are enveloped, short bacilliform particles that encapsidate two poly-adenylated RNA molecules (<xref ref-type="bibr" rid="B66">Locali-Fabris et al., 2006</xref>; <xref ref-type="bibr" rid="B94">Roy et al., 2013</xref>; <xref ref-type="bibr" rid="B51">Kitajima and Alberti, 2014</xref>; <xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>). The canonical genome of cileviruses contains six open reading frames (ORF) distributed in two RNA segments. ORFs <italic>RdRp</italic> (RNA dependent-RNA polymerase) and <italic>p29</italic> (putative coat protein) are in RNA 1 (&#x2248;9.0 kb); whilst the ORFs <italic>p32</italic> (movement protein), <italic>p61</italic>, <italic>p24</italic>, and <italic>p15</italic> are contained in the RNA2 (&#x2248;5.0 kb) (<xref ref-type="fig" rid="F1">Figure 1</xref>). RNA2 in certain cileviruses also includes some accessory ORFs, for instance, <italic>p7</italic> in CiLV-C2, and <italic>p11</italic>-<italic>13</italic> in PfGSV. ORFs <italic>p61</italic> and <italic>p24</italic> encode proteins that are likely involved in the virion structure (<xref ref-type="bibr" rid="B102">Solovyev and Morozov, 2017</xref>). P24 is a transmembrane (TM) protein that is also probably an integral component of the virus factory-like membranes, whereas P61 is a putative glycoprotein that in the case of CiLV-C acts as a viral effector in plants (<xref ref-type="bibr" rid="B59">Kuchibhatla et al., 2014</xref>; <xref ref-type="bibr" rid="B3">Arena et al., 2016</xref>, <xref ref-type="bibr" rid="B2">2020</xref>; <xref ref-type="bibr" rid="B63">Leastro et al., 2018</xref>; <xref ref-type="bibr" rid="B108">Vinokurov and Koloniuk, 2019</xref>). ORF <italic>p61</italic> is taxonomically restricted to cileviruses, whereas, besides in the kitavirids, <italic>p24</italic> is also present in an increasingly discovered number of arthropod-infecting viruses including mosquito-specific viruses of the proposed group Negevirus (clades Nelorpivirus and Sandewavirus) (<xref ref-type="bibr" rid="B106">Vasilakis et al., 2013</xref>; <xref ref-type="bibr" rid="B59">Kuchibhatla et al., 2014</xref>; <xref ref-type="bibr" rid="B55">Kondo et al., 2020</xref>). P24 displays the highly conserved motif SP24 (TM-SP24, structure protein of 24 kDa, Pfam code PF16504). The glycoproteins P61 of cileviruses lack the motifs DiSA (Pfam code PF19226) and DiSB-ORF2_chro B (Pfam code PF16506) detected in the glycoproteins encoded by their taxonomically related arthropod-infecting viruses (<xref ref-type="bibr" rid="B59">Kuchibhatla et al., 2014</xref>). ORF <italic>p15</italic>, only present in cileviruses, encodes the smallest annotated polypeptide across the genomes of these viruses (<xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Linear genomic maps of some members of the family <italic>Kitaviridae</italic> and some of their closest phylogenetically arthropod-infecting viruses. <bold>(A)</bold> Two members of the group Negevirus, <bold>(B)</bold> two members of the group centivirus, <bold>(C)</bold> unclassified kita/nege-like virus, <bold>(D)</bold> the sole member of the genus <italic>Higrevirus</italic>, <bold>(E)</bold> a cile-like virus, and <bold>(F)</bold> members of the genus <italic>Cilevirus</italic>. Open reading frames are represented by boxes and the fill colors indicate a conserved functional or structural relationship between them. White boxes indicate unknown features. The solid gray background box in <bold>(F)</bold> highlights the 5&#x2032;-end of the RNA2 genomic segment of cileviruses. The purple ribbon points out the position of the ORFs containing the SP-24 motif (structural protein of 24 kDa, Pfam code PF16504). Small geometric symbols show the presence of relevant sequences as described in the legend box.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g001.tif"/>
</fig>
<p>In this study, based on an integrator process comprising genomics, population genetics, usage of synonymous codons, and tridimensional (3D) structures, we elaborated a hypothesis about the conserved putative functionality of the P15 proteins across the genus. Following a chain of logical thinking, we theorize on the origin and evolution of the ORF <italic>p15</italic> considering the genomic context in the 5&#x2032;-end of the RNA2 in cileviruses. A meaningful part of the study has been founded on the comparisons of the protein folds using the principles of structure-based phylogenetic analyses.</p>
</sec>
<sec id="S2">
<title>The Open Reading Frames <italic>p15</italic> of Cileviruses Are Under Purifying Selection and Encode Poorly Conserved Proteins</title>
<p>Citrus leprosis virus C is the best-characterized cilevirus at both molecular and epidemiological levels. Its population is subdivided into three clades (<xref ref-type="bibr" rid="B89">Ramos-Gonz&#x00E1;lez et al., 2016</xref>; <xref ref-type="bibr" rid="B16">Chabi-Jesus et al., 2021</xref>). Average identity values of genomic nucleotide sequences between CiLV-C isolates belonging to different lineages range from 85 to 89%. Contrastingly, a highly conserved segment of approximately 1.5 kb, showing almost 100% of identity, is present at the 5&#x2032;-end of their RNA2 molecules and comprises the ORF <italic>p15</italic> (393 nts) (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<p>Based on the analysis of 58 nucleotide sequences of <italic>p15</italic> (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 1</xref>) using MEGA v. 10.1.8 (<xref ref-type="bibr" rid="B60">Kumar et al., 2018</xref>) and DnaSP v. 6.12.03 (<xref ref-type="bibr" rid="B95">Rozas et al., 2017</xref>) software suites, we confirmed that CiLV-C <italic>p15</italic> variability is very low with an overall genetic distance (<italic>D</italic>) and nucleotide diversity (&#x03C0;) &#x2264; 0.01 (<xref ref-type="fig" rid="F2">Figure 2</xref>). According to population analyses, this low variability may result from constant purges of the non-synonymous substitutions by purifying selection (&#x03C9; &#x003C; 1), and/or intra-species recombination processes, as already described (<xref ref-type="bibr" rid="B89">Ramos-Gonz&#x00E1;lez et al., 2016</xref>; <xref ref-type="bibr" rid="B16">Chabi-Jesus et al., 2021</xref>). P15 encoded by CiLV-C shows Cys residues that were earlier suggested as taking part in a putative Zn-finger structure (<xref ref-type="bibr" rid="B89">Ramos-Gonz&#x00E1;lez et al., 2016</xref>) (<xref ref-type="fig" rid="F3">Figure 3</xref>). Transiently expressed P15 of CiLV-C of the strain CRD enters into the cell nucleus likely by passive diffusion and the formation of homodimers and heterodimers with P29 and the viral movement proteins can be detected in the cytosol of agroinfiltrated <italic>Nicotiana benthamiana</italic> plants (<xref ref-type="bibr" rid="B63">Leastro et al., 2018</xref>). When expressed from a viral vector, the infected <italic>N. benthamiana</italic> plants show stunted growth and enhanced necrosis in their younger leaves suggesting pathophysiological disorders caused by its ectopic expression. A possible role as an RNA silencing suppressor was also suggested for P15 of CiLV-C strain CRD (<xref ref-type="bibr" rid="B62">Leastro et al., 2020</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Phylogenetic analysis of P15 proteins and intra-species variability of ORF <italic>p15</italic> from cileviruses. Nucleotide sequence alignments were performed using the MAFFT algorithm (<xref ref-type="bibr" rid="B48">Katoh and Standley, 2013</xref>). The best-fit nucleotide substitution model was estimated using MEGA X v. 10.1.8 (<xref ref-type="bibr" rid="B60">Kumar et al., 2018</xref>). Model HKY + G (<xref ref-type="bibr" rid="B44">Hasegawa et al., 1985</xref>), with the lowest Bayesian information criterion (BIC), was used for the estimation of the phylogenetic relationships. The phylogenetic tree was inferred using the Maximum Likelihood method implemented in the Galaxy platform (<xref ref-type="bibr" rid="B1">Afgan et al., 2018</xref>). The reliability of the inferred evolutionary relationships was assessed by 1,000 bootstrapped replications. The tree was edited and visualized using Interactive Tree Of Life (iTOL) v4 (<xref ref-type="bibr" rid="B64">Letunic and Bork, 2019</xref>). ORF <italic>p15</italic> sequences from 47, 4, and 7 isolates of CiLV-C, CiLV-C2, and PfGSV, respectively, were retrieved from the GenBank database accessed on 17th March 2020 (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 1</xref>). Intraspecific genetic distance (<italic>D</italic>) and Non-synonymous (dN) and synonymous (dS) rates of substitutions (&#x03C9;) were calculated in MEGA X, and nucleotide diversity (&#x03C0;) and the confidence intervals for &#x03C0; values were estimated using DnaSP software v6.12.03 (<xref ref-type="bibr" rid="B95">Rozas et al., 2017</xref>). Photo of the CiLV-C2-infected orange leaf is courtesy of Dr. Guillermo Adolfo Leon Martinez, AGROSAVIA, 2021.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g002.tif"/>
</fig>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Consensus sequence obtained from a multiple sequence alignment (MSA) of the cysteine-rich domain of the P15 protein family. MSA was built using ClustalO (<xref ref-type="bibr" rid="B100">Sievers and Higgins, 2014</xref>). The alignment was manually curated by analyzing gaps, conserved amino acid positions, and the predicted secondary structure. The non-aligning segments at the N-terminus of P15 proteins, presumably representing non-core regions, were removed. Conserved cysteine residues are highlighted by gray background and boldface letters. Secondary structure elements, as predicted by I-TASSER, are depicted on top. Helix and &#x03B2;-strand conformations are indicated by h and s, respectively. A conserved sequential motif {[FY]-L-x(3)-[FL]-H-x-[LIV]-S-C-x-C-x(2)-C-x-G-x-C} is displayed below the sequence alignment. Strictly conserved residues are in boldface and putative functional residues specific from each group are in magenta, blue, and green colors. The putative disulfide bridge between the cysteine residues (positions 85 and 92 in CiLV-C) is represented by a blue bracket. Asterisk and point symbols beneath the alignment indicate strictly conserved residues and conservative substitutions, respectively.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g003.tif"/>
</fig>
<p>Analyses conducted in this work indicated that despite the low number of isolates available for CiLV-C2 and PfGSV (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 1</xref>), intra-species variability of their <italic>p15</italic> is almost 10-fold higher than among the CiLV-C isolates. Yet, as well as for CiLV-C, ORFs <italic>p15</italic> from CiLV-C2 and PfGSV show signatures of purifying selection (&#x03C9; &#x003C; 1). P15 proteins of these two viruses are phylogenetically closer to each other and more distant from that in CiLV-C (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<p>Pairwise comparisons of the P15 amino acid (aa) sequences reveal very low identity values, approximately 14 or 20%, between the proteins from CiLV-C and PfGSV or CiLV-C2, respectively. Higher values, still moderate, approximately 55% aa sequence identity, are detected in the comparisons between P15 from CiLV-C2 and PfGSV (<xref ref-type="bibr" rid="B89">Ramos-Gonz&#x00E1;lez et al., 2016</xref>, <xref ref-type="bibr" rid="B90">2020</xref>). Remarkably, the aa pairwise sequence identity values among P15 of CiLV-C and those from CiLV-C2 and PfGSV meaningfully deviate from those commonly observed within the cilevirus proteomes. The identity values between the P15 of PfGSV and CiLV-C are less than half of those shown by the next less conserved proteins among cileviruses, i.e., P29 and P61 (31&#x2013;32%) (<xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>).</p>
</sec>
<sec id="S3">
<title>Open Reading Frame <italic>p15</italic> Displays Synonymous Codon Usage Bias That Might Reflect Adaption to Host-Specific Codon Usage Patterns</title>
<p>Codon usage bias (CUB) refers to differences in the frequency of occurrence of synonymous codons in coding sequences (<xref ref-type="bibr" rid="B9">Behura and Severson, 2013</xref>). In viruses, CUB reflects changes in gene expression that occur as a consequence of, for instance, the interplay and co-evolution with their hosts, which may increase the viral fitness (<xref ref-type="bibr" rid="B12">Biswas et al., 2019</xref>; <xref ref-type="bibr" rid="B45">He et al., 2019</xref>; <xref ref-type="bibr" rid="B50">Khandia et al., 2019</xref>). Several indices, such as the Effective Number of codons (ENc), the Relative Synonymous Codon Usage (RSCU), COdon Usage Similarity Index (COUSIN), and Relative Codon Deoptimization Index (RCDI) are used to quantify the CUB (<xref ref-type="bibr" rid="B98">Sharp and Li, 1987</xref>; <xref ref-type="bibr" rid="B109">Wright, 1990</xref>; <xref ref-type="bibr" rid="B13">Bourret et al., 2019</xref>). These indices enable the comparison of genes within a genome and across different genotypes and species revealing elements about their evolution.</p>
<p>Effective Number of codons values range from 20 to 61, with a value of 20 indicating extreme bias, whereas, inversely, 61 indicates non-biased codon usage (<xref ref-type="bibr" rid="B109">Wright, 1990</xref>). Meanwhile, an RSCU value higher or lower than 1 denotes that the codon has a positive or negative CUB, respectively, and those with an RSCU value equal to the unit are randomly chosen. In this study, ENc and RSCU values were calculated using the web servers COUSIN<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> and CAIcal<sup><xref ref-type="fn" rid="footnote2">2</xref></sup>, respectively. Analyses of both ENc and RSCU of the cilevirus ORFs revealed codon usage bias. Even though the means of the ENc values of ORFs both at virus and genus levels were generally higher than 50, some viral isolates and ORFs, e.g., PfGSV_Snp1 and ORF <italic>p15</italic>, had mean values lower than 50 which are considered an indicator of skewed codon usage. Within this group, the ORF <italic>p15</italic> of CiLV-C2_Hw displayed a distinctly low ENc value, &#x2248;35 (<xref ref-type="fig" rid="F4">Figure 4A</xref> and <xref ref-type="supplementary-material" rid="DS2">Supplementary Table 2</xref>), which is generally accepted as a mark of genes with significant codon usage bias (<xref ref-type="bibr" rid="B19">Comeron and Aguad&#x00E9;, 1998</xref>; <xref ref-type="bibr" rid="B14">Butt et al., 2016</xref>). The values of RSCU were computed for every codon of each ORF by the respective genomes (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 3</xref>). When the RSCU patterns were hierarchically clustered, most of the ORFs were grouped in a large branch of the generated dendrogram. ORFs <italic>p15</italic>, however, were generally distributed in two small branches shared also with the ORFs <italic>p32</italic> and <italic>p24</italic> of a few isolates (<xref ref-type="fig" rid="F4">Figure 4B</xref>). Differently, the ORF <italic>p15</italic> of CiLV-C2_Hw was clustered together with the ORFs <italic>RdRp</italic> and <italic>p61</italic> of that isolate, suggesting that they could have undergone a comparable mutational selection.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Codon usage bias of cilevirus ORFs. <bold>(A)</bold> Boxplot of Effective Number of codons (ENc). ENc values were calculated using COUSIN webserver (<ext-link ext-link-type="uri" xlink:href="http://cousin.ird.fr/index.php">http://cousin.ird.fr/index.php</ext-link>) and plotted in Rstudio (<xref ref-type="bibr" rid="B96">Rstudio_team, 2021</xref>). <bold>(B)</bold> RSCU profiles across ORF of seven different isolates of three species of cileviruses. RSCU values were obtained using the CAIcal webserver (<ext-link ext-link-type="uri" xlink:href="https://ppuigbo.me/programs/CAIcal/">https://ppuigbo.me/programs/CAIcal/</ext-link>). The heat map was drawn in Rstudio using the &#x201C;heatmapply&#x201D; function (<xref ref-type="bibr" rid="B37">Galili et al., 2018</xref>). RSCU values were transformed using the &#x201C;percentize&#x201D; method and dist_method = &#x201C;Euclidean&#x201D; and hclust_method = &#x201C;complete&#x201D; were selected. Darker purple represents higher RSCU values.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g004.tif"/>
</fig>
<p>To get further insight into the codon preferences of <italic>p15</italic>, we assessed and compared the CUB of each ORF in each cilevirus strain using COUSIN<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> and the ratio RCDI/eRCDI<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> (RCDI/expected RCDI) (<xref ref-type="bibr" rid="B85">Puigb&#x00F2; et al., 2010</xref>). It should be noted that while ENc and RSCU values exclusively rely on the coding sequence of the analyzed gene, COUSIN and the ratio RCDI/eRCDI assess the similarity of the codon usage patterns among the coding sequences of at least two genes. In this study, the COUSIN score (<xref ref-type="bibr" rid="B13">Bourret et al., 2019</xref>) estimates the codon usage preferences of a viral ORF compared with those of a reference, the viral host, normalized over a null hypothesis of equal usage of synonymous codons. Sequences with scores &#x2265; 1 display adaptation to the host cellular machinery and higher values might reflect an increasing level of expression. RCDI reflects the similarity of the codon usage between a given coding sequence and a reference genome (<xref ref-type="bibr" rid="B77">Mueller et al., 2006</xref>). A lower RCDI value shows the best rate of viral gene translation in the host as well as being indicative of the possible co-evolution of virus and host genomes. The eRCDI value is the RCDI corresponding to a random sequence generated with similar G + C content and amino acid composition to the analyzed sequence and provides a threshold that allows for statistical analysis of the calculated RCDI values. A lower ratio RCDI/eRCDI means a better viral ORF adaptation to its host.</p>
<p>The values of COUSIN and the ratio RCDI/eRCDI were calculated using the set of host plants whose codon usage data could be directly retrieved from, or calculated using available genomic information in, public databases. Codon usage frequencies of some natural and experimental hosts of cileviruses, i.e., citrus (<italic>Citrus</italic> &#x00D7; <italic>sinensis</italic>), passion fruit (<italic>Passiflora</italic> spp.), orchid (<italic>Oncidium</italic> spp.) (<xref ref-type="bibr" rid="B52">Kitajima et al., 2010</xref>), <italic>Arabidopsis thaliana</italic> (<xref ref-type="bibr" rid="B4">Arena et al., 2017</xref>), and <italic>Phaseolus vulgaris</italic> (<xref ref-type="bibr" rid="B39">Garita et al., 2013</xref>) were retrieved from Codon Usage Database<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> or calculated using countcodon v4<sup><xref ref-type="fn" rid="footnote4">4</xref></sup> (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 4</xref>). Coding sequence information of <italic>Passiflora edulis</italic>, approximately 300 ORFs, was retrieved from the CoGe platform<sup><xref ref-type="fn" rid="footnote5">5</xref></sup>. Orchids were included as cilevirus natural hosts since the infection by PfGSV of plants from two orchids species collected in Brazil was molecularly confirmed (unpublished results). For the sake of the comparisons, the analyses were performed considering both the sum of natural and experimental host plants and two subsets of plants, each comprising natural or experimental hosts. Altogether, the analysis included plants belonging to five families, including perennial woody (citrus) and perennial herbaceous (orchids) plants, short-lived evergreen perennial vine (passion fruit), and herbaceous annual (common beans and Arabidopsis) plants. Values of COUSIN and the ratio RCDI/eRCDI were visualized and compared using hierarchical clustering. The accuracy of the assessment could be potentially increased as new annotated genomes of cilevirus host plants are available.</p>
<p>Regardless of the viral species, the analysis by ORFs using the complete set of host plants revealed a &#x003E;1 bias in the mean COUSIN values of <italic>p61</italic> (1.28 &#x00B1; 0.44), <italic>RdRp</italic> (1.19 &#x00B1; 0.36), <italic>p32</italic> (1.15 &#x00B1; 0.37), and <italic>p24</italic> (1.09 &#x00B1; 0.38), whereas mean values below or near to 1 were observed for the ORFs <italic>p15</italic> (0.89 &#x00B1; 0.53) and <italic>p29</italic> (0.93 &#x00B1; 0.35) (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 5A</xref>). COUSIN values of <italic>p15</italic>, however, showed the highest variability with a coefficient of variation surpassing 50%. Some CiLV-C2 ORFs reached COUSIN values higher than 1.5, whereas the minimum values, e.g., &#x2212;0.15, corresponded to PfGSV isolates. The same trend of COUSIN values was obtained when calculations were performed with the two subgroups of plants conveniently separated into natural and experimental hosts (<xref ref-type="supplementary-material" rid="DS2">Supplementary Tables 5B,C</xref>). In a global analysis across the genus, the COUSIN profile of <italic>p15</italic> resembled that showed by <italic>p32</italic>, which encodes the viral movement protein (<xref ref-type="fig" rid="F5">Figure 5A</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Codon adaption of cilevirus ORFs. Heatmaps representing the values of COUSIN <bold>(A)</bold> and the RCDI/eRCDI <bold>(B)</bold>, respectively, of every ORF across cilevirus isolates. For the sake of comparative analyses, COUSIN and the RCDI/eRCDI values have been obtained simulating the interaction of all viruses with the same hypothetical host range. Codon usage database of host plants and values of ENc, COUSIN, and RCDI are shown in <xref ref-type="supplementary-material" rid="DS2">Supplementary Tables 3</xref>, <xref ref-type="supplementary-material" rid="DS2">4</xref>. The heat map was drawn in Rstudio (<xref ref-type="bibr" rid="B96">Rstudio_team, 2021</xref>) using the &#x201C;heatmapply&#x201D; function (<xref ref-type="bibr" rid="B37">Galili et al., 2018</xref>). Both COUSIN and RCDI/eRCDI values were transformed using the &#x201C;percentize&#x201D; method and dist_method = &#x201C;Euclidean&#x201D; and hclust_method = &#x201C;complete&#x201D; were selected. Darker red and light green to white represent the best values in each heat map, respectively. At, <italic>Arabidopsis thaliana</italic>; Cs, <italic>Citrus</italic> &#x00D7; <italic>sinensis</italic>; Pe, <italic>Passiflora edulis</italic>; On, <italic>Oncidium</italic> sp.; and Phv, <italic>Phaseolus vulgaris</italic>.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g005.tif"/>
</fig>
<p>The study of the ratio RCDI/eRCDI of the cilevirus ORFs indicated that <italic>RdRp</italic> and <italic>p61</italic> bear the lowest mean values, followed in ascending order by <italic>p24</italic>, <italic>p29</italic>, and <italic>p32</italic>, and finally <italic>p15</italic> (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 6A</xref>). In the particular comparisons of <italic>p15</italic>, values from CiLV-C2 were collectively the lowest ones reached. In the evaluations using <italic>P. vulgaris</italic>, the ratios RCDI/eRCDI of <italic>p15</italic> genes from CiLV-C2 isolates were below the threshold of 0.9, similar to those held by other viral ORFs, e.g., <italic>p29</italic>, <italic>p24</italic>, and <italic>p32.</italic> The analyses suggest that the CUB of all those ORFs might be the outcome of equivalent processes of strain/host adaptation.</p>
<p>Regardless of the subgroup of studied plants, i.e., the subsets of natural, experimental, or the array containing all host plants, the mean values of the ratio RCDI/eRCDI of <italic>p15</italic> were higher than those shown by the remaining viral ORFs (<xref ref-type="supplementary-material" rid="DS2">Supplementary Tables 6B,C</xref>). At the genus level, when analyzed through hierarchical clustering, the RCDI/eRCDI profile of <italic>p15</italic> is unique among those shown by any other ORF of cileviruses (<xref ref-type="fig" rid="F5">Figure 5B</xref>).</p>
</sec>
<sec id="S4">
<title>P15 Proteins Display a Unique Conserved Sequence Motif</title>
<p>Even though a low global sequence identity of less than 25% was observed for P15 proteins, the consensus sequence obtained from a multiple sequence alignment of the cysteine-rich domain reveals a stretch of highly conserved residues organized in the sequence motif {[FY]-L-x(3)-[FL]-H-x-[LIV]-S-C-x-C-x(2)-C-x-G-x-C} (<xref ref-type="fig" rid="F3">Figure 3</xref> and <xref ref-type="supplementary-material" rid="DS2">Supplementary Table 7</xref>). Besides four full-conserved residues of cysteines and one of histidine, the consensus sequence motif also contains three other invariable amino acids: leucine, serine, and glycine. A ScanProsite<sup><xref ref-type="fn" rid="footnote6">6</xref></sup> (<xref ref-type="bibr" rid="B23">de Castro et al., 2006</xref>) search using the conserved sequence motif in the UniProtKB/Swiss-Prot<sup><xref ref-type="fn" rid="footnote7">7</xref></sup> (release 07-Apr-2021: 564,638 entries) and UniProtKB/TrEMBL<sup><xref ref-type="fn" rid="footnote8">8</xref></sup> (release 07-Apr-2021: 214,406,399 entries) databases retrieved 30 hits spanned in 30 sequences, all of them exclusively corresponding to the P15 proteins from CiLV-C, CiLV-C2, and PfGSV. Additionally, no hits were retrieved from a similar search but against a randomized UniProtKB/Swiss-Prot database, i.e., reversed option. According to ScanProsite, the conserved sequence motif has an approximate number of expected random matches of 6.97 &#x00D7; 10<sup>&#x2013;08</sup> in 100,000 sequences (50,000,000 residues). Altogether, these results suggested a low probability of P15 matches by chance and, thus, highlighted the singular occurrence of the identified cysteine-rich domain in the P15 protein family.</p>
</sec>
<sec id="S5">
<title>Three-Dimensional Structure Prediction Suggests a Helical Bundle-Like Scaffold for the P15 Proteins</title>
<p>The existence of a conserved sequence motif among the P15 proteins dropped a hint that all of them might exhibit a similar fold. Interestingly, this motif overlaps a predicted &#x03B1;-turn-&#x03B1; motif in the secondary structure of P15 in CiLV-C and CiLV-C2, whereas in PfGSV the second &#x03B1;-helix is missing (<xref ref-type="fig" rid="F3">Figure 3</xref>). The predictions by evaluation of sequence-structure fitness (<xref ref-type="bibr" rid="B82">Ouzounis et al., 1993</xref>) with proteins of known 3D structures of a representative group of ten P15 proteins from the three species of cileviruses were carried out by the iterative threading assembly refinement (I-TASSER) server<sup><xref ref-type="fn" rid="footnote9">9</xref></sup> (<xref ref-type="bibr" rid="B110">Yang and Zhang, 2015</xref>). For the computational structural analyses, experimentally determined 3D structures were downloaded from the RCSB PDB database<sup><xref ref-type="fn" rid="footnote10">10</xref></sup> and the protein 3D structure alignments were done using the PDBeFold server<sup><xref ref-type="fn" rid="footnote11">11</xref></sup> (<xref ref-type="bibr" rid="B58">Krissinel and Henrick, 2004</xref>). The quality of protein structure templates and predicted full-length models were assessed using the <italic>C</italic>-score and TM-score implemented in I-TASSER<sup><xref ref-type="fn" rid="footnote9">9</xref></sup> (<xref ref-type="bibr" rid="B112">Zhang and Skolnick, 2004</xref>). <italic>C</italic>-score is typically in the range [&#x2212;5,2], where a <italic>C</italic>-score of higher value signifies a model with high confidence, and vice-versa. TM-score values range from [0,1], where 1 indicates a perfect match between two structures. A TM-score below 0.17 corresponds to randomly chosen unrelated proteins whereas structures with a score higher than 0.5 assume generally the same fold.</p>
<p>The best generated P15 3D model corresponded to that for CiLV-C_Ar02 (<italic>C</italic>-score &#x2212;2.82, TM-score 0.39 &#x00B1; 0.13, <xref ref-type="supplementary-material" rid="DS2">Supplementary Table 8</xref>). The model from this protein suggested the presence of a four-helical bundle-like scaffold with structural similarity to the 12 &#x03B1;-helices bundled of protein ORF49 from the human gammaherpesvirus 8 (ORF49_HHV-8, PDB ID: 5ipx). The multiple structural alignments of 3D models of P15 proteins with the crystal structure of ORF49_ HHV-8 by using PDBeFOLD<sup><xref ref-type="fn" rid="footnote11">11</xref></sup> revealed the &#x201C;plasticity&#x201D; of the helix-bundle protein folds (<xref ref-type="fig" rid="F6">Figure 6A</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>3D structural similarities between P15 and <bold>(A)</bold> the protein ORF49 from the human gammaherpesvirus 8 (ORF49_HHV-8, PDB ID: 5ipx) and <bold>(B)</bold> the rhabdoviruses rabies virus (PDB ID: 1vyi) and vesicular stomatitis Indiana virus (PDB ID: 2k47), and the filovirus Zaire ebolavirus (PDB ID: 3fke). Multiple structural alignments were done with the PDBeFold method. The conserved sequential motif {[FY]-L-x(3)-[FL]-H-x [LIV]-S-C-x-C-x(2)-C-x-G-x-C} is displayed in red. Structure figures were prepared using PyMOL (<xref ref-type="bibr" rid="B25">Delano, 2002</xref>).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-12-771983-g006.tif"/>
</fig>
<p>PDBeFold provides average root mean square deviation (RMSD) and <italic>Q</italic>-score values to estimate the quality of matching structures. RMSD distance between corresponding residues is calculated after an optimal rotation of one structure to another. An RMSD of less than about 3&#x2013;4 &#x00C5; would generally be considered very close, but there is no absolute rule. The <italic>Q</italic>-score assesses the number of residues in the matched secondary structural elements and their positions in space. <italic>Q</italic>-scores reach the value of 1 for identical structures and decline to zero with the lack of similarity, thus, the higher <italic>Q</italic>, the better.</p>
<p>Among the P15 from CiLV-C isolates, an overall RMSD = 2.47 &#x00C5; and <italic>Q</italic>-score = 0.31 over 94 aligned residues were obtained, whereas in those from CiLV-C2 the values were RMSD = 3.83 &#x00C5; and <italic>Q</italic>-score = 0.01 over 22 aligned residues. In P15 from the PfGSV isolates, the values were an overall RMSD = 4.1 &#x00C5; and <italic>Q</italic>-score = 0.02 over 30 aligned residues. Detailed analysis of the 3D model for the CiLV-C_Ar02 P15 indicated that cysteine residues at positions 85 (Cys85) and 92 (Cys92) could form a disulfide bridge and consequently stabilize the four-helical bundle. Remarkably, this two-cysteine-residues pattern is also present in the conserved sequence motif across the P15 proteins. However, since the spatial localization of the two cysteines is not close enough in some 3D models, the formation of a disulfide bridge is not completely evident. Such discrepancies could ensue from the analysis of low-resolution 3D models.</p>
<p>The resolved crystal structures of ORF49_ HHV-8 (PDB ID: 5ipx) and ORF49 from the murine herpesvirus 68 (ORF49_MHV68, PDB ID: 6a4v) (<xref ref-type="bibr" rid="B46">Hew et al., 2017</xref>; <xref ref-type="bibr" rid="B17">Chung et al., 2018</xref>) show a fold consisting of 12 &#x03B1;-helices bundled into two pseudo-domains (left top and bottom panels in <xref ref-type="fig" rid="F6">Figure 6A</xref>). By mapping the identical and similar amino acids onto the helical scaffold, the potentially functional residues in P15 from CiLV-C were predicted. Conserved leucine and valine residues were localized in hydrophobic regions along the interacting surfaces of the &#x03B1;-helices and have the potential to play a structural role. Polar and charged residues were localized on the surface and could be associated with protein fold stability or protein interactions (<xref ref-type="table" rid="T1">Table 1</xref>).</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Summary of the predicted functional residues for the P15 family of proteins.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Amino acid<sup><xref ref-type="table-fn" rid="t1fn1">a</xref></sup></td>
<td valign="top" align="center">ORF49<sup><xref ref-type="table-fn" rid="t1fn2">b</xref></sup></td>
<td valign="top" align="center">3D<sup><xref ref-type="table-fn" rid="t1fn3">c</xref></sup></td>
<td valign="top" align="left">Description</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Cys 18</td>
<td valign="top" align="center">His 14</td>
<td valign="top" align="center">&#x03B1;1</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Cys 20</td>
<td valign="top" align="center">Phe 16</td>
<td valign="top" align="center">&#x03B1;1</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Asp 26</td>
<td valign="top" align="center">Ser 22</td>
<td valign="top" align="center">Loop</td>
<td valign="top" align="left">Charged, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Leu 34</td>
<td valign="top" align="center">Leu 30</td>
<td valign="top" align="center">&#x03B1;2</td>
<td valign="top" align="left">Hydrophobic, buried, helix packing</td>
</tr>
<tr>
<td valign="top" align="left">Val 35</td>
<td valign="top" align="center">Asp 31</td>
<td valign="top" align="center">&#x03B1;2</td>
<td valign="top" align="left">Hydrophobic, buried, helix packing</td>
</tr>
<tr>
<td valign="top" align="left">Leu 50</td>
<td valign="top" align="center">Ala 46</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Hydrophobic, buried, helix packing</td>
</tr>
<tr>
<td valign="top" align="left">Leu 74</td>
<td valign="top" align="center">Pro 70</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Hydrophobic, buried, helix packing</td>
</tr>
<tr>
<td valign="top" align="left">Tyr 76</td>
<td valign="top" align="center">Leu 72</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">His 79</td>
<td valign="top" align="center">Ala 75</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Charged, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Ser 82</td>
<td valign="top" align="center">Ala 78</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Cys 83</td>
<td valign="top" align="center">Asn 79</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Asn 84</td>
<td valign="top" align="center">Leu 80</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Cys 85</td>
<td valign="top" align="center">Ala 81</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, disulfide bridge, protein fold stability</td>
</tr>
<tr>
<td valign="top" align="left">Cys 88</td>
<td valign="top" align="center">Leu 84</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein interaction</td>
</tr>
<tr>
<td valign="top" align="left">Gly 90</td>
<td valign="top" align="center">Gln 86</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, protein fold stability</td>
</tr>
<tr>
<td valign="top" align="left">Cys 92</td>
<td valign="top" align="center">Tyr 88</td>
<td valign="top" align="center">&#x03B1;3</td>
<td valign="top" align="left">Polar, exposed, disulfide bridge, protein fold stability</td>
</tr>
<tr>
<td valign="top" align="left">Gly 98</td>
<td valign="top" align="center">Lys 94</td>
<td valign="top" align="center">&#x03B1;4</td>
<td valign="top" align="left">Polar, exposed, protein fold stability</td>
</tr>
<tr>
<td valign="top" align="left">Gly 114</td>
<td valign="top" align="center">Phe 113</td>
<td valign="top" align="center">&#x03B1;5</td>
<td valign="top" align="left">Polar, exposed, protein fold stability</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="t1fn1"><p><italic><sup><italic>a</italic></sup>Amino acid position in the CiLV-C_Ar02_ALF45382.1 sequence.</italic></p></fn>
<fn id="t1fn2"><p><italic><sup><italic>b</italic></sup>Equivalent position in the ORF49 sequence according to the structure superposition between ORF49_HHV-8 (PDB ID: 5ipx) and CiLV-C_Ar02_ALF45382.1 (I-TASSER 3D-model).</italic></p></fn>
<fn id="t1fn3"><p><italic><sup><italic>c</italic></sup>Nomenclature of alpha-helices is based on the 3D structure of ORF49_ HHV-8. The structure superposition between ORF49_HHV-8 and CiLV-C 3D-models by PDBeFold is provided in <xref ref-type="supplementary-material" rid="DS1">Supplementary File 1</xref>.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>&#x03B1;-helical folds have also been detected in the C-terminal domain of the phosphoprotein (P<sub><italic>CTD</italic></sub>) of mononegaviruses (<xref ref-type="bibr" rid="B7">Assenberg et al., 2010</xref>; <xref ref-type="bibr" rid="B47">Ivanov et al., 2010</xref>; <xref ref-type="bibr" rid="B79">Ng et al., 2020</xref>). It is of note that cileviruses and mononegaviruses of the genus <italic>Dichorhavirus</italic>, family <italic>Rhabdoviridae</italic> (<xref ref-type="bibr" rid="B28">Dietzgen et al., 2018</xref>), share several commonalities, including hosts and vectors, that have led to postulate the likely existence of evolutionary convergence among viruses of these two genera (<xref ref-type="bibr" rid="B35">Freitas-Ast&#x00FA;a et al., 2018</xref>). The peculiar architecture observed in P<sub><italic>CTD</italic></sub> and P15 prompted us to compare the 3D structure of these proteins.</p>
<p>The P<sub><italic>CTD</italic></sub> of the cytorhabdovirus lettuce necrotic yellows virus (LNYV) comprises five &#x03B1;-helices that have an overall topology that although with different structural features, seems to be conserved with those in other rhabdoviruses, filoviruses, and paramyxoviruses (<xref ref-type="bibr" rid="B69">Martinez et al., 2013</xref>). P<sub><italic>CTD</italic></sub> crystal structure from plant-infecting rhabdoviruses other than LNYV has not been described. Hence, the predicted 3D models of P15 were aligned with the P<sub><italic>CDT</italic></sub> of mononegaviruses, regardless of their range hosts. P15 of CiLV-C_Ar02 and CiLV-C2_Hw are structurally related to P<sub><italic>CTD</italic></sub> of mokola virus (PDB ID: 2wzl) (<xref ref-type="bibr" rid="B26">Delmas et al., 2010</xref>) and rabies virus (PDB ID: 1vyi) (<xref ref-type="bibr" rid="B71">Mavrakis et al., 2004</xref>) with overall RMSD = 3.1 &#x00C5; and <italic>Q</italic>-score = 0.06 over 42 aligned residues, whereas those of CiLV-C2_Co2 and CiLV-C2_Fla are structurally related to vesicular stomatitis Indiana virus (PDB ID: 2k47) with overall RMSD = 4.0 &#x00C5; and <italic>Q</italic>-score = 0.04 over 32 aligned residues (<xref ref-type="fig" rid="F6">Figure 6B</xref>). In the case of PfGSV_Cmp1 and PfGSV_BJL1, their modeled P15 are structurally related to Zaire ebolavirus (PDB ID: 3fke) with an overall RMSD = 3.1 &#x00C5; and <italic>Q</italic>-score = 0.04 over 36 aligned residues. Based on structural similarities, it could be hypothesized that P15 proteins can perform functions related to those accomplished by ORF49 and P<sub><italic>CTD</italic></sub> proteins. ORF49 interacts and upregulates the transcriptional activity of a protein known as RTA (replication and transcription activator) in HHV-8, which drives the expression of all lytic genes (<xref ref-type="bibr" rid="B40">Gonz&#x00E1;lez et al., 2006</xref>). Present in a large number of viruses, P protein orthologs are involved in a wide range of functions, e.g., acting as chaperons, regulation of viral transcription, RNA binding, and suppressor of RNA silencing in plants (<xref ref-type="bibr" rid="B70">Masters and Banerjee, 1988</xref>; <xref ref-type="bibr" rid="B83">Peluso and Moyer, 1988</xref>; <xref ref-type="bibr" rid="B69">Martinez et al., 2013</xref>; <xref ref-type="bibr" rid="B10">Bejerman et al., 2016</xref>; <xref ref-type="bibr" rid="B68">Mann et al., 2016</xref>; <xref ref-type="bibr" rid="B79">Ng et al., 2020</xref>).</p>
</sec>
<sec id="S6">
<title>Zn-Finger-Like Proteins in Arthropod-Infecting Viruses Closely Related to Kitaviruses Are Not Related With the Cilevirus P15 Proteins</title>
<p>Based on the predicted isoelectric point and the presence of a cysteine-rich pattern compatible with a potential Zn-finger domain, an acidic cilevirus P15-like protein was previously recognized in the genome of Wuhan insect virus 8 (WhIV-8) (<xref ref-type="bibr" rid="B108">Vinokurov and Koloniuk, 2019</xref>). The protein is encoded by the putative ORF <italic>p11</italic> (<xref ref-type="fig" rid="F1">Figure 1</xref>) that was neither identified during the virus genome description (<xref ref-type="bibr" rid="B99">Shi et al., 2016</xref>) nor annotated at its corresponding GenBank locus KX883812.</p>
<p>Our search for supposed <italic>p15</italic> homologs based on the presence of a zinc-finger-like motif was expanded to nege/kita-like viruses and other related arthropod-infecting viruses. It also included previously ignored or poorly characterized putative ORFs. In members of the proposed subgroup Centivirus of the nege/kita-like viruses, a homolog of WhIV-8 <italic>p11</italic> was identified in the uncharacterized <italic>Glycine dolichocarpa</italic> virus (GDV, Transcriptomic shotgun assembly TSA GGIW01009300) (<xref ref-type="bibr" rid="B55">Kondo et al., 2020</xref>). Proteins encoded by the ORF <italic>p11</italic> in GDV and WhIV-8 have 41.4% amino acid sequence identity. The region comprising the cysteine-rich pattern in these two proteins shows a marginal identity (<italic>E</italic>-value &#x003C; 1 according to MOTIF Search<sup><xref ref-type="fn" rid="footnote12">12</xref></sup>) with the Pfam motif C4-type zinc-finger of DNA polymerase delta (PF14260). Similarly, the ORF p<italic>19</italic> in the putative centivirus Wuhan house centipede virus 1 (WHCV-1) and ORF <italic>p9</italic> of Hubei virga-like virus 9 (HVLV-9) encode proteins with signatures of a putative zinc-finger domain (<xref ref-type="bibr" rid="B108">Vinokurov and Koloniuk, 2019</xref>), which in the case of HVLV-9 also displays a subtle identity [<italic>E</italic>-value &#x003C; 1 according to MOTIF Search<sup><xref ref-type="fn" rid="footnote11">11</xref></sup>] with the Pfam motif SLBP (stem-loop binding protein) of the RNA binding superfamily (PF15247). The N-terminal of the ORF3-encoded protein of <italic>Pyrrhocoris apterus</italic> virus slightly resembles the canonical Pfam DSRM (double-stranded RNA binding motif, PF00035) architecture, whereas, although with low sequence similarity, the C-terminal end of this protein displays a conservative basic RNA-interacting motif. However, neither nucleotide sequences of these ORFs nor their predicted polypeptides show significant sequence identity to any element across cileviruses.</p>
<p>To get further insight into the putative relationships between P15 and P11 proteins of WhIV-8 and GDV, their 3D structures were predicted using the I-TASSER server<sup><xref ref-type="fn" rid="footnote9">9</xref></sup>. Although P11 proteins show &#x03B1; helix scaffolds, the arrangement of helixes is different from that observed in P15. Multiple structural alignments of 3D models by PDBeFold<sup><xref ref-type="fn" rid="footnote11">11</xref></sup> indicated that only one &#x03B1;-helix in P11 proteins shows structural similarity to &#x03B1;-helices in the P15 3D models (overall RMSD = 2.68 &#x00C5; and <italic>Q</italic>-score = 0.01 over 17 aligned residues). The lack of overall sequence or 3D structure similarity between both groups of proteins drastically reduces the probability of a common origin.</p>
</sec>
<sec id="S7">
<title>The 5&#x2032;-End of the RNA2 in Cileviruses, Besides the ORF <italic>p15</italic>, Harbors ORFs Encoding Small Hydrophobic Proteins</title>
<p>Downstream the ORF <italic>p15</italic>, the genomic stretch extended up to <italic>p61</italic> in the 5&#x2032;-end of the RNA2 of cileviruses is known as the intergenic region (IR). However, this is not a proper description because rather than lacking ORFs, the IR lacks ORFs greater than 300 nucleotides. <italic>In silico</italic> tools have traditionally considered a minimum of 100 amino acids for an ORF to be annotated as a putative coding protein sequence, which has led to the inaccurate annotation of certain genomes. However, despite the detection of those small ORFs, the IRs of cileviruses show a very low density of predicted coding sequences, only 20&#x2013;30%, with large non-coding stretches (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<p>In congruence with ORF <italic>p15</italic>, the interspecies comparisons of the IRs of cileviruses show low levels of nucleotide sequence identity (&#x2264;45%) (<xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>). With variable length, the IR is larger in CiLV-C isolates (1,095 &#x00B1; 56 nucleotides) and shorter in PfGSV isolates (850 &#x00B1; 21 nucleotides). The region harbors small ORFs encoding hydrophobic proteins with predicted TM domains, but which are unrelated to the SP24 motif (<xref ref-type="bibr" rid="B73">Melzer et al., 2013</xref>; <xref ref-type="bibr" rid="B94">Roy et al., 2013</xref>; <xref ref-type="bibr" rid="B89">Ramos-Gonz&#x00E1;lez et al., 2016</xref>, <xref ref-type="bibr" rid="B90">2020</xref>). ORF <italic>p13</italic> in PfGSV_Snp1, the largest detected so far (333 nucleotides), presents four regularly distributed cysteine residues and low sequence identity with motifs identified in the cytokine-induced anti-apoptosis inhibitor 1 (CIAPIN1, PF05093), and PGC7/Stella/Dppa3-like domain (PGC7_Stella, PF15549) (<xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>).</p>
<p>Further inspection in the 5&#x2032;-end of the RNA2 in cileviruses, allowed us to detect the ORF <italic>p8</italic> overlapping the ORF <italic>p15</italic> of CiLV-C. ORF <italic>p8</italic> is present in every sequenced virus of the three lineages of CiLV-C (<xref ref-type="bibr" rid="B16">Chabi-Jesus et al., 2021</xref>), but it is absent in CiLV-C2 and PfGSV. Previously overlooked, the ORF <italic>p8</italic> spans 54% of <italic>p15</italic> and has 213 nucleotides in the frame +2 (<xref ref-type="fig" rid="F1">Figure 1</xref>). ORF <italic>p8</italic> is significantly longer than any other ORF expected in all reading frames within the ORF <italic>p15</italic>, which is a fair indicator of its real existence according to an <italic>in silico</italic> method for detecting overlapping genes (<xref ref-type="bibr" rid="B97">Schlub et al., 2018</xref>) performed in this study (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 9</xref>). The presence of the overlapping ORF <italic>p8</italic> in CiLV-C may limit the variability of <italic>p15</italic> and suggests a different evolutionary trajectory in the formation of this genomic region among cileviruses.</p>
<p>The predicted P8 protein seems to be hydrophobic with a TM domain according to MEMSAT-SVM<sup><xref ref-type="fn" rid="footnote13">13</xref></sup> and TMHMM Server v. 2.0<sup><xref ref-type="fn" rid="footnote14">14</xref></sup>. This TM domain resembles the predicted properties of the small P7 proteins encoded downstream the ORF <italic>p15</italic>, in the IR of CiLV-C2 (<xref ref-type="bibr" rid="B94">Roy et al., 2013</xref>), CiLV-C_SJP, and PfGSV (<xref ref-type="fig" rid="F1">Figure 1</xref>). Small hydrophobic proteins by themselves are outside the scope of this work but what is particularly pertinent is their constant presence in the 5&#x2032;-end of the RNA2 of cileviruses, and the association with the small Zn-finger-like or nucleic acid binding-like proteins in some nege/kita-like viruses. The overlapping gene array observed in the pair P15/P8 in CiLV-C was also observed in the pairs P11/P6 and P19/P8 of the centiviruses WhIV-8 and WHCV-1, respectively (<xref ref-type="fig" rid="F1">Figure 1</xref>). Overlapping genes <italic>p6</italic> and <italic>p8</italic> of centiviruses were detected following the same procedure described for the identification of <italic>p8</italic> in CiLV-C (<xref ref-type="supplementary-material" rid="DS2">Supplementary Table 9</xref>). Small hydrophobic proteins have been described in several taxonomic groups of plant RNA viruses where they can be involved in coupling viral replication and cell-to-cell movement showing properties similar to those of reticulons (<xref ref-type="bibr" rid="B102">Solovyev and Morozov, 2017</xref>; <xref ref-type="bibr" rid="B75">Morozov and Solovyev, 2020</xref>; <xref ref-type="bibr" rid="B61">Lazareva et al., 2021</xref>).</p>
</sec>
<sec sec-type="discussion" id="S8">
<title>Discussion</title>
<p>This study was driven by the search for commonalities between the ORFs <italic>p15</italic>, their encoded proteins, and the questioning of whether these polypeptides can be considered orthologs. Finally, these analyses led us to a critical appraisal of the 5&#x2032;-end of the RNA2 of cileviruses under an evolutionary perspective.</p>
<p>Absent in kitavirids beyond the genus <italic>Cilevirus</italic> and with no apparent homology with any sequence available in public databases, ORF <italic>p15</italic> was considered an orphan gene (<xref ref-type="bibr" rid="B3">Arena et al., 2016</xref>). P15 proteins from different species show a remarkably reduced amino-acid sequence identity between them, with values as low as those shown by the global alignment of unrelated proteins (&#x2248;16%) (<xref ref-type="bibr" rid="B53">Konat&#x00E9; et al., 2019</xref>; <xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>). As well as P13 from PfGSV, it has been speculated that P15 proteins are viral auxiliary factors that act as regulators of host physiology during the plant-virus interplay (<xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>). Particularly, the putative role of P15 from CiLV-C of the strain CRD as a viral suppressor of the plant gene silencing antiviral defense was analyzed (<xref ref-type="bibr" rid="B62">Leastro et al., 2020</xref>), but further studies are needed to reach sound conclusions.</p>
<p>Together with the hallmark ORF <italic>RdRp</italic> (<xref ref-type="bibr" rid="B57">Koonin et al., 2006</xref>), the genomes of the typical cileviruses comprise the ORFs <italic>p24, p32</italic>, and the taxonomically restricted <italic>p61.</italic> Their RNA2 molecules also accommodate at their 5&#x2032;-ends the <italic>p15</italic> and the IR, which in addition to <italic>p11</italic> and <italic>p13</italic> in PfGSV strains, harbor some putative small ORFs encoding predicted TM small polypeptides. Interestingly, hibiscus yellow blotch virus (HYBV) (<xref ref-type="bibr" rid="B80">Olmedo-Velarde et al., 2021</xref>), a specimen considered a cilevirus with an unusual genomic organization and phylogenetically intermediate among typical cileviruses and hibiscus green spot virus 2 (HGSV-2), genus <italic>Higrevirus</italic> (<xref ref-type="bibr" rid="B72">Melzer et al., 2012</xref>), poses an additional puzzle on the composition of the RNA2 segments of typical cileviruses and the role of the ORFs in their 5&#x2032;-ends, e.g., <italic>p15</italic>. While the RNA1 in all these viruses encodes two proteins, the RdRp and P29 or P10 in typical cileviruses and HYBV, respectively, the RNA2 of HYBV resembles a truncated version of the RNA2 of typical cileviruses, lacking the 5&#x2032;- end stretch upstream the homologs of the threesome <italic>p61</italic>-<italic>p32</italic>-<italic>p24</italic> (<xref ref-type="fig" rid="F1">Figure 1</xref>). RNA2 of HYBV also includes an additional ORF, homolog to the cilevirus RNA1 <italic>p29</italic>, at its 3&#x2032;-end. The very conserved synteny and composition among the genomes of HYBV and typical cileviruses suggest, under a reductionist approach, the rather dispensable role of the 5&#x2032;-end of the RNA2 of cileviruses. Perhaps, this region only provides proteins and/or unknown regulatory non-coding RNAs that albeit non-essential could expand the viral fitness (<xref ref-type="bibr" rid="B36">Fr&#x00ED;as-Lasserre, 2012</xref>; <xref ref-type="bibr" rid="B74">Miller et al., 2016</xref>). Alternatively, we could speculate, for instance, that the putative functions encoded by this genomic region in typical cileviruses are taken over by other ORFs in HYBV, likely involving a compensatory evolutionary process (<xref ref-type="bibr" rid="B93">Rokyta et al., 2002</xref>; <xref ref-type="bibr" rid="B43">Harcombe et al., 2009</xref>). However, while the lack of function of the 5&#x2032; end of RNA2 in typical cileviruses does not appear to be a rational choice, in the absence of compelling evidence suggesting explicit roles, the analysis of <italic>p15</italic> across the genus, intrinsically, may shed some light on the 5&#x2032;-end of cilevirus RNA2.</p>
<p>In this study, we described that <italic>p15</italic> in each of the three cilevirus species is under purifying selection, suggesting an ongoing action to conserve its nucleotide sequence, most likely precluding the change of amino acid residues and preserving function at directly selected sites (<xref ref-type="bibr" rid="B21">Cvijovi&#x0107; et al., 2018</xref>). Furthermore, we detected that in some viral strains the synonymous codon usage of this ORF is biased showing the same codon patterns adopted by other viral genes and displaying COUSIN and RCDI values that suggest a certain level of adaptation to its plant hosts. We also revealed that although with extremely divergent amino acid sequences, P15 proteins show a conserved sequence motif, and their 3D models have preserved a backbone topology that resembles a helical bundle-like scaffold. Altogether, these two aspects suggest elements of common ancestry and conserved functionality of the P15 proteins (<xref ref-type="bibr" rid="B6">Arendsee et al., 2019</xref>; <xref ref-type="bibr" rid="B79">Ng et al., 2020</xref>; <xref ref-type="bibr" rid="B91">Ravantti et al., 2020</xref>).</p>
<p>The superimposition of the 3D models of P15 and the resolved crystal structures of ORF49_ HHV-8 and ORF49_MHV68 (<xref ref-type="bibr" rid="B46">Hew et al., 2017</xref>; <xref ref-type="bibr" rid="B17">Chung et al., 2018</xref>) and the P<sub><italic>CTD</italic></sub> of negative-stranded RNA viruses of the order <italic>Mononegavirales</italic> (<xref ref-type="bibr" rid="B69">Martinez et al., 2013</xref>) indicated the presence of an &#x03B1;-helical bundle-like scaffold in the cilevirus protein. Bundles of four &#x03B1;-helices are frequently found in proteins with a wide variety of functions (<xref ref-type="bibr" rid="B92">Rhys et al., 2019</xref>). ORF49 shows a fold consisting of 12 helices bundled into two pseudo-domains, with charged patches on its surface likely acting in protein-protein interaction sites. The protein is stabilized by double-stranded oligonucleotides suggesting its interaction and binding with DNA molecules (<xref ref-type="bibr" rid="B46">Hew et al., 2017</xref>). The P<sub><italic>CTD</italic></sub> of negative-stranded RNA viruses of the order <italic>Mononegavirales</italic> (<xref ref-type="bibr" rid="B69">Martinez et al., 2013</xref>) has been shown to regulate viral transcription, RNA binding, and immunomodulatory functions (<xref ref-type="bibr" rid="B79">Ng et al., 2020</xref>). The P protein of cytorhabdoviruses has weak local RNA silencing suppressor activity, strongly suppresses systemic RNA silencing, and the P<sub><italic>CTD</italic></sub> is essential for both local RNA silencing suppression and the interaction with AGO1, AGO2, AGO4, RDR6, and SGS3, proteins of the main core of the gene silencing mechanism (<xref ref-type="bibr" rid="B10">Bejerman et al., 2016</xref>; <xref ref-type="bibr" rid="B68">Mann et al., 2016</xref>). Therefore, the presence of an &#x03B1;-helical bundle-like scaffold in P15 could provide a structural base by mean it could interact with other proteins and likely mediate the plant-virus interplay. P15 from CiLV-C of the strain CRD forms homo- and heterodimers with other viral proteins (<xref ref-type="bibr" rid="B63">Leastro et al., 2018</xref>), and considering the diversity of its interactions, it appears that the protein could act modulating a wide spectrum of viral and cellular pathways, as observed in other viral proteins (<xref ref-type="bibr" rid="B24">Deblasio et al., 2018</xref>; <xref ref-type="bibr" rid="B22">Dao et al., 2020</xref>; <xref ref-type="bibr" rid="B65">Li et al., 2020</xref>).</p>
<sec id="S8.SS1">
<title>The Origin and Evolution of the 5&#x2032;-End of Cileviruses: A Theory</title>
<p>Phylogenetic inferences suggest a common ancestor between kitaviruses and a variety of arthropod-infecting viruses including negeviruses (<xref ref-type="bibr" rid="B106">Vasilakis et al., 2013</xref>; <xref ref-type="bibr" rid="B54">Kondo et al., 2019</xref>; <xref ref-type="bibr" rid="B90">Ramos-Gonz&#x00E1;lez et al., 2020</xref>; <xref ref-type="bibr" rid="B87">Quito-Avila et al., 2021</xref>). Having regards to the presence of the orthologs <italic>RdRp</italic> and <italic>p24</italic> in these viruses (<xref ref-type="bibr" rid="B59">Kuchibhatla et al., 2014</xref>), a synteny-based analysis indicates a parsimonious evolutionary scenario in which cileviruses, and kitaviruses in general, derived from a monopartite ss(+) RNA through genome segmentation. In brief, the process could have started amid assortments of defective molecules resulting from error-prone replication and recombination events (<xref ref-type="bibr" rid="B8">Bangham and Kirkwood, 1993</xref>; <xref ref-type="bibr" rid="B38">Garc&#x00ED;a-Arriaza et al., 2004</xref>; <xref ref-type="bibr" rid="B107">Vignuzzi and L&#x00F3;pez, 2019</xref>) and sub-genomic RNA transcripts derived from the ancestor. Those molecules with self-replication activity could have recruited segments supplying viral complementary functions. <italic>Cis</italic>- and <italic>trans</italic>-acting sequence motifs likely evolved providing these novel genomic segments with the capacity to coordinate essential processes for the persistence of split viral genomes, e.g., multiplication and packaging (<xref ref-type="bibr" rid="B78">Newburn and White, 2015</xref>). Further modifications of kitavirus genomes likely included the acquisition of auxiliary genes, as seems to be a general pathway in the formation of plant virome (<xref ref-type="bibr" rid="B30">Dolja et al., 2020</xref>). The clearest examples of horizontal gene transfer in the genome of current kitaviruses are epitomized by the ORF <italic>p32</italic> in cileviruses, and ORFs <italic>BMB1</italic> and <italic>2</italic>, which encode the binary movement block proteins in HGSV-2 (<xref ref-type="bibr" rid="B86">Quito-Avila et al., 2013</xref>; <xref ref-type="bibr" rid="B102">Solovyev and Morozov, 2017</xref>; <xref ref-type="bibr" rid="B42">Hao et al., 2018</xref>).</p>
<p>Multiplication of unnecessary viral sequences appears to have fitness costs and the size of virus genomes shows a natural trend to shrinkage (<xref ref-type="bibr" rid="B11">Belshaw et al., 2007</xref>; <xref ref-type="bibr" rid="B113">Zwart et al., 2014</xref>). The occurrence of IRs as long as observed in the RNA2 of cileviruses is atypical. Whether large IRs existed in the ancestors of kita/nege-like viruses, they were most likely purged by selection. The genomic organization of known negeviruses and kita/nege-like viruses are rather compact, with short stretches of intergenic sequences. Therefore, a relevant interrogation in this context is whether the IRs of cileviruses deviate from the selection rules. In a sense, most likely not, and some features might suggest it. The IR of PfGSV isolates are shorter and contain more predicted coding regions than those in CiLV-C isolates, whereas the presence of the overlapping ORF <italic>p8</italic> in CiLV-C may also indicate genomic compression.</p>
<p>As defined in this study, the 5&#x2032;-end of the RNA2 genomic segments in typical cileviruses comprises the ORF <italic>p15</italic> and the IR. There is no obvious evidence allowing for a direct connection between the 5&#x2032;-end of the cilevirus RNA2 molecules and any genetic element in the genome of their closely related negeviruses and nege/kita-like viruses. Yet, the relationship between the 5&#x2032;-end of the RNA2 molecules from cileviruses would be difficult to establish, except for the presence of a conserved amino acid motif across the P15 family and the 3D structure similarities among these proteins. These two features, moreover, may be pertinent clues suggesting the common ancestry of the ORF <italic>p15</italic> present in the cileviruses CiLV-C, CiLV-C2, and PfGSV, giving support to a theory about their common origin.</p>
<p>The origin of the 5&#x2032;-end of the RNA2 in typical cileviruses could have involved the following plausible scenarios: (<italic>i</italic>) a reminiscent ancient ORF from the monopartite ancestor or (<italic>ii</italic>) its acquisition from a heterologous source by recombination, i.e., horizontal gene transfer. Although we lack evidence to exclude any of these possibilities, our approach using prediction and computational analysis of 3D protein structures has provided information on a possible external origin of P15 proteins. The scaffold of &#x03B1;-helixes detected in P15 shows structural resemblance with those detected in well-characterized proteins from herpesvirids and mononegavirids. Importantly, the odds of the 3D structure of P15 is the result of convergence are very low considering the small number of protein folds available (<xref ref-type="bibr" rid="B67">Malik et al., 2017</xref>).</p>
<p>The external origin of the ORF <italic>p15</italic> could be considered part of the horizontal gene transfer events that likely happened in the course of kitavirus evolution, but what is peculiar in this case is the participation of a mononegavirid, probably an ancestor of the current plant rhabdoviruses, as its putative donor. Cileviruses and plant-infecting rhabdoviruses have a partially shared plant host range, and, in the case of dichorhaviruses, they also share mite vectors (<xref ref-type="bibr" rid="B28">Dietzgen et al., 2018</xref>, <xref ref-type="bibr" rid="B27">2020</xref>; <xref ref-type="bibr" rid="B35">Freitas-Ast&#x00FA;a et al., 2018</xref>). Likely, <italic>Brevipalpus</italic> mites, where dichorhavirus also multiplicate, have underpinned a primordial and essential role in the segmentation of both cileviruses and dichorhaviruses (<xref ref-type="bibr" rid="B56">Kondo et al., 2017</xref>).</p>
<p>However, whatever the scenario for the origin of <italic>p15</italic> in cileviruses, the very conservation of a sequence motif and fold in the encoded proteins indicate that the &#x201C;ancestor&#x201D; ORF was present before the radiation of, at least, CiLV-C, CiLV-C2, and PfGSV. Since then, it has been likely undergone a transformation process under selective pressure, acquiring or losing protein secondary structure elements and involving host-specific patterns to provide evolutionarily competitiveness to these viruses under new ecological conditions, e.g., the change of natural host range from arthropods to plants. Indeed, if this would be the case, <italic>p15</italic> likely became a fitness factor whose fixation might have indirectly contributed to the conservation of the IR. This large chunk of genomic sequence with apparently an unclear role is preserved between functional ORFs, that away of inactivity, it may be a zone of gain and loss of start and stop codons gradually incorporating novel ORFs.</p>
<p>Two mechanisms explaining the origin of orphan genes are currently accepted: (<italic>i</italic>) divergence from the coding sequence of a preexisting gene (<xref ref-type="bibr" rid="B103">Tautz and Domazet-Lo&#x0161;o, 2011</xref>; <xref ref-type="bibr" rid="B104">Vakirlis et al., 2020</xref>), and (<italic>ii</italic>) <italic>de novo</italic> appearance from intergenic or non-genic regions (<xref ref-type="bibr" rid="B15">Carvunis et al., 2012</xref>; <xref ref-type="bibr" rid="B105">Van Oss and Carvunis, 2019</xref>). The irregular organization and lack of identity among the small ORFs other than <italic>p15</italic> found in the 5&#x2032;-end of the RNA 2 of cileviruses support the idea that this genomic region might act as &#x201C;the cradle of new genes&#x201D; (<xref ref-type="bibr" rid="B34">Forterre and Ga&#x00EF;a, 2016</xref>). Several of the predicted ORFs found in the IR are small, of &#x2248;200 nucleotides, and they encode putative proteins where TM domains are detected by <italic>in silico</italic> analyses. Viral miniproteins with TM domains are encoded by a wide range of DNA and RNA viruses and they can bind and modulate cellular transmembrane proteins, providing flexible mechanisms to regulate several cellular activities (<xref ref-type="bibr" rid="B29">Dimaio, 2014</xref>; <xref ref-type="bibr" rid="B32">Finkel et al., 2018</xref>). Moreover, small ORFs can act as seeds for larger coding sequences (CDS) through a mechanism known as &#x201C;CDS elongation&#x201D; when their stop codons undergo mutations (<xref ref-type="bibr" rid="B20">Couso and Patraquim, 2017</xref>). Thus, small ORFs could provide an extra coding capacity that in the course of evolution could eventually become an additional source of fitness (<xref ref-type="bibr" rid="B41">Guerra-Almeida and Nunes-da-Fonseca, 2020</xref>).</p>
</sec>
</sec>
<sec id="S9">
<title>Concluding Remarks</title>
<p>In this study, we have addressed one of the most intriguing features of the molecular biology of cileviruses: the 5&#x2032; end of the RNA2 molecule and its two most salient elements, the ORF <italic>p15</italic>, and the IR. We presented pieces of evidence on the putative biochemical activity of the protein P15 over the genus and elucubrate on its putative contribution to viral fitness. The results are supported by the analyses of 58 nucleotide sequences of isolates of the three known species whose P15 amino acid sequences cover a spectrum of identity ranging from 14 to 100%. The main predictions have been obtained by integrating the results of different computational methods because of the premise that each of these elements taken in isolation does not allow to draw solid conclusions about the activity and function of a given protein (<xref ref-type="bibr" rid="B81">Omer et al., 2017</xref>; <xref ref-type="bibr" rid="B84">Ponting, 2017</xref>; <xref ref-type="bibr" rid="B91">Ravantti et al., 2020</xref>). Taken them together, we posit a theory that partially describes the evolution of the cilevirus genomes that involves elements denoting the likely close interplay during the coevolutionary history of <italic>Brevipalpus</italic>-transmitted viruses. In light of the plausible scenario described by the theory, two aspects of the biology of cileviruses gain new distinctions: (<italic>i</italic>) since <italic>p15</italic> across cileviruses can be considered alleles descendant from a common ancestor likely co-opted from another virus, this ORF should not be longer considered an orphan gene, and (<italic>ii</italic>) the markedly chimeric origin of the RNA2 genome segment, which adds further support to what appears to be a common feature of members of the family <italic>Kitaviridae</italic> (<xref ref-type="bibr" rid="B76">Morozov et al., 2020</xref>).</p>
</sec>
<sec sec-type="data-availability" id="S10">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="DS1">Supplementary Material</xref>, further inquiries can be directed to the corresponding authors.</p>
</sec>
<sec id="S11">
<title>Author Contributions</title>
<p>PR-G and TP: conceptualization. PR-G, TP, CC-J, and GA: formal analysis, investigation, and methodology. JF-A: funding acquisition. TP and JF-A: resources. PR-G and TP: writing&#x2014;original draft. PR-G, TP, CC-J, GA, and JF-A: writing&#x2014;review and editing. All authors approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="S14">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<p>The authors are grateful to CAPES, CNPq, and FAPESP for scholarships and research grants associated with this work (CAPES PNPD20132154 &#x2013; 33141010001P4 &#x2013; PNPD &#x2013; IBSP, 88882.157041/2017-01; CNPq 312528/2020-5; FAPESP 2017/50222-0, 2019/25078-9, 2019/02137-0, and 2020/15413-2). The authors also thank the reviewers and editor for their useful and constructive comments on our work.</p>
</ack>
<sec id="S13" sec-type="supplementary material"><title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2021.771983/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpls.2021.771983/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="DS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.docx" id="DS2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afgan</surname> <given-names>E.</given-names></name> <name><surname>Baker</surname> <given-names>D.</given-names></name> <name><surname>Batut</surname> <given-names>B.</given-names></name> <name><surname>van den Beek</surname> <given-names>M.</given-names></name> <name><surname>Bouvier</surname> <given-names>D.</given-names></name> <name><surname>Cech</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>W537</fpage>&#x2013;<lpage>W544</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky379</pub-id> <pub-id pub-id-type="pmid">29790989</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Falk</surname> <given-names>B. W.</given-names></name> <name><surname>Casteel</surname> <given-names>C. L.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Machado</surname> <given-names>M. A.</given-names></name></person-group> (<year>2020</year>). <article-title>Plant Immune System Activation Upon Citrus Leprosis Virus C Infection Is Mimicked by the Ectopic Expression of the P61 Viral Protein.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>11</volume>:<issue>1188</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2020.01188</pub-id> <pub-id pub-id-type="pmid">32849736</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Nunes</surname> <given-names>M. A.</given-names></name> <name><surname>Alves</surname> <given-names>M. R.</given-names></name> <name><surname>Camargo</surname> <given-names>L. E. A.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Citrus leprosis virus C infection results in hypersensitive-like response, suppression of the JA/ET plant defense pathway and promotion of the colonization of its mite vector.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>7</volume>:<issue>1757</issue>. <pub-id pub-id-type="doi">10.3389/FPLS.2016.01757</pub-id> <pub-id pub-id-type="pmid">27933078</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Nunes</surname> <given-names>M. A.</given-names></name> <name><surname>Jesus</surname> <given-names>C. C.</given-names></name> <name><surname>Calegario</surname> <given-names>R. F.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title><italic>Arabidopsis thaliana</italic> as a model host for <italic>Brevipalpus</italic> mite-transmitted viruses.</article-title> <source><italic>Sci. Agric.</italic></source> <volume>74</volume> <fpage>85</fpage>&#x2013;<lpage>89</lpage>. <pub-id pub-id-type="doi">10.1590/1678-992x-2015-0380</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arendsee</surname> <given-names>Z. W.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Wurtele</surname> <given-names>E. S.</given-names></name></person-group> (<year>2014</year>). <article-title>Coming of age: Orphan genes in plants.</article-title> <source><italic>Trends Plant Sci.</italic></source> <volume>19</volume> <fpage>698</fpage>&#x2013;<lpage>708</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2014.07.003</pub-id> <pub-id pub-id-type="pmid">25151064</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arendsee</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Singh</surname> <given-names>U.</given-names></name> <name><surname>Bhandary</surname> <given-names>P.</given-names></name> <name><surname>Seetharam</surname> <given-names>A.</given-names></name> <name><surname>Wurtele</surname> <given-names>E. S.</given-names></name></person-group> (<year>2019</year>). <article-title>Fagin: Synteny-based phylostratigraphy and finer classification of young genes.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>20</volume>:<issue>440</issue>. <pub-id pub-id-type="doi">10.1186/s12859-019-3023-y</pub-id> <pub-id pub-id-type="pmid">31455236</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Assenberg</surname> <given-names>R.</given-names></name> <name><surname>Delmas</surname> <given-names>O.</given-names></name> <name><surname>Ren</surname> <given-names>J.</given-names></name> <name><surname>Vidalain</surname> <given-names>P.-O.</given-names></name> <name><surname>Verma</surname> <given-names>A.</given-names></name> <name><surname>Larrous</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Structure of the Nucleoprotein Binding Domain of Mokola Virus Phosphoprotein.</article-title> <source><italic>J. Virol.</italic></source> <volume>84</volume> <fpage>1089</fpage>&#x2013;<lpage>1096</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.01520-09</pub-id> <pub-id pub-id-type="pmid">19906936</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bangham</surname> <given-names>C. R. M.</given-names></name> <name><surname>Kirkwood</surname> <given-names>T. B. L.</given-names></name></person-group> (<year>1993</year>). <article-title>Defective interfering particles and virus evolution.</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>1</volume> <fpage>260</fpage>&#x2013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1016/0966-842X(93)90048-V</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Behura</surname> <given-names>S. K.</given-names></name> <name><surname>Severson</surname> <given-names>D. W.</given-names></name></person-group> (<year>2013</year>). <article-title>Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes.</article-title> <source><italic>Biol. Rev.</italic></source> <volume>88</volume> <fpage>49</fpage>&#x2013;<lpage>61</lpage>. <pub-id pub-id-type="doi">10.1111/J.1469-185X.2012.00242.X</pub-id> <pub-id pub-id-type="pmid">22889422</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bejerman</surname> <given-names>N.</given-names></name> <name><surname>Mann</surname> <given-names>K. S.</given-names></name> <name><surname>Dietzgen</surname> <given-names>R. G.</given-names></name></person-group> (<year>2016</year>). <article-title>Alfalfa dwarf cytorhabdovirus P protein is a local and systemic RNA silencing supressor which inhibits programmed RISC activity and prevents transitive amplification of RNA silencing.</article-title> <source><italic>Virus Res.</italic></source> <volume>224</volume> <fpage>19</fpage>&#x2013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1016/j.virusres.2016.08.008</pub-id> <pub-id pub-id-type="pmid">27543392</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Belshaw</surname> <given-names>R.</given-names></name> <name><surname>Pybus</surname> <given-names>O. G.</given-names></name> <name><surname>Rambaut</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>The evolution of genome compression and genomic novelty in RNA viruses.</article-title> <source><italic>Genome Res.</italic></source> <volume>17</volume> <fpage>1496</fpage>&#x2013;<lpage>1504</lpage>. <pub-id pub-id-type="doi">10.1101/gr.6305707</pub-id> <pub-id pub-id-type="pmid">17785537</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Biswas</surname> <given-names>K. K.</given-names></name> <name><surname>Palchoudhury</surname> <given-names>S.</given-names></name> <name><surname>Chakraborty</surname> <given-names>P.</given-names></name> <name><surname>Bhattacharyya</surname> <given-names>U. K.</given-names></name> <name><surname>Ghosh</surname> <given-names>D. K.</given-names></name> <name><surname>Debnath</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Codon usage bias analysis of citrus tristeza virus: Higher codon adaptation to citrus reticulata host.</article-title> <source><italic>Viruses</italic></source> <volume>11</volume>:<issue>v11040331</issue>. <pub-id pub-id-type="doi">10.3390/v11040331</pub-id> <pub-id pub-id-type="pmid">30965565</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bourret</surname> <given-names>J.</given-names></name> <name><surname>Alizon</surname> <given-names>S.</given-names></name> <name><surname>Bravo</surname> <given-names>I. G.</given-names></name></person-group> (<year>2019</year>). <article-title>COUSIN (COdon Usage Similarity INdex): A Normalized Measure of Codon Usage Preferences.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>11</volume> <fpage>3523</fpage>&#x2013;<lpage>3528</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evz262</pub-id> <pub-id pub-id-type="pmid">31800035</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butt</surname> <given-names>A. M.</given-names></name> <name><surname>Nasrullah</surname> <given-names>I.</given-names></name> <name><surname>Qamar</surname> <given-names>R.</given-names></name> <name><surname>Tong</surname> <given-names>Y.</given-names></name></person-group> (<year>2016</year>). <article-title>Evolution of codon usage in Zika virus genomes is host and vector specific.</article-title> <source><italic>Emerg. Microbes Infect.</italic></source> <volume>5</volume> <fpage>1</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1038/emi.2016.106</pub-id> <pub-id pub-id-type="pmid">27729643</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carvunis</surname> <given-names>A. R.</given-names></name> <name><surname>Rolland</surname> <given-names>T.</given-names></name> <name><surname>Wapinski</surname> <given-names>I.</given-names></name> <name><surname>Calderwood</surname> <given-names>M. A.</given-names></name> <name><surname>Yildirim</surname> <given-names>M. A.</given-names></name> <name><surname>Simonis</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Proto-genes and de novo gene birth.</article-title> <source><italic>Nature</italic></source> <volume>487</volume> <fpage>370</fpage>&#x2013;<lpage>374</lpage>. <pub-id pub-id-type="doi">10.1038/nature11184</pub-id> <pub-id pub-id-type="pmid">22722833</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chabi-Jesus</surname> <given-names>C.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Postclam-Barro</surname> <given-names>M.</given-names></name> <name><surname>Fontenele</surname> <given-names>R. S.</given-names></name> <name><surname>Harakava</surname> <given-names>R.</given-names></name> <name><surname>Bassanezi</surname> <given-names>R. B.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Molecular Epidemiology of Citrus Leprosis Virus C: A New Viral Lineage and Phylodynamic of the Main Viral Subpopulations in the Americas.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>12</volume>:<issue>986</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.641252</pub-id> <pub-id pub-id-type="pmid">33995302</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>W. C.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>B. C.</given-names></name> <name><surname>Kang</surname> <given-names>H. R.</given-names></name> <name><surname>Son</surname> <given-names>J.</given-names></name> <name><surname>Ki</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Structure-based mechanism of action of a viral poly(ADP-ribose) polymerase 1-interacting protein facilitating virus replication.</article-title> <source><italic>IUCrJ</italic></source> <volume>5</volume> <fpage>866</fpage>&#x2013;<lpage>879</lpage>. <pub-id pub-id-type="doi">10.1107/S2052252518013854</pub-id> <pub-id pub-id-type="pmid">30443370</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Colson</surname> <given-names>P.</given-names></name> <name><surname>La Scola</surname> <given-names>B.</given-names></name> <name><surname>Levasseur</surname> <given-names>A.</given-names></name> <name><surname>Caetano-Anoll&#x00E9;s</surname> <given-names>G.</given-names></name> <name><surname>Raoult</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <article-title>Mimivirus: leading the way in the discovery of giant viruses of amoebae.</article-title> <source><italic>Nat. Rev. Microbiol.</italic></source> <volume>15</volume> <fpage>243</fpage>&#x2013;<lpage>254</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro.2016.197</pub-id> <pub-id pub-id-type="pmid">28239153</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Comeron</surname> <given-names>J. M.</given-names></name> <name><surname>Aguad&#x00E9;</surname> <given-names>M.</given-names></name></person-group> (<year>1998</year>). <article-title>An evaluation of measures of synonymous codon usage bias.</article-title> <source><italic>J. Mol. Evol.</italic></source> <volume>47</volume> <fpage>268</fpage>&#x2013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1007/PL00006384</pub-id> <pub-id pub-id-type="pmid">9732453</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Couso</surname> <given-names>J. P.</given-names></name> <name><surname>Patraquim</surname> <given-names>P.</given-names></name></person-group> (<year>2017</year>). <article-title>Classification and function of small open reading frames.</article-title> <source><italic>Nat. Rev. Mol. Cell Biol.</italic></source> <volume>18</volume> <fpage>575</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1038/nrm.2017.58</pub-id> <pub-id pub-id-type="pmid">28698598</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cvijovi&#x0107;</surname> <given-names>I.</given-names></name> <name><surname>Good</surname> <given-names>B. H.</given-names></name> <name><surname>Desai</surname> <given-names>M. M.</given-names></name></person-group> (<year>2018</year>). <article-title>The effect of strong purifying selection on genetic diversity.</article-title> <source><italic>Genetics</italic></source> <volume>209</volume> <fpage>1235</fpage>&#x2013;<lpage>1278</lpage>. <pub-id pub-id-type="doi">10.1534/genetics.118.301058</pub-id> <pub-id pub-id-type="pmid">29844134</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dao</surname> <given-names>T. N. M.</given-names></name> <name><surname>Kang</surname> <given-names>S.-H.</given-names></name> <name><surname>Bak</surname> <given-names>A.</given-names></name> <name><surname>Folimonova</surname> <given-names>S. Y.</given-names></name></person-group> (<year>2020</year>). <article-title>A Non-Conserved p33 Protein of Citrus Tristeza Virus Interacts with Multiple Viral Partners.</article-title> <source><italic>Mol. Plant Microbe Interact.</italic></source> <volume>33</volume> <fpage>859</fpage>&#x2013;<lpage>870</lpage>. <pub-id pub-id-type="doi">10.1094/MPMI-11-19-0328-FI</pub-id> <pub-id pub-id-type="pmid">32141354</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Castro</surname> <given-names>E.</given-names></name> <name><surname>Sigrist</surname> <given-names>C.</given-names></name> <name><surname>Gattiker</surname> <given-names>A.</given-names></name> <name><surname>Bulliard</surname> <given-names>V.</given-names></name> <name><surname>Langendijk-Genevaux</surname> <given-names>P.</given-names></name> <name><surname>Gasteiger</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>34</volume> <fpage>362</fpage>&#x2013;<lpage>365</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkl124</pub-id> <pub-id pub-id-type="pmid">16845026</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Deblasio</surname> <given-names>S. L.</given-names></name> <name><surname>Xu</surname> <given-names>Y.</given-names></name> <name><surname>Johnson</surname> <given-names>R. S.</given-names></name> <name><surname>Rebelo</surname> <given-names>A. R.</given-names></name> <name><surname>Maccoss</surname> <given-names>M. J.</given-names></name> <name><surname>Gray</surname> <given-names>S. M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>The interaction dynamics of two potato leafroll virus movement proteins affects their localization to the outer membranes of mitochondria and plastids.</article-title> <source><italic>Viruses</italic></source> <volume>10</volume>:<issue>v10110585</issue>. <pub-id pub-id-type="doi">10.3390/v10110585</pub-id> <pub-id pub-id-type="pmid">30373157</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delano</surname> <given-names>W.</given-names></name></person-group> (<year>2002</year>). <article-title>PyMOL: An open-source molecular graphics tool.</article-title> <source><italic>CCP4 Newsl. Protein Crystallogr.</italic></source> <volume>40</volume> <fpage>82</fpage>&#x2013;<lpage>92</lpage>.</citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delmas</surname> <given-names>O.</given-names></name> <name><surname>Assenberg</surname> <given-names>R.</given-names></name> <name><surname>Grimes</surname> <given-names>J. M.</given-names></name> <name><surname>Bourhy</surname> <given-names>H.</given-names></name></person-group> (<year>2010</year>). <article-title>The structure of the nucleoprotein binding domain of lyssavirus phosphoprotein reveals a structural relationship between the N-RNA binding domains of Rhabdoviridae and Paramyxoviridae</article-title>. <source><italic>RNA Biol</italic></source>. <volume>7</volume>, <fpage>322</fpage>&#x2013;<lpage>327</lpage>. <pub-id pub-id-type="doi">10.4161/rna.7.3.11931</pub-id> <pub-id pub-id-type="pmid">20458178</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dietzgen</surname> <given-names>R. G.</given-names></name> <name><surname>Bejerman</surname> <given-names>N. E.</given-names></name> <name><surname>Goodin</surname> <given-names>M. M.</given-names></name> <name><surname>Higgins</surname> <given-names>C. M.</given-names></name> <name><surname>Huot</surname> <given-names>O. B.</given-names></name> <name><surname>Kondo</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Diversity and epidemiology of plant rhabdoviruses.</article-title> <source><italic>Virus Res.</italic></source> <volume>281</volume>:<issue>197942</issue>. <pub-id pub-id-type="doi">10.1016/j.virusres.2020.197942</pub-id> <pub-id pub-id-type="pmid">32201209</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dietzgen</surname> <given-names>R. G.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Chabi-Jesus</surname> <given-names>C.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Goodin</surname> <given-names>M. M.</given-names></name> <name><surname>Kondo</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2018</year>). &#x201C;<article-title>Dichorhaviruses in their Host Plants and Mite Vectors</article-title>,&#x201D; in <source><italic>Advances in Virus Research</italic></source>, (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Academic Press Inc</publisher-name>), <fpage>119</fpage>&#x2013;<lpage>148</lpage>. <pub-id pub-id-type="doi">10.1016/bs.aivir.2018.06.001</pub-id> <pub-id pub-id-type="pmid">30266171</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dimaio</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Viral miniproteins.</article-title> <source><italic>Annu. Rev. Microbiol.</italic></source> <volume>68</volume> <fpage>21</fpage>&#x2013;<lpage>43</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-micro-091313-103727</pub-id> <pub-id pub-id-type="pmid">24742054</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dolja</surname> <given-names>V. V.</given-names></name> <name><surname>Krupovic</surname> <given-names>M.</given-names></name> <name><surname>Koonin</surname> <given-names>E. V.</given-names></name></person-group> (<year>2020</year>). <article-title>Deep Roots and Splendid Boughs of the Global Plant Virome.</article-title> <source><italic>Annu. Rev. Phytopathol.</italic></source> <volume>58</volume> <fpage>23</fpage>&#x2013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1146/ANNUREV-PHYTO-030320-041346</pub-id> <pub-id pub-id-type="pmid">32459570</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Entwistle</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>Y.</given-names></name></person-group> (<year>2019</year>). <article-title>Orphan Genes Shared by Pathogenic Genomes Are More Associated with Bacterial Pathogenicity.</article-title> <source><italic>mSystems</italic></source> <volume>4</volume>:<issue>18</issue>. <pub-id pub-id-type="doi">10.1128/msystems.00290-18</pub-id> <pub-id pub-id-type="pmid">30801025</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finkel</surname> <given-names>Y.</given-names></name> <name><surname>Stern-Ginossar</surname> <given-names>N.</given-names></name> <name><surname>Schwartz</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Viral Short ORFs and Their Possible Functions.</article-title> <source><italic>Proteomics</italic></source> <volume>18</volume>:<issue>201700255</issue>. <pub-id pub-id-type="doi">10.1002/pmic.201700255</pub-id> <pub-id pub-id-type="pmid">29150926</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fischer</surname> <given-names>D.</given-names></name> <name><surname>Eisenberg</surname> <given-names>D.</given-names></name></person-group> (<year>1999</year>). <article-title>Finding families for genomic ORFans.</article-title> <source><italic>Bioinformatics</italic></source> <volume>15</volume> <fpage>759</fpage>&#x2013;<lpage>762</lpage>.</citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Forterre</surname> <given-names>P.</given-names></name> <name><surname>Ga&#x00EF;a</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Giant viruses and the origin of modern eukaryotes.</article-title> <source><italic>Curr. Opin. Microbiol.</italic></source> <volume>31</volume> <fpage>44</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1016/j.mib.2016.02.001</pub-id> <pub-id pub-id-type="pmid">26894379</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Tassi</surname> <given-names>A. D.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name></person-group> (<year>2018</year>). <article-title>Brevipalpus-transmitted viruses: parallelism beyond a common vector or convergent evolution of distantly related pathogens?</article-title> <source><italic>Curr. Opin. Virol.</italic></source> <volume>33</volume> <fpage>66</fpage>&#x2013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1016/J.COVIRO.2018.07.010</pub-id> <pub-id pub-id-type="pmid">30081359</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fr&#x00ED;as-Lasserre</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Non coding RNAs and viruses in the framework of the phylogeny of the genes, epigenesis and heredity.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>13</volume> <fpage>477</fpage>&#x2013;<lpage>490</lpage>. <pub-id pub-id-type="doi">10.3390/ijms13010477</pub-id> <pub-id pub-id-type="pmid">22312265</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galili</surname> <given-names>T.</given-names></name> <name><surname>O&#x2019;Callaghan</surname> <given-names>A.</given-names></name> <name><surname>Sidi</surname> <given-names>J.</given-names></name> <name><surname>Sievert</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>Heatmaply: An R package for creating interactive cluster heatmaps for online publishing.</article-title> <source><italic>Bioinformatics</italic></source> <volume>34</volume> <fpage>1600</fpage>&#x2013;<lpage>1602</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btx657</pub-id> <pub-id pub-id-type="pmid">29069305</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garc&#x00ED;a-Arriaza</surname> <given-names>J.</given-names></name> <name><surname>Manrubia</surname> <given-names>S. C.</given-names></name> <name><surname>Toja</surname> <given-names>M.</given-names></name> <name><surname>Domingo</surname> <given-names>E.</given-names></name> <name><surname>Escarm&#x00ED;s</surname> <given-names>C.</given-names></name></person-group> (<year>2004</year>). <article-title>Evolutionary Transition toward Defective RNAs That Are Infectious by Complementation.</article-title> <source><italic>J. Virol.</italic></source> <volume>78</volume> <fpage>11678</fpage>&#x2013;<lpage>11685</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.78.21.11678-11685.2004</pub-id> <pub-id pub-id-type="pmid">15479809</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garita</surname> <given-names>L.</given-names></name> <name><surname>Tassi</surname> <given-names>A. D.</given-names></name> <name><surname>Calegario</surname> <given-names>R. F.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Carbonell</surname> <given-names>S. A. M.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Common bean (Phaseolus vulgaris L.): Experimental local lesion host for citrus leprosis virus C (CiLV-C) and some other cytoplasmic-type Brevipalpus-transmitted viruses.</article-title> <source><italic>Plant Dis.</italic></source> <volume>97</volume> <fpage>1346</fpage>&#x2013;<lpage>1351</lpage>. <pub-id pub-id-type="doi">10.1094/PDIS-12-12-1143-RE</pub-id> <pub-id pub-id-type="pmid">30722150</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonz&#x00E1;lez</surname> <given-names>C. M.</given-names></name> <name><surname>Wong</surname> <given-names>E. L.</given-names></name> <name><surname>Bowser</surname> <given-names>B. S.</given-names></name> <name><surname>Hong</surname> <given-names>G. K.</given-names></name> <name><surname>Kenney</surname> <given-names>S.</given-names></name> <name><surname>Damania</surname> <given-names>B.</given-names></name></person-group> (<year>2006</year>). <article-title>Identification and Characterization of the Orf49 Protein of Kaposi&#x2019;s Sarcoma-Associated Herpesvirus.</article-title> <source><italic>J. Virol.</italic></source> <volume>80</volume> <fpage>3062</fpage>&#x2013;<lpage>3070</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.80.6.3062-3070.2006</pub-id> <pub-id pub-id-type="pmid">16501115</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guerra-Almeida</surname> <given-names>D.</given-names></name> <name><surname>Nunes-da-Fonseca</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>Small Open Reading Frames: How Important Are They for Molecular Evolution?</article-title> <source><italic>Front. Genet.</italic></source> <volume>11</volume>:<issue>574737</issue>. <pub-id pub-id-type="doi">10.3389/fgene.2020.574737</pub-id> <pub-id pub-id-type="pmid">33193682</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hao</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Zhao</surname> <given-names>F.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Qian</surname> <given-names>W.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Discovery of Plant Viruses From Tea Plant (Camellia sinensis (L.) O. Kuntze) by Metagenomic Sequencing.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>9</volume>:<issue>2175</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.02175</pub-id> <pub-id pub-id-type="pmid">30254625</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harcombe</surname> <given-names>W. R.</given-names></name> <name><surname>Springman</surname> <given-names>R.</given-names></name> <name><surname>Bull</surname> <given-names>J. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Compensatory evolution for a gene deletion is not limited to its immediate functional network.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>9</volume>:<issue>106</issue>. <pub-id pub-id-type="doi">10.1186/1471-2148-9-106</pub-id> <pub-id pub-id-type="pmid">19445716</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hasegawa</surname> <given-names>M.</given-names></name> <name><surname>Kishino</surname> <given-names>H.</given-names></name> <name><surname>Yano aki</surname> <given-names>T.</given-names></name></person-group> (<year>1985</year>). <article-title>Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.</article-title> <source><italic>J. Mol. Evol.</italic></source> <volume>22</volume> <fpage>160</fpage>&#x2013;<lpage>174</lpage>. <pub-id pub-id-type="doi">10.1007/BF02101694</pub-id> <pub-id pub-id-type="pmid">3934395</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>Z.</given-names></name> <name><surname>Gan</surname> <given-names>H.</given-names></name> <name><surname>Liang</surname> <given-names>X.</given-names></name></person-group> (<year>2019</year>). <article-title>Analysis of synonymous codon usage bias in potato virus M and its adaption to hosts.</article-title> <source><italic>Viruses</italic></source> <volume>11</volume>:<issue>v11080752</issue>. <pub-id pub-id-type="doi">10.3390/v11080752</pub-id> <pub-id pub-id-type="pmid">31416257</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hew</surname> <given-names>K.</given-names></name> <name><surname>Veerappan</surname> <given-names>S.</given-names></name> <name><surname>Sim</surname> <given-names>D.</given-names></name> <name><surname>Cornvik</surname> <given-names>T.</given-names></name> <name><surname>Nordlund</surname> <given-names>P.</given-names></name> <name><surname>Dahlroth</surname> <given-names>S.-L.</given-names></name></person-group> (<year>2017</year>). <article-title>Structure of the Open Reading Frame 49 Protein Encoded by Kaposi&#x2019;s Sarcoma-Associated Herpesvirus.</article-title> <source><italic>J. Virol.</italic></source> <volume>91</volume>:<issue>16</issue>. <pub-id pub-id-type="doi">10.1128/jvi.01947-16</pub-id> <pub-id pub-id-type="pmid">27807232</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ivanov</surname> <given-names>I.</given-names></name> <name><surname>Cr&#x00E9;pin</surname> <given-names>T.</given-names></name> <name><surname>Jamin</surname> <given-names>M.</given-names></name> <name><surname>Ruigrok</surname> <given-names>R. W. H.</given-names></name></person-group> (<year>2010</year>). <article-title>Structure of the Dimerization Domain of the Rabies Virus Phosphoprotein.</article-title> <source><italic>J. Virol.</italic></source> <volume>84</volume> <fpage>3707</fpage>&#x2013;<lpage>3710</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.02557-09</pub-id> <pub-id pub-id-type="pmid">20089657</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Standley</surname> <given-names>D. M.</given-names></name></person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>30</volume> <fpage>772</fpage>&#x2013;<lpage>780</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id> <pub-id pub-id-type="pmid">23329690</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaur Saini</surname> <given-names>H.</given-names></name> <name><surname>Fischer</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Structural and functional insights into Mimivirus ORFans.</article-title> <source><italic>BMC Genomics</italic></source> <volume>8</volume>:<issue>115</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-8-115</pub-id> <pub-id pub-id-type="pmid">17490476</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khandia</surname> <given-names>R.</given-names></name> <name><surname>Singhal</surname> <given-names>S.</given-names></name> <name><surname>Kumar</surname> <given-names>U.</given-names></name> <name><surname>Ansari</surname> <given-names>A.</given-names></name> <name><surname>Tiwari</surname> <given-names>R.</given-names></name> <name><surname>Dhama</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Analysis of nipah virus codon usage and adaptation to hosts.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>10</volume>:<issue>886</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2019.00886</pub-id> <pub-id pub-id-type="pmid">31156564</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Alberti</surname> <given-names>G.</given-names></name></person-group> (<year>2014</year>). <article-title>Anatomy and fine structure of <italic>Bevipalpus</italic> mites (Tenuipalpidae) &#x2013; Economically important plant virus vectors. Part 7. Ultrastructural detection of cytoplasmic and nuclear types of <italic>Brevipalpus</italic> transmitted viruses.</article-title> <source><italic>Zoologica</italic></source> <volume>160</volume> <fpage>174</fpage>&#x2013;<lpage>192</lpage>.</citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Rodrigues</surname> <given-names>J. C. V.</given-names></name> <name><surname>Freitas-Astua</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>An annotated list of ornamentals naturally found infected by <italic>Brevipalpus</italic> mite-transmitted viruses.</article-title> <source><italic>Sci. Agric.</italic></source> <volume>67</volume> <fpage>348</fpage>&#x2013;<lpage>371</lpage>. <pub-id pub-id-type="doi">10.1590/S0103-90162010000300014</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Konat&#x00E9;</surname> <given-names>M. M.</given-names></name> <name><surname>Plata</surname> <given-names>G.</given-names></name> <name><surname>Park</surname> <given-names>J.</given-names></name> <name><surname>Usmanova</surname> <given-names>D. R.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Vitkup</surname> <given-names>D.</given-names></name></person-group> (<year>2019</year>). <article-title>Molecular function limits divergent protein evolution on planetary timescales.</article-title> <source><italic>Elife</italic></source> <volume>8</volume>:<issue>39705</issue>. <pub-id pub-id-type="doi">10.7554/eLife.39705</pub-id> <pub-id pub-id-type="pmid">31532392</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kondo</surname> <given-names>H.</given-names></name> <name><surname>Chiba</surname> <given-names>S.</given-names></name> <name><surname>Maruyama</surname> <given-names>K.</given-names></name> <name><surname>Andika</surname> <given-names>I. B.</given-names></name> <name><surname>Suzuki</surname> <given-names>N.</given-names></name></person-group> (<year>2019</year>). <article-title>A novel insect-infecting virga/nege-like virus group and its pervasive endogenization into insect genomes.</article-title> <source><italic>Virus Res.</italic></source> <volume>262</volume> <fpage>37</fpage>&#x2013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.1016/j.virusres.2017.11.020</pub-id> <pub-id pub-id-type="pmid">29169832</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kondo</surname> <given-names>H.</given-names></name> <name><surname>Fujita</surname> <given-names>M.</given-names></name> <name><surname>Hisano</surname> <given-names>H.</given-names></name> <name><surname>Hyodo</surname> <given-names>K.</given-names></name> <name><surname>Andika</surname> <given-names>I. B.</given-names></name> <name><surname>Suzuki</surname> <given-names>N.</given-names></name></person-group> (<year>2020</year>). <article-title>Virome Analysis of Aphid Populations That Infest the Barley Field: The Discovery of Two Novel Groups of Nege/Kita-Like Viruses and Other Novel RNA Viruses.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>11</volume>:<issue>509</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2020.00509</pub-id> <pub-id pub-id-type="pmid">32318034</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kondo</surname> <given-names>H.</given-names></name> <name><surname>Hirota</surname> <given-names>K.</given-names></name> <name><surname>Maruyama</surname> <given-names>K.</given-names></name> <name><surname>Andika</surname> <given-names>I. B.</given-names></name> <name><surname>Suzuki</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). <article-title>A possible occurrence of genome reassortment among bipartite rhabdoviruses.</article-title> <source><italic>Virology</italic></source> <volume>508</volume> <fpage>18</fpage>&#x2013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2017.04.027</pub-id> <pub-id pub-id-type="pmid">28478311</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koonin</surname> <given-names>E. V.</given-names></name> <name><surname>Senkevich</surname> <given-names>T. G.</given-names></name> <name><surname>Dolja</surname> <given-names>V. V.</given-names></name></person-group> (<year>2006</year>). <article-title>The ancient virus world and evolution of cells.</article-title> <source><italic>Biol. Direct</italic></source> <volume>1</volume>:<issue>29</issue>. <pub-id pub-id-type="doi">10.1186/1745-6150-1-29</pub-id> <pub-id pub-id-type="pmid">16984643</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krissinel</surname> <given-names>E.</given-names></name> <name><surname>Henrick</surname> <given-names>K.</given-names></name></person-group> (<year>2004</year>). <article-title>Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions.</article-title> <source><italic>Acta Crystallogr. Sect. D Biol. Crystallogr.</italic></source> <volume>60</volume> <fpage>2256</fpage>&#x2013;<lpage>2268</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444904026460</pub-id> <pub-id pub-id-type="pmid">15572779</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuchibhatla</surname> <given-names>D. B.</given-names></name> <name><surname>Sherman</surname> <given-names>W. A.</given-names></name> <name><surname>Chung</surname> <given-names>B. Y. W.</given-names></name> <name><surname>Cook</surname> <given-names>S.</given-names></name> <name><surname>Schneider</surname> <given-names>G.</given-names></name> <name><surname>Eisenhaber</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently &#x201C;orphan&#x201D; viral proteins.</article-title> <source><italic>J. Virol.</italic></source> <volume>88</volume> <fpage>10</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.02595-13</pub-id> <pub-id pub-id-type="pmid">24155369</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Stecher</surname> <given-names>G.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Knyaz</surname> <given-names>C.</given-names></name> <name><surname>Tamura</surname> <given-names>K.</given-names></name></person-group> (<year>2018</year>). <article-title>MEGA X: Molecular evolutionary genetics analysis across computing platforms.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>35</volume> <fpage>1547</fpage>&#x2013;<lpage>1549</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msy096</pub-id> <pub-id pub-id-type="pmid">29722887</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lazareva</surname> <given-names>E. A.</given-names></name> <name><surname>Lezzhov</surname> <given-names>A. A.</given-names></name> <name><surname>Chergintsev</surname> <given-names>D. A.</given-names></name> <name><surname>Golyshev</surname> <given-names>S. A.</given-names></name> <name><surname>Dolja</surname> <given-names>V. V.</given-names></name> <name><surname>Morozov</surname> <given-names>S. Y.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>Reticulon-like properties of a plant virus-encoded movement protein.</article-title> <source><italic>New Phytol.</italic></source> <volume>229</volume> <fpage>1052</fpage>&#x2013;<lpage>1066</lpage>. <pub-id pub-id-type="doi">10.1111/nph.16905</pub-id> <pub-id pub-id-type="pmid">32866987</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leastro</surname> <given-names>M. O.</given-names></name> <name><surname>Castro</surname> <given-names>D. Y. O.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Pall&#x00E1;s</surname> <given-names>V.</given-names></name> <name><surname>S&#x00E1;nchez-Navarro</surname> <given-names>J. &#x00C1;</given-names></name></person-group> (<year>2020</year>). <article-title>Citrus Leprosis Virus C Encodes Three Proteins With Gene Silencing Suppression Activity.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>11</volume>:<issue>1231</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2020.01231</pub-id> <pub-id pub-id-type="pmid">32655520</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leastro</surname> <given-names>M. O.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Silva</surname> <given-names>M. S.</given-names></name> <name><surname>Resende</surname> <given-names>R. O.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Dissecting the Subcellular Localization, Intracellular Trafficking, Interactions, Membrane Association, and Topology of Citrus Leprosis Virus C Proteins.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>9</volume>:<issue>1299</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2018.01299</pub-id> <pub-id pub-id-type="pmid">30254655</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Interactive Tree Of Life (iTOL) v4: recent updates and new developments.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>W256</fpage>&#x2013;<lpage>W259</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz239</pub-id> <pub-id pub-id-type="pmid">30931475</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Li</surname> <given-names>F.</given-names></name> <name><surname>Zhang</surname> <given-names>M.</given-names></name> <name><surname>Gong</surname> <given-names>P.</given-names></name> <name><surname>Zhou</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). <article-title>Dynamic Subcellular Localization, Accumulation, and Interactions of Proteins From Tomato Yellow Leaf Curl China Virus and Its Associated Betasatellite.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>11</volume>:<issue>840</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2020.00840</pub-id> <pub-id pub-id-type="pmid">32612626</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Locali-Fabris</surname> <given-names>E. C.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Souza</surname> <given-names>A. A.</given-names></name> <name><surname>Takita</surname> <given-names>M. A.</given-names></name> <name><surname>Ast&#x00FA;a-Monge</surname> <given-names>G.</given-names></name> <name><surname>Antonioli-Luizon</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Complete nucleotide sequence, genomic organization and phylogenetic analysis of citrus leprosis virus cytoplasmic type.</article-title> <source><italic>J. Gen. Virol.</italic></source> <volume>87</volume> <fpage>2721</fpage>&#x2013;<lpage>2729</lpage>. <pub-id pub-id-type="doi">10.1099/vir.0.82038-0</pub-id> <pub-id pub-id-type="pmid">16894213</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Malik</surname> <given-names>S. S.</given-names></name> <name><surname>Azem-e-Zahra</surname> <given-names>S.</given-names></name> <name><surname>Kim</surname> <given-names>K. M.</given-names></name> <name><surname>Caetano-Anoll&#x00E9;s</surname> <given-names>G.</given-names></name> <name><surname>Nasir</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Do viruses exchange genes across superkingdoms of life?</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>8</volume>:<issue>02110</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2017.02110</pub-id> <pub-id pub-id-type="pmid">29163404</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mann</surname> <given-names>K. S.</given-names></name> <name><surname>Johnson</surname> <given-names>K. N.</given-names></name> <name><surname>Carroll</surname> <given-names>B. J.</given-names></name> <name><surname>Dietzgen</surname> <given-names>R. G.</given-names></name></person-group> (<year>2016</year>). <article-title>Cytorhabdovirus P protein suppresses RISC-mediated cleavage and RNA silencing amplification in planta.</article-title> <source><italic>Virology</italic></source> <volume>490</volume> <fpage>27</fpage>&#x2013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2016.01.003</pub-id> <pub-id pub-id-type="pmid">26808923</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martinez</surname> <given-names>N.</given-names></name> <name><surname>Ribeiro</surname> <given-names>E. A.</given-names></name> <name><surname>Leyrat</surname> <given-names>C.</given-names></name> <name><surname>Tarbouriech</surname> <given-names>N.</given-names></name> <name><surname>Ruigrok</surname> <given-names>R. W. H.</given-names></name> <name><surname>Jamin</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Structure of the C-Terminal Domain of Lettuce Necrotic Yellows Virus Phosphoprotein.</article-title> <source><italic>J. Virol.</italic></source> <volume>87</volume> <fpage>9569</fpage>&#x2013;<lpage>9578</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.00999-13</pub-id> <pub-id pub-id-type="pmid">23785215</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Masters</surname> <given-names>P. S.</given-names></name> <name><surname>Banerjee</surname> <given-names>A. K.</given-names></name></person-group> (<year>1988</year>). <article-title>Resolution of multiple complexes of phosphoprotein NS with nucleocapsid protein N of vesicular stomatitis virus.</article-title> <source><italic>J. Virol.</italic></source> <volume>62</volume> <fpage>2651</fpage>&#x2013;<lpage>2657</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.62.8.2651-2657.1988</pub-id> <pub-id pub-id-type="pmid">2839692</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mavrakis</surname> <given-names>M.</given-names></name> <name><surname>McCarthy</surname> <given-names>A. A.</given-names></name> <name><surname>Roche</surname> <given-names>S.</given-names></name> <name><surname>Blondel</surname> <given-names>D.</given-names></name> <name><surname>Ruigrok</surname> <given-names>R. W. H.</given-names></name></person-group> (<year>2004</year>). <article-title>Structure and function of the C-terminal domain of the polymerase cofactor of rabies virus</article-title>. <source><italic>J. Mol. Biol</italic></source>. <volume>343</volume>, <fpage>819</fpage>&#x2013;<lpage>831</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2004.08.071</pub-id> <pub-id pub-id-type="pmid">15476803</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Melzer</surname> <given-names>M. J.</given-names></name> <name><surname>Sether</surname> <given-names>D. M.</given-names></name> <name><surname>Borth</surname> <given-names>W. B.</given-names></name> <name><surname>Hu</surname> <given-names>J. S.</given-names></name></person-group> (<year>2012</year>). <article-title>Characterization of a virus infecting Citrus volkameriana with citrus leprosis-like symptoms.</article-title> <source><italic>Phytopathology</italic></source> <volume>102</volume> <fpage>122</fpage>&#x2013;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.1094/PHYTO-01-11-0013</pub-id> <pub-id pub-id-type="pmid">21916557</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Melzer</surname> <given-names>M. J.</given-names></name> <name><surname>Simbajon</surname> <given-names>N.</given-names></name> <name><surname>Carillo</surname> <given-names>J.</given-names></name> <name><surname>Borth</surname> <given-names>W. B.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>A cilevirus infects ornamental hibiscus in Hawaii.</article-title> <source><italic>Arch. Virol.</italic></source> <volume>2013</volume> <fpage>1745</fpage>&#x2013;<lpage>1740</lpage>. <pub-id pub-id-type="doi">10.1007/s00705-013-1745-0</pub-id> <pub-id pub-id-type="pmid">23732930</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>W. A.</given-names></name> <name><surname>Shen</surname> <given-names>R.</given-names></name> <name><surname>Staplin</surname> <given-names>W.</given-names></name> <name><surname>Kanodia</surname> <given-names>P.</given-names></name></person-group> (<year>2016</year>). <article-title>Noncoding RNAs of plant viruses and viroids: Sponges of host translation and RNA interference machinery.</article-title> <source><italic>Mol. Plant Microbe Interact.</italic></source> <volume>29</volume> <fpage>156</fpage>&#x2013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1094/MPMI-10-15-0226-FI</pub-id> <pub-id pub-id-type="pmid">26900786</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morozov</surname> <given-names>S. Y.</given-names></name> <name><surname>Solovyev</surname> <given-names>A. G.</given-names></name></person-group> (<year>2020</year>). <article-title>Small hydrophobic viral proteins involved in intercellular movement of diverse plant virus genomes.</article-title> <source><italic>AIMS Microbiol.</italic></source> <volume>6</volume> <fpage>305</fpage>&#x2013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.3934/microbiol.2020019</pub-id> <pub-id pub-id-type="pmid">33134746</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morozov</surname> <given-names>S. Y.</given-names></name> <name><surname>Lazareva</surname> <given-names>E. A.</given-names></name> <name><surname>Solovyev</surname> <given-names>A. G.</given-names></name></person-group> (<year>2020</year>). <article-title>Sequence Relationships of RNA Helicases and Other Proteins Encoded by Blunervirus RNAs Highlight Recombinant Evolutionary Origin of Kitaviral Genomes.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>11</volume>:<issue>561092</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2020.561092</pub-id> <pub-id pub-id-type="pmid">33193144</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mueller</surname> <given-names>S.</given-names></name> <name><surname>Papamichail</surname> <given-names>D.</given-names></name> <name><surname>Coleman</surname> <given-names>J. R.</given-names></name> <name><surname>Skiena</surname> <given-names>S.</given-names></name> <name><surname>Wimmer</surname> <given-names>E.</given-names></name></person-group> (<year>2006</year>). <article-title>Reduction of the Rate of Poliovirus Protein Synthesis through Large-Scale Codon Deoptimization Causes Attenuation of Viral Virulence by Lowering Specific Infectivity.</article-title> <source><italic>J. Virol.</italic></source> <volume>80</volume> <fpage>9687</fpage>&#x2013;<lpage>9696</lpage>. <pub-id pub-id-type="doi">10.1128/jvi.00738-06</pub-id> <pub-id pub-id-type="pmid">16973573</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Newburn</surname> <given-names>L. R.</given-names></name> <name><surname>White</surname> <given-names>K. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Cis-acting RNA elements in positive-strand RNA plant virus genomes.</article-title> <source><italic>Virology</italic></source> <volume>479&#x2013;480</volume> <fpage>434</fpage>&#x2013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1016/j.virol.2015.02.032</pub-id> <pub-id pub-id-type="pmid">25759098</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ng</surname> <given-names>W. M.</given-names></name> <name><surname>Stelfox</surname> <given-names>A. J.</given-names></name> <name><surname>Bowden</surname> <given-names>T. A.</given-names></name></person-group> (<year>2020</year>). <article-title>Unraveling virus relationships by structure-based phylogenetic classification.</article-title> <source><italic>Virus Evol.</italic></source> <volume>6</volume>:<issue>veaa003</issue>. <pub-id pub-id-type="doi">10.1093/ve/veaa003</pub-id> <pub-id pub-id-type="pmid">32064119</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olmedo-Velarde</surname> <given-names>A.</given-names></name> <name><surname>Hu</surname> <given-names>J.</given-names></name> <name><surname>Melzer</surname> <given-names>M. J.</given-names></name></person-group> (<year>2021</year>). <article-title>A Virus Infecting Hibiscus rosa-sinensis Represents an Evolutionary Link Between Cileviruses and Higreviruses.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>12</volume>:<issue>988</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.660237</pub-id> <pub-id pub-id-type="pmid">34012426</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Omer</surname> <given-names>S.</given-names></name> <name><surname>Harlow</surname> <given-names>T. J.</given-names></name> <name><surname>Gogarten</surname> <given-names>J. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Does Sequence Conservation Provide Evidence for Biological Function?</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>25</volume> <fpage>11</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2016.09.010</pub-id> <pub-id pub-id-type="pmid">27773523</pub-id></citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ouzounis</surname> <given-names>C.</given-names></name> <name><surname>Sander</surname> <given-names>C.</given-names></name> <name><surname>Scharf</surname> <given-names>M.</given-names></name> <name><surname>Schneider</surname> <given-names>R.</given-names></name></person-group> (<year>1993</year>). <article-title>Prediction of protein structure by evaluation of sequence-structure fitness: Aligning sequences to contact profiles derived from three-dimensional structures.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>232</volume> <fpage>805</fpage>&#x2013;<lpage>825</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.1993.1433</pub-id> <pub-id pub-id-type="pmid">8355272</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peluso</surname> <given-names>R.</given-names></name> <name><surname>Moyer</surname> <given-names>S.</given-names></name></person-group> (<year>1988</year>). <article-title>Viral proteins required for the <italic>in vitro</italic> replication of vesicular stomatitis virus defective interfering particle genome RNA.</article-title> <source><italic>Virology</italic></source> <volume>162</volume> <fpage>369</fpage>&#x2013;<lpage>376</lpage>. <pub-id pub-id-type="doi">10.1016/0042-6822(88)90477-1</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ponting</surname> <given-names>C. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Biological function in the twilight zone of sequence conservation.</article-title> <source><italic>BMC Biol.</italic></source> <volume>15</volume>:<issue>71</issue>. <pub-id pub-id-type="doi">10.1186/s12915-017-0411-5</pub-id> <pub-id pub-id-type="pmid">28814299</pub-id></citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Puigb&#x00F2;</surname> <given-names>P.</given-names></name> <name><surname>Aragon&#x00E8;s</surname> <given-names>L.</given-names></name> <name><surname>Garcia-Vallv&#x00E9;</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>RCDI/eRCDI: A web-server to estimate codon usage deoptimization.</article-title> <source><italic>BMC Res. Notes</italic></source> <volume>3</volume>:<issue>87</issue>. <pub-id pub-id-type="doi">10.1186/1756-0500-3-87</pub-id> <pub-id pub-id-type="pmid">20356391</pub-id></citation></ref>
<ref id="B86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quito-Avila</surname> <given-names>D. F.</given-names></name> <name><surname>Brannen</surname> <given-names>P. M.</given-names></name> <name><surname>Cline</surname> <given-names>W. O.</given-names></name> <name><surname>Harmon</surname> <given-names>P. F.</given-names></name> <name><surname>Martin</surname> <given-names>R. R.</given-names></name></person-group> (<year>2013</year>). <article-title>Genetic characterization of Blueberry necrotic ring blotch virus, a novel RNA virus with unique genetic features.</article-title> <source><italic>J. Gen. Virol.</italic></source> <volume>94</volume> <fpage>1426</fpage>&#x2013;<lpage>1434</lpage>. <pub-id pub-id-type="doi">10.1099/vir.0.050393-0</pub-id> <pub-id pub-id-type="pmid">23486668</pub-id></citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quito-Avila</surname> <given-names>D. F.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name> <name><surname>Melzer</surname> <given-names>M. J.</given-names></name></person-group> (<year>2021</year>). <article-title>Bluner-, Cile-, and Higreviruses (Kitaviridae).</article-title> <source><italic>Encycl. Virol.</italic></source> <volume>2021</volume> <fpage>247</fpage>&#x2013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-809633-8.21248-X</pub-id></citation></ref>
<ref id="B88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Chabi-Jesus</surname> <given-names>C.</given-names></name> <name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Tassi</surname> <given-names>A. D.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name> <name><surname>Freitas-Ast&#x00FA;a</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Citrus leprosis: a unique multietiologic disease.</article-title> <source><italic>Citrus Am.</italic></source> <volume>1</volume> <fpage>4</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.6084/m9.figshare.13513857.v2</pub-id></citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Chabi-Jesus</surname> <given-names>C.</given-names></name> <name><surname>Guerra-Peraza</surname> <given-names>O.</given-names></name> <name><surname>Breton</surname> <given-names>M. C.</given-names></name> <name><surname>Arena</surname> <given-names>G. D.</given-names></name> <name><surname>Nunes</surname> <given-names>M. A.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Phylogenetic and molecular variability studies reveal a new genetic clade of Citrus leprosis virus C.</article-title> <source><italic>Viruses</italic></source> <volume>8</volume>:<issue>153</issue>. <pub-id pub-id-type="doi">10.3390/v8060153</pub-id> <pub-id pub-id-type="pmid">27275832</pub-id></citation></ref>
<ref id="B90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramos-Gonz&#x00E1;lez</surname> <given-names>P. L.</given-names></name> <name><surname>Santos</surname> <given-names>G. F.</given-names></name> <name><surname>dos</surname></name> <name><surname>Chabi-Jesus</surname> <given-names>C.</given-names></name> <name><surname>Harakava</surname> <given-names>R.</given-names></name> <name><surname>Kitajima</surname> <given-names>E. W.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Passion Fruit Green Spot Virus Genome Harbors a New Orphan ORF and Highlights the Flexibility of the 5&#x2032;-End of the RNA2 Segment Across Cileviruses.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>11</volume>:<issue>00206</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2020.00206</pub-id> <pub-id pub-id-type="pmid">32117189</pub-id></citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravantti</surname> <given-names>J. J.</given-names></name> <name><surname>Martinez-Castillo</surname> <given-names>A.</given-names></name> <name><surname>Abrescia</surname> <given-names>N. G. A.</given-names></name></person-group> (<year>2020</year>). <article-title>Superimposition of viral protein structures: A means to decipher the phylogenies of viruses.</article-title> <source><italic>Viruses</italic></source> <volume>12</volume>:<issue>v12101146</issue>. <pub-id pub-id-type="doi">10.3390/v12101146</pub-id> <pub-id pub-id-type="pmid">33050291</pub-id></citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rhys</surname> <given-names>G. G.</given-names></name> <name><surname>Wood</surname> <given-names>C. W.</given-names></name> <name><surname>Beesley</surname> <given-names>J. L.</given-names></name> <name><surname>Zaccai</surname> <given-names>N. R.</given-names></name> <name><surname>Burton</surname> <given-names>A. J.</given-names></name> <name><surname>Brady</surname> <given-names>R. L.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Navigating the Structural Landscape of de Novo &#x03B1;-Helical Bundles.</article-title> <source><italic>J. Am. Chem. Soc.</italic></source> <volume>141</volume> <fpage>8787</fpage>&#x2013;<lpage>8797</lpage>. <pub-id pub-id-type="doi">10.1021/jacs.8b13354</pub-id> <pub-id pub-id-type="pmid">31066556</pub-id></citation></ref>
<ref id="B93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rokyta</surname> <given-names>D.</given-names></name> <name><surname>Badgett</surname> <given-names>M. R.</given-names></name> <name><surname>Molineux</surname> <given-names>I. J.</given-names></name> <name><surname>Bull</surname> <given-names>J. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Experimental Genomic Evolution: Extensive Compensation for Loss of DNA Ligase Activity in a Virus.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>19</volume> <fpage>230</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a004076</pub-id> <pub-id pub-id-type="pmid">11861882</pub-id></citation></ref>
<ref id="B94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname> <given-names>A.</given-names></name> <name><surname>Choudhary</surname> <given-names>N.</given-names></name> <name><surname>Guillermo</surname> <given-names>L. M.</given-names></name> <name><surname>Shao</surname> <given-names>J.</given-names></name> <name><surname>Govindarajulu</surname> <given-names>A.</given-names></name> <name><surname>Achor</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>A novel virus of the genus <italic>Cilevirus</italic> causing symptoms similar to citrus leprosis.</article-title> <source><italic>Phytopathology</italic></source> <volume>103</volume> <fpage>488</fpage>&#x2013;<lpage>500</lpage>. <pub-id pub-id-type="doi">10.1094/PHYTO-07-12-0177-R</pub-id> <pub-id pub-id-type="pmid">23268581</pub-id></citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rozas</surname> <given-names>J.</given-names></name> <name><surname>Ferrer-Mata</surname> <given-names>A.</given-names></name> <name><surname>S&#x00E1;nchez-DelBarrio</surname> <given-names>J. C.</given-names></name> <name><surname>Guirao-Rico</surname> <given-names>S.</given-names></name> <name><surname>Librado</surname> <given-names>P.</given-names></name> <name><surname>Ramos-Onsins</surname> <given-names>S. E.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>34</volume> <fpage>3299</fpage>&#x2013;<lpage>3302</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msx248</pub-id> <pub-id pub-id-type="pmid">29029172</pub-id></citation></ref>
<ref id="B96"><citation citation-type="journal"><collab>Rstudio_team</collab>, (<year>2021</year>). <source><italic>RStudio | Open source &#x0026; professional software for data science teams - RStudio.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.rstudio.com/">https://www.rstudio.com/</ext-link> <comment>(accessed June 9, 2021)</comment></citation></ref>
<ref id="B97"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schlub</surname> <given-names>T. E.</given-names></name> <name><surname>Buchmann</surname> <given-names>J. P.</given-names></name> <name><surname>Holmes</surname> <given-names>E. C.</given-names></name></person-group> (<year>2018</year>). <article-title>A simple method to detect candidate overlapping genes in viruses using single genome sequences.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>35</volume> <fpage>2572</fpage>&#x2013;<lpage>2581</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msy155</pub-id> <pub-id pub-id-type="pmid">30099499</pub-id></citation></ref>
<ref id="B98"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharp</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>W.</given-names></name></person-group> (<year>1987</year>). <article-title>The codon Adaptation Index&#x2013;a measure of directional synonymous codon usage bias, and its potential applications.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>15</volume> <fpage>1281</fpage>&#x2013;<lpage>1295</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/15.3.1281</pub-id> <pub-id pub-id-type="pmid">3547335</pub-id></citation></ref>
<ref id="B99"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>M.</given-names></name> <name><surname>Lin</surname> <given-names>X.-D.</given-names></name> <name><surname>Tian</surname> <given-names>J.-H.</given-names></name> <name><surname>Chen</surname> <given-names>L.-J.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>C.-X.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Redefining the invertebrate RNA virosphere.</article-title> <source><italic>Nature</italic></source> <volume>540</volume> <fpage>539</fpage>&#x2013;<lpage>543</lpage>. <pub-id pub-id-type="doi">10.1038/nature20167</pub-id> <pub-id pub-id-type="pmid">27880757</pub-id></citation></ref>
<ref id="B100"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sievers</surname> <given-names>F.</given-names></name> <name><surname>Higgins</surname> <given-names>D. G.</given-names></name></person-group> (<year>2014</year>). <article-title>Clustal omega, accurate alignment of very large numbers of sequences.</article-title> <source><italic>Methods Mol. Biol.</italic></source> <volume>1079</volume> <fpage>105</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-62703-646-7_6</pub-id></citation></ref>
<ref id="B101"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Singh</surname> <given-names>U.</given-names></name> <name><surname>Wurtele</surname> <given-names>E. S.</given-names></name></person-group> (<year>2020</year>). <article-title>Genetic novelty: How new genes are born.</article-title> <source><italic>Elife</italic></source> <volume>9</volume>:<issue>55136</issue>. <pub-id pub-id-type="doi">10.7554/eLife.55136</pub-id> <pub-id pub-id-type="pmid">32072921</pub-id></citation></ref>
<ref id="B102"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Solovyev</surname> <given-names>A. G.</given-names></name> <name><surname>Morozov</surname> <given-names>S. Y.</given-names></name></person-group> (<year>2017</year>). <article-title>Non-replicative Integral Membrane Proteins Encoded by Plant Alpha-Like Viruses: Emergence of Diverse Orphan ORFs and Movement Protein Genes.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>8</volume>:<issue>1820</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2017.01820</pub-id> <pub-id pub-id-type="pmid">29163564</pub-id></citation></ref>
<ref id="B103"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tautz</surname> <given-names>D.</given-names></name> <name><surname>Domazet-Lo&#x0161;o</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>The evolutionary origin of orphan genes.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>12</volume> <fpage>692</fpage>&#x2013;<lpage>702</lpage>. <pub-id pub-id-type="doi">10.1038/nrg3053</pub-id> <pub-id pub-id-type="pmid">21878963</pub-id></citation></ref>
<ref id="B104"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vakirlis</surname> <given-names>N.</given-names></name> <name><surname>Carvunis</surname> <given-names>A.-R.</given-names></name> <name><surname>McLysaght</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Synteny-based analyses indicate that sequence divergence is not the main source of orphan genes.</article-title> <source><italic>Elife</italic></source> <volume>9</volume>:<issue>53500</issue>. <pub-id pub-id-type="doi">10.7554/eLife.53500</pub-id> <pub-id pub-id-type="pmid">32066524</pub-id></citation></ref>
<ref id="B105"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Oss</surname> <given-names>S. B.</given-names></name> <name><surname>Carvunis</surname> <given-names>A. R.</given-names></name></person-group> (<year>2019</year>). <article-title>De novo gene birth.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>15</volume>:<issue>1008160</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1008160</pub-id> <pub-id pub-id-type="pmid">31120894</pub-id></citation></ref>
<ref id="B106"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vasilakis</surname> <given-names>N.</given-names></name> <name><surname>Forrester</surname> <given-names>N. L.</given-names></name> <name><surname>Palacios</surname> <given-names>G.</given-names></name> <name><surname>Nasar</surname> <given-names>F.</given-names></name> <name><surname>Savji</surname> <given-names>N.</given-names></name> <name><surname>Rossi</surname> <given-names>S. L.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Negevirus: a Proposed New Taxon of Insect-Specific Viruses with Wide Geographic Distribution.</article-title> <source><italic>J. Virol.</italic></source> <volume>87</volume> <fpage>2475</fpage>&#x2013;<lpage>2488</lpage>. <pub-id pub-id-type="doi">10.1128/JVI.00776-12</pub-id> <pub-id pub-id-type="pmid">23255793</pub-id></citation></ref>
<ref id="B107"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vignuzzi</surname> <given-names>M.</given-names></name> <name><surname>L&#x00F3;pez</surname> <given-names>C. B.</given-names></name></person-group> (<year>2019</year>). <article-title>Defective viral genomes are key drivers of the virus&#x2013;host interaction.</article-title> <source><italic>Nat. Microbiol.</italic></source> <volume>4</volume> <fpage>1075</fpage>&#x2013;<lpage>1087</lpage>. <pub-id pub-id-type="doi">10.1038/s41564-019-0465-y</pub-id> <pub-id pub-id-type="pmid">31160826</pub-id></citation></ref>
<ref id="B108"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vinokurov</surname> <given-names>K. S.</given-names></name> <name><surname>Koloniuk</surname> <given-names>I.</given-names></name></person-group> (<year>2019</year>). <article-title>Discovery and characterization of a novel alphavirus-like RNA virus from the red firebug Pyrrhocoris apterus L. (Heteroptera).</article-title> <source><italic>J. Invertebr. Pathol.</italic></source> <volume>166</volume>:<issue>107213</issue>. <pub-id pub-id-type="doi">10.1016/j.jip.2019.107213</pub-id> <pub-id pub-id-type="pmid">31260668</pub-id></citation></ref>
<ref id="B109"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>F.</given-names></name></person-group> (<year>1990</year>). <article-title>The &#x201C;effective number of codons&#x201D; used in a gene.</article-title> <source><italic>Gene</italic></source> <volume>87</volume> <fpage>23</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1016/0378-1119(90)90491-9</pub-id></citation></ref>
<ref id="B110"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name></person-group> (<year>2015</year>). <article-title>I-TASSER server: New development for protein structure and function predictions.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>W174</fpage>&#x2013;<lpage>W181</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv342</pub-id> <pub-id pub-id-type="pmid">25883148</pub-id></citation></ref>
<ref id="B111"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>Y.</given-names></name> <name><surname>Fischer</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Identification and investigation of ORFans in the viral world.</article-title> <source><italic>BMC Genomics</italic></source> <volume>9</volume>:<issue>24</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-9-24</pub-id> <pub-id pub-id-type="pmid">18205946</pub-id></citation></ref>
<ref id="B112"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Skolnick</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Scoring function for automated assessment of protein structure template quality.</article-title> <source><italic>Proteins Struct. Funct. Bioinforma.</italic></source> <volume>57</volume> <fpage>702</fpage>&#x2013;<lpage>710</lpage>. <pub-id pub-id-type="doi">10.1002/prot.20264</pub-id> <pub-id pub-id-type="pmid">15476259</pub-id></citation></ref>
<ref id="B113"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zwart</surname> <given-names>M. P.</given-names></name> <name><surname>Willemsen</surname> <given-names>A.</given-names></name> <name><surname>Dar&#x00F2;s</surname> <given-names>J. A.</given-names></name> <name><surname>Elena</surname> <given-names>S. F.</given-names></name></person-group> (<year>2014</year>). <article-title>Experimental evolution of pseudogenization and gene loss in a plant RNA virus.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>31</volume> <fpage>121</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst175</pub-id> <pub-id pub-id-type="pmid">24109604</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://cousin.ird.fr/index.php">http://cousin.ird.fr/index.php</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://ppuigbo.me/programs/CAIcal/">https://ppuigbo.me/programs/CAIcal/</ext-link></p></fn>
<fn id="footnote3">
<label>3</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.kazusa.or.jp/codon/">http://www.kazusa.or.jp/codon/</ext-link></p></fn>
<fn id="footnote4">
<label>4</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.kazusa.or.jp/codon/countcodon.html">http://www.kazusa.or.jp/codon/countcodon.html</ext-link></p></fn>
<fn id="footnote5">
<label>5</label>
<p><ext-link ext-link-type="uri" xlink:href="https://genomevolution.org/coge/">https://genomevolution.org/coge/</ext-link></p></fn>
<fn id="footnote6">
<label>6</label>
<p><ext-link ext-link-type="uri" xlink:href="https://prosite.expasy.org/scanprosite/">https://prosite.expasy.org/scanprosite/</ext-link></p></fn>
<fn id="footnote7">
<label>7</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.uniprot.org/statistics/Swiss-Prot">https://www.uniprot.org/statistics/Swiss-Prot</ext-link></p></fn>
<fn id="footnote8">
<label>8</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.uniprot.org/statistics/TrEMBL">https://www.uniprot.org/statistics/TrEMBL</ext-link></p></fn>
<fn id="footnote9">
<label>9</label>
<p><ext-link ext-link-type="uri" xlink:href="https://zhanglab.ccmb.med.umich.edu/I-TASSER/">https://zhanglab.ccmb.med.umich.edu/I-TASSER/</ext-link></p></fn>
<fn id="footnote10">
<label>10</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.rcsb.org">https://www.rcsb.org</ext-link></p></fn>
<fn id="footnote11">
<label>11</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/msd-srv/ssm/">https://www.ebi.ac.uk/msd-srv/ssm/</ext-link></p></fn>
<fn id="footnote12">
<label>12</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.genome.jp/tools/motif/">https://www.genome.jp/tools/motif/</ext-link></p></fn>
<fn id="footnote13">
<label>13</label>
<p><ext-link ext-link-type="uri" xlink:href="http://bioinf.cs.ucl.ac.uk/psipred/">http://bioinf.cs.ucl.ac.uk/psipred/</ext-link></p></fn>
<fn id="footnote14">
<label>14</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.cbs.dtu.dk/services/TMHMM/">http://www.cbs.dtu.dk/services/TMHMM/</ext-link></p></fn>
</fn-group>
</back>
</article>