<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2017.00037</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Comprehensive Analysis of RALF Proteins in Green Plants Suggests There Are Two Distinct Functional Groups</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Campbell</surname> <given-names>Liam</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/389743/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Turner</surname> <given-names>Simon R.</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/296098/overview"/>
</contrib>
</contrib-group>
<aff><institution>Faculty of Biology, Medicine and Health, School of Biological Science, University of Manchester</institution> <country>Manchester, UK</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Madelaine Elisabeth Bartlett, University of Massachusetts Amherst, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Tatiana Arias, The Corporation for Biological Research, Colombia; Ive De Smet, Flanders Institute for Biotechnology, Belgium</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Simon R. Turner <email>simon.turner&#x00040;manchester.ac.uk</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant Evolution and Development, a section of the journal Frontiers in Plant Science</p></fn></author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>01</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>37</elocation-id>
<history>
<date date-type="received">
<day>03</day>
<month>11</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>01</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Campbell and Turner.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Campbell and Turner</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Rapid Alkalinization Factors (RALFs) are small, cysteine-rich peptides known to be involved in various aspects of plant development and growth. Although RALF peptides have been identified within many species, a single wide-ranging phylogenetic analysis of the family across the plant kingdom has not yet been undertaken. Here, we identified RALF proteins from 51 plant species that represent a variety of land plant lineages. The inferred evolutionary history of the 795 identified RALFs suggests that the family has diverged into four major clades. We found that much of the variation across the family exists within the mature peptide region, suggesting clade-specific functional diversification. Clades I, II, and III contain the features that have been identified as important for RALF activity, including the RRXL cleavage site and the YISY motif required for receptor binding. In contrast, members of clades IV that represent a third of the total dataset, is highly diverged and lacks these features that are typical of RALFs. Members of clade IV also exhibit distinct expression patterns and physico-chemical properties. These differences suggest a functional divergence of clades and consequently, we propose that the peptides within clade IV are not true RALFs, but are more accurately described as RALF-related peptides. Expansion of this RALF&#x02013;related clade in the Brassicaceae is responsible for the large number of RALF genes that have been previously described in <italic>Arabidopsis thaliana</italic>. Future experimental work will help to establish the nature of the relationship between the true RALFs and the RALF-related peptides, and whether they function in a similar manner.</p>
</abstract>
<kwd-group>
<kwd>RALF</kwd>
<kwd>peptide</kwd>
<kwd>development</kwd>
<kwd>phylogeny</kwd>
<kwd>growth</kwd>
<kwd>evolution</kwd>
</kwd-group>
<contract-num rid="cn001">BB/J014478/1</contract-num>
<contract-sponsor id="cn001">Biotechnology and Biological Sciences Research Council<named-content content-type="fundref-id">10.13039/501100000268</named-content></contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="54"/>
<page-count count="14"/>
<word-count count="9886"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Complex, multicellular organisms such as plants use short- and long-distance signaling networks to allow for communication across different regions of the organism. Although this signaling is fundamentally important during plant growth, these networks also function to coordinate systemic responses to environmental stimuli. In recent years, it has emerged that small secreted peptides are a key component of these networks, controlling many critical processes, including the regulation of stem cell division and differentiation, cell expansion, stomatal development and gravitropism (reviewed by Czyzewicz et al., <xref ref-type="bibr" rid="B9">2013</xref>; Delay et al., <xref ref-type="bibr" rid="B10">2013</xref>; Matsubayashi, <xref ref-type="bibr" rid="B32">2014</xref>). One such small peptide family are the Rapid Alkalinization Factors (RALFs), first discovered through their ability to rapidly alkalinize tobacco cell cultures (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>). The canonical peptide, RALF1, was found to arrest tomato and Arabidopsis root growth with no activation of defense pathways (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>), initially suggesting a developmental role for these peptides. However, subsequent studies have identified a wide variety of roles for members of the RALF family, including cell expansion (Haruta et al., <xref ref-type="bibr" rid="B22">2014</xref>), lateral root development (Murphy et al., <xref ref-type="bibr" rid="B37">2016</xref>), root hair growth (Wu et al., <xref ref-type="bibr" rid="B52">2007</xref>), pollen tube elongation (Covey et al., <xref ref-type="bibr" rid="B7">2010</xref>), as well as stress (Atkinson et al., <xref ref-type="bibr" rid="B3">2013</xref>). The diversity of these roles indicates that RALF peptides are fundamentally important for plant development.</p>
<p>RALF proteins are cysteine-rich and typically have a full length of 80&#x02013;120 amino acids. They are translated as a preproprotein containing an N-terminal signal peptide that leads to their secretion and a C-terminal mature peptide with four di-sulfide bond-forming cysteine residues (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>, <xref ref-type="bibr" rid="B40">2010</xref>). AtRALF23 was found to be cleaved at a di-basic RRXL site by the serine protease SITE-1 PROTEASE (AtSIP1), and this processing is essential for proper functioning of the peptide (Srivastava et al., <xref ref-type="bibr" rid="B50">2009</xref>). Until recently, the downstream mechanisms through which RALFs function was unknown, although a conserved YISY motif was known to be required for AtRALF1-receptor binding (Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>). FERONIA (FER), a receptor-kinase from the <italic>Catharantus roseus</italic> RLK1-like (CrRLK1L) subfamily (Lindner et al., <xref ref-type="bibr" rid="B29">2012</xref>), has been identified as a receptor for AtRALF1 and is involved in the rapid alkalization response (Haruta et al., <xref ref-type="bibr" rid="B22">2014</xref>). Knock-out alleles of <italic>fer</italic>, resulting from T-DNA insertion, are insensitive to AtRALF1 treatment. However, co-immunoprecipitation experiments suggest that AtRALF1 also binds to other receptors (Haruta et al., <xref ref-type="bibr" rid="B22">2014</xref>). Whether FER acts as a receptor for other RALFs is currently not clear, and no other RALF receptors have been identified to date. Very recently, the receptor-like cytoplasmic RPM1-induced protein kinase (RIPK) has been identified as an intracellular, interacting partner that is directly phosphorylated by FER and is crucial for the relaying of the RALF1-FER signal (Du et al., <xref ref-type="bibr" rid="B11">2016</xref>).</p>
<p>Consistent with their diverse roles in development and stress responses, RALFs have so far been identified in a variety of species, including monocots, eudicots and early-diverging lineages (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>; Haruta and Constabel, <xref ref-type="bibr" rid="B21">2003</xref>; Germain et al., <xref ref-type="bibr" rid="B14">2005</xref>; Silverstein et al., <xref ref-type="bibr" rid="B49">2007</xref>; Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>; Ghorbani et al., <xref ref-type="bibr" rid="B15">2015</xref>; Sharma et al., <xref ref-type="bibr" rid="B47">2016</xref>). A duplication analysis has previously found that a large percentage of plant RALF proteins have evolved through tandem duplication, with this being responsible for the varying numbers of RALF proteins within previously investigated plant species (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>). This is further demonstrated by the presence of pairs of RALF proteins exhibiting high homology to one another (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>). Intriguingly, biologically-active RALF homologs, typically of RALF1, have also been identified within numerous fungal phytopathogens, with these potentially acting in plant-pathogen interactions (Thynne et al., <xref ref-type="bibr" rid="B51">2016</xref>). The seemingly ubiquitous presence of RALFs across the plant kingdom is further evidence of their general importance. However, no single wide-ranging, species-rich phylogenetic study of the RALF family has yet been undertaken. In an attempt to uncover new insights into the evolutionary history of the RALF family, we here present a comprehensive identification and analysis of RALF proteins from more than 50 diverse green plant proteomes obtained in a consistent format from Phytozome (Goodstein et al., <xref ref-type="bibr" rid="B17">2012</xref>). We reveal that RALFs have diverged into distinct groups, each containing identifiable differences in amino acid sequence. Importantly, much of this sequence diversification exists within the mature peptide region and is likely to define receptor binding and hence biological activity. One of the identified clades, which represents a third of all identified proteins, lacks many typical RALF features. We propose that these do not represent true RALFs and should be considered independently to the more typical RALF proteins.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Identification of RALF proteins</title>
<p>Conserved motifs in RALF proteins (Figure <xref ref-type="supplementary-material" rid="SM3">S1</xref>) were identified using MEME (Bailey and Elkan, <xref ref-type="bibr" rid="B4">1994</xref>) using 37 previously identified RALFS as a query (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>). These motifs were identified in other plant species by scanning with FIMO (Grant et al., <xref ref-type="bibr" rid="B18">2011</xref>). Proteomes for the following species were retrieved from Phytozome v11.0 (Goodstein et al., <xref ref-type="bibr" rid="B17">2012</xref>): <italic>Amaranthus hypochondriacus, Amborella trichopoda, Ananas comosus, Aquilegia coerulea, Arabidopsis halleri, Arabidopsis lyrata, Arabidopsis thaliana, Brachypodium distachyon, Brachypodium stacei, Brassica rapa, Capsella grandiflora, Capsella rubella, Carica papaya, Chlamydomonas reinhardtii, Citrus clementina, Citrus sinensis, Coccomyxa subellipsoidea C-169, Cucumis sativus, Eucalyptus grandis, Eutrema salsugineum, Fragaria vesca, Glycine max, Gossypium raimondii, Linum usitatissimum, Malus domestica, Medicago truncatula, Micromonas pusilla, Mimulus guttatus, Musa acuminate, Oryza sativa, Ostreococcus lucimarinus, Panicum hallii, Panicum virgatum, Phaseolus vulgaris, Physcomitrella patens, Populus trichocarpa, Prunus persica, Ricinus communis, Salix purpurea, Selaginella moellendorffii, Setaria italic, Setaria viridis, Solanum lycopersicum, Solanum tuberosum, Sorghum bicolor, Spirodela polyrhiza, Theobroma cacao, Vitis vinifera, Volvox carteri, Zea mays, and Zostera marina</italic>. Genomic data such as genome size and total gene number were extracted from the relevant publication for each species (see Table <xref ref-type="supplementary-material" rid="SM1">S1</xref>). A stringent <italic>q</italic>-value cut off of 0.05 was used to remove low-scoring hits to the six motifs identified by MEME.</p>
</sec>
<sec>
<title>Construction of RALF alignments and phylogenetic trees</title>
<p>All protein alignments were initially created using the MUSCLE algorithm (Edgar, <xref ref-type="bibr" rid="B12">2004</xref>) within the AliView alignment editor (Larsson, <xref ref-type="bibr" rid="B27">2014</xref>). This was followed by manual optimisation to improve the alignment in regions that had been clearly misaligned by MUSCLE. Approximately-maximum likelihood phylogenetic trees were created for full-length proteins and mature peptides using FastTree v2.1 (Price et al., <xref ref-type="bibr" rid="B42">2010</xref>), with 4 rounds of minimum-evolution SPR moves (option: -spr 4) and exhaustive nearest-neighbor interchanges (options: -mlacc 2 -slownni) to improve accuracy. All other parameters were left as default, including the calculation of local-support values by the Shimodaira-Hasegawa test (Shimodaira and Hasegawa, <xref ref-type="bibr" rid="B48">1999</xref>). The inferred phyloXML trees were viewed in Archaeopteryx v0.9916 (Han and Zmasek, <xref ref-type="bibr" rid="B20">2009</xref>) to identify RALF clades and sub-clades. WebLogo3 (Crooks et al., <xref ref-type="bibr" rid="B8">2004</xref>) (<ext-link ext-link-type="uri" xlink:href="http://weblogo.threeplusone.com/">http://weblogo.threeplusone.com/</ext-link>) was used to provide a visual summary of conserved residues within the alignments. The inferred phylogenetic trees have been uploaded to the TreeBASE repository (Piel et al., <xref ref-type="bibr" rid="B44">2002</xref>) and can be accessed at <ext-link ext-link-type="uri" xlink:href="http://purl.org/phylo/treebase/phylows/study/TB2:S20366">http://purl.org/phylo/treebase/phylows/study/TB2:S20366</ext-link>.</p>
</sec>
<sec>
<title>CLANS pairwise-similarity plots</title>
<p>To perform pairwise BLAST (Altschul et al., <xref ref-type="bibr" rid="B1">1990</xref>) hits between each individual protein, full-length unaligned sequences of all 795 RALFs were uploaded to the online CLANS server (Frickey and Lupas, <xref ref-type="bibr" rid="B13">2004</xref>) that is a part of the MPI Bioinformatics Toolkit (Alva et al., <xref ref-type="bibr" rid="B2">2016</xref>). The BLOSUM80 substitution matrix was used for scoring and the HSP option was enabled. The output from this server was used within the standalone CLANS software to create the 2D similarity plots. A <italic>p</italic>-value threshold of 1e<sup>&#x02212;10</sup> was used to remove low-scoring BLAST searches and a minimum attraction value of 50 was used to restrict the proteins to within a reasonable 2D space. As CLANS is non-deterministic, we ran the analysis many times, each from different initialization states, to confirm that the separation of the proteins was reproducible and consistent. We stopped each analysis when the proteins had become stationary within the 2D plot.</p>
</sec>
<sec>
<title>Protein physico-chemical property prediction</title>
<p>The online multi-cleverMachine tool (Klus et al., <xref ref-type="bibr" rid="B26">2014</xref>) was used to compare the physico-chemical properties of RALF proteins. Unaligned sequences for each clade were uploaded to the server in FASTA format, with &#x0201C;clade IV&#x0201D; proteins being denoted as the positive set and clade I, II, and III proteins as negative sets. Ten scales were used to predict each property and detailed statistical comparisons were viewed using the boxplot and ROC curve functions found within the output screen.</p>
</sec>
<sec>
<title>Expression analysis</title>
<p>The mRNA expression of <italic>A. thaliana</italic> and <italic>Z. mays</italic> RALF genes was analyzed across publically-available RNAseq datasets that are included within Genevestigator (Hruz et al., <xref ref-type="bibr" rid="B23">2008</xref>). A total of 1031 samples across all datasets were analyzed. A clustered heat-map representing the log-2 absolute expression values throughout the anatomy of the plant was obtained using the &#x0201C;Hierarchical Clustering&#x0201D; tool within Genevestigator, with both the genes and conditions subjected to Euclidian-distance clustering.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Identification of RALF proteins from 51 plant genomes</title>
<p>Previous genome-wide identifications and phylogenetic analyses of the RALF family have focused on either six (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>) or four (Sharma et al., <xref ref-type="bibr" rid="B47">2016</xref>) plant species. We sought to broaden this range by taking advantage of the Phytozome v11.0 genomic resource (Goodstein et al., <xref ref-type="bibr" rid="B17">2012</xref>), which provides comprehensive sequence data and accompanying annotation for a variety of green plant species. A full list of the 51 species included in this study can be found in Table <xref ref-type="table" rid="T1">1</xref>. This diverse range of species includes lineages that have been excluded from previous RALF phylogenetic analyses, such as the Rosaceae, and generally allows for a more informative analysis with greater resolution. Additionally, though a small number of RALFs have been previously found in plants such as potato (Germain et al., <xref ref-type="bibr" rid="B14">2005</xref>) and <italic>M. truncatula</italic> (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>; Combier et al., <xref ref-type="bibr" rid="B6">2008</xref>), these studies were performed before a full genome was available, and were instead reliant upon expressed sequence tags (ESTs). Therefore, revisiting such species in light of the numerous full plant genomes now available should allow for a more accurate representation of the RALF family.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>The species analyzed within this study</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Species</bold></th>
<th valign="top" align="left"><bold>Order</bold></th>
<th valign="top" align="left"><bold>Family</bold></th>
<th valign="top" align="left"><bold>Total</bold></th>
<th valign="top" align="center" colspan="4" style="border-bottom: thin solid #000000;"><bold>Clade counts</bold></th>
</tr>
<tr>
<th valign="top" align="left" colspan="4"/>
<th valign="top" align="left"><bold>1</bold></th>
<th valign="top" align="left"><bold>2</bold></th>
<th valign="top" align="left"><bold>3</bold></th>
<th valign="top" align="left"><bold>4</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="8" style="background-color:#bbbdc0"><bold>EUDICOTS</bold></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Amaranthus hypochondriacus</italic></td>
<td valign="top" align="left">Caryophyllales</td>
<td valign="top" align="left">Amaranthaceae</td>
<td valign="top" align="center">12</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Aquilegia coerulea</italic></td>
<td valign="top" align="left">Ranunculales</td>
<td valign="top" align="left">Ranunculaceae</td>
<td valign="top" align="center">12</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis halleri</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">25</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">16</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis lyrata</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">20</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis thaliana</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">37</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">23</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brassica rapa</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">32</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">12</td>
<td valign="top" align="left">9</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Capsella grandiflora</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">24</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">12</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Capsella rubella</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">18</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Carica papaya</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Caricaceae</td>
<td valign="top" align="center">17</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">11</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Citrus clementina</italic></td>
<td/>
<td/>
<td valign="top" align="left">13</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Citrus sinensis</italic></td>
<td valign="top" align="left">Sapindales</td>
<td valign="top" align="left">Rutaceae</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Cucumis sativus</italic></td>
<td valign="top" align="left">Cucurbitales</td>
<td valign="top" align="left">Cucurbitaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Eucalyptus grandis</italic></td>
<td valign="top" align="left">Myrtales</td>
<td valign="top" align="left">Myrtaceae</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Eutrema salsugineum</italic></td>
<td valign="top" align="left">Brassicales</td>
<td valign="top" align="left">Brassicaceae</td>
<td valign="top" align="center">35</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">19</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Fragaria vesca</italic></td>
<td valign="top" align="left">Rosales</td>
<td valign="top" align="left">Rosaceae</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Glycine max</italic></td>
<td valign="top" align="left">Fabales</td>
<td valign="top" align="left">Fabaceae</td>
<td valign="top" align="center">23</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Gossypium raimondii</italic></td>
<td valign="top" align="left">Malvales</td>
<td valign="top" align="left">Malvaceae</td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">12</td>
<td valign="top" align="left">11</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Linum usitatissimum</italic></td>
<td valign="top" align="left">Malpighiales</td>
<td valign="top" align="left">Linaceae</td>
<td valign="top" align="center">20</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Malus domestica</italic></td>
<td valign="top" align="left">Rosales</td>
<td valign="top" align="left">Rosaceae</td>
<td valign="top" align="center">33</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">14</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">6</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Medicago truncatula</italic></td>
<td valign="top" align="left">Fabales</td>
<td valign="top" align="left">Fabaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Mimulus guttatus</italic></td>
<td valign="top" align="left">Lamiales</td>
<td valign="top" align="left">Phrymaceae</td>
<td valign="top" align="center">17</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">4</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Phaseolus vulgaris</italic></td>
<td valign="top" align="left">Fabales</td>
<td valign="top" align="left">Fabaceae</td>
<td valign="top" align="center">9</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Populus trichocarpa</italic></td>
<td valign="top" align="left">Malpighiales</td>
<td valign="top" align="left">Salicaceae</td>
<td valign="top" align="center">20</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Prunus persica</italic></td>
<td valign="top" align="left">Rosales</td>
<td valign="top" align="left">Rosaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ricinus communis</italic></td>
<td valign="top" align="left">Malpighiales</td>
<td valign="top" align="left">Euphorbiaceae</td>
<td valign="top" align="center">18</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">9</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Salix purpurea</italic></td>
<td valign="top" align="left">Malpighiales</td>
<td valign="top" align="left">Salicaceae</td>
<td valign="top" align="center">32</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">11</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Solanum lycopersicum</italic></td>
<td valign="top" align="left">Solanales</td>
<td valign="top" align="left">Solanaceae</td>
<td valign="top" align="center">8</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">4</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Solanum tuberosum</italic></td>
<td valign="top" align="left">Solanales</td>
<td valign="top" align="left">Solanaceae</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">4</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Theobroma cacao</italic></td>
<td valign="top" align="left">Malvales</td>
<td valign="top" align="left">Malvaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Vitis vinifera</italic></td>
<td valign="top" align="left">Vitales</td>
<td valign="top" align="left">Vitaceae</td>
<td valign="top" align="center">4</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8" style="background-color:#bbbdc0"><bold>MONOCOTS</bold></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ananas comosus</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Bromeliaceae</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brachypodium distachyon</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">10</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">9</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brachypodium stacei</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">11</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">10</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Musa acuminata</italic></td>
<td valign="top" align="left">Zingiberales</td>
<td valign="top" align="left">Musaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">7</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">6</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Oryza sativa</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">14</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">13</td>
<td valign="top" align="left">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Panicum hallii</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">13</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">8</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Panicum virgatum</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">31</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">22</td>
<td valign="top" align="left">9</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Setaria italica</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">15</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">9</td>
<td valign="top" align="left">6</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Setaria viridis</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">15</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">9</td>
<td valign="top" align="left">6</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Sorghum bicolor</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">16</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Spirodela polyrhiza</italic></td>
<td valign="top" align="left">Alismatales</td>
<td valign="top" align="left">Araceae</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Zea mays</italic></td>
<td valign="top" align="left">Poales</td>
<td valign="top" align="left">Poaceae</td>
<td valign="top" align="center">20</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">15</td>
<td valign="top" align="left">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Zostera marina</italic></td>
<td valign="top" align="left">Alismatales</td>
<td valign="top" align="left">Zosteraceae</td>
<td valign="top" align="center">7</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8" style="background-color:#bbbdc0"><bold>EARLY-DIVERGING ANGIOSPERM</bold></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Amborella trichopoda</italic></td>
<td/>
<td/>
<td valign="top" align="center">9</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">5</td>
<td valign="top" align="left">3</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8" style="background-color:#bbbdc0"><bold>EARLY-DIVERGING PLANTS</bold></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Physcomitrella patens</italic></td>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">2</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Selaginella moellendorffii</italic></td>
<td/>
<td/>
<td valign="top" align="center">1</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">1</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8" style="background-color:#bbbdc0"><bold>CHLOROPHYTES</bold></td>
</tr>
<tr>
<td valign="top" align="left"><italic>Chlamydomonas reinhardtii</italic></td>
<td/>
<td/>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Coccomyxa subellipsoidea C-169</italic></td>
<td/>
<td/>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Micromonas pusilla</italic></td>
<td/>
<td/>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ostreococcus lucimarinus</italic></td>
<td/>
<td/>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Volvox carteri</italic></td>
<td/>
<td/>
<td valign="top" align="center">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">0</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The total number of identified RALFs and the number found within each clade are shown</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>To find RALF proteins across the 51 included species, MEME (Bailey and Elkan, <xref ref-type="bibr" rid="B4">1994</xref>) was first used to detect up to six conserved amino acid motifs within the previously identified 37 <italic>A. thaliana</italic> RALFs (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>; Figure <xref ref-type="supplementary-material" rid="SM3">S1</xref>). This was followed by the identification of regions matching to these motifs across the proteome of all 51 species using FIMO (Grant et al., <xref ref-type="bibr" rid="B18">2011</xref>). We discovered a total of 795 RALFs, with a breakdown of the number per species shown in Table <xref ref-type="table" rid="T1">1</xref>. None of the analyzed proteomes were found to contain more than the 37 RALFs identified in <italic>A. thaliana</italic>, though there are other eudicot species with large numbers of RALFs, such as the 33 found in the apple (<italic>M. domestica</italic>) proteome. In general, the eudicots contain more RALF proteins on average (&#x0007E;20) than the monocots (&#x0007E;14), however, the angiosperm with the fewest number of RALFs is the early-diverging rosid domesticated grape (<italic>V. vinifera</italic>), which only has four RALF proteins.</p>
</sec>
<sec>
<title>Rapid expansion of the RALF family in the Brassicaceae</title>
<p>The difference in the average number of RALF genes between the monocots and eudicots could mean that the genetic mechanisms underlying the evolution of the RALF family was distinct in each group. On the other hand, such differences may simply be a consequence of the eudicots analyzed having larger genomes than the monocots. To investigate this, for each species we compared the number of RALF genes to the total number of genes in the genome (Figure <xref ref-type="fig" rid="F1">1</xref>), genome size (Mbp; Figure <xref ref-type="supplementary-material" rid="SM3">S2</xref>), and gene density (genes/Mbp). We found a strong positive correlation (<italic>r</italic> &#x0003D; 0.66) between the number of RALFs and genome size for monocots, but a weak negative correlation (<italic>r</italic> &#x0003D; &#x02212;0.13) for eudicots. Furthermore, the number of RALF genes correlates very strongly (<italic>r</italic> &#x0003D; 0.93) with the total number of genes across the monocots, but there is only a weak correlation (<italic>r</italic> &#x0003D; 0.32) for the eudicots. This data suggests that in monocots, RALF diversification has occurred at a very consistent rate that is proportional to overall changes in genome size, such as those caused by genome duplications. In eudicots however, some species appear to have far more RALFs than can be explained by expansion of the genome alone. These have been circled on Figure <xref ref-type="fig" rid="F1">1</xref> and Figure <xref ref-type="supplementary-material" rid="SM3">S2</xref>. Interestingly, we noted that five of these species belong to the Brassicaceae. When the Brassicaceae are omitted from this analysis, the number of RALF genes correlates more strongly with the genome size (<italic>r</italic> &#x0003D; 0.31) and gene number (<italic>r</italic> &#x0003D; 0.69) for eudicots, coefficients that are much higher than before but still below those of the monocots.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>The relationship between the total number of genes within the genome and the number of identified RALFs for the monocots (red) and eudicots (blue)</bold>. Species with unusually high numbers of RALFs based upon their genome content are circled.</p></caption>
<graphic xlink:href="fpls-08-00037-g0001.tif"/>
</fig>
</sec>
<sec>
<title>The RALF family has diverged into four major clades</title>
<p>In order to understand the evolution of the RALF family in more detail we aligned the 795 identified RALF protein sequences and inferred a phylogenetic tree The tree separates into four major clades (Figure <xref ref-type="fig" rid="F2">2</xref>) that consistently shows very high support values (&#x0003E;0.8; Figure <xref ref-type="fig" rid="F2">2A</xref>) Clade III is the largest of the four major clades with 320 members, followed by clade IV with 264, clade II with 151 and clade I with only 49 proteins. All four clades contain RALFs from a variety of species, including both monocots and eudicots (Table <xref ref-type="table" rid="T1">1</xref>), suggesting that all clades evolved before the divergence of these two angiosperm lineages. However, 90% of the genes within clades I and II are from eudicot species, with the Poaceae (grasses) being entirely absent from these clades. Conversely, there is a notable overrepresentation of monocots within clade III, as 70% of the monocot RALFs are found here.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>An unrooted approximately-maximum likelihood phylogenetic tree of the 795 aligned RALF proteins from 51 plant species. (A)</bold> A simplified cladogram representing the high-level splits of the tree, with the four major clades denoted by a number and sub-clades denoted by a letter. Local support values of these splits are provided, as calculated by the Shimodaira-Hasegawa test. <bold>(B)</bold> The full tree inferred from the 795 proteins. Pink, red, green and blue colors indicate clades I, II, III, and IV respectively and sub-clades are shaded appropriately.</p></caption>
<graphic xlink:href="fpls-08-00037-g0002.tif"/>
</fig>
<p>Nine RALFs were identified within the <italic>A. trichopoda</italic> proteome, the most basal angiosperm (Zuccolo et al., <xref ref-type="bibr" rid="B54">2011</xref>). Only 4 of the 43 angiosperm species studied here contain fewer than nine RALFs, meaning that there has been a general diversification, rather than a contraction, of the RALF family within almost all lineages since the early beginnings of the angiosperms. In support of this, <italic>A. trichopoda</italic> contains RALF proteins belonging to clades II, III and IV, explaining why these clades are represented across both the monocots and eudicots. The early-diverging species <italic>P. patens</italic> and <italic>S. moellendorffii</italic> have fewer RALF genes than <italic>A. trichopoda</italic> and these are instead found within clade III. We could identify no RALF genes within the five chlorophyte species (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<p>Clades II, III, and IV can be further split into distinct sub-clades, with strong local support (Figure <xref ref-type="fig" rid="F2">2</xref>). Each of the nine sub-clades contains a range of species from across the angiosperms (Table <xref ref-type="table" rid="T2">2</xref>), suggesting that these sub-clades evolved within the ancient angiosperms. This is evidenced by the spread of the 9 <italic>A. trichopoda</italic> RALFs across 6 of the 10 sub-clades. However, clade III(D) is almost entirely absent from the eudicots, but in monocots has expanded to become the most prevalent clade. How each of these subclades differs in terms of amino acid sequence will be considered below.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>The number of RALF proteins found within each subclade per species</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Species</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
<th valign="top" align="center"><bold>II</bold></th>
<th valign="top" align="center"><bold>II</bold></th>
<th valign="top" align="center"><bold>III</bold></th>
<th valign="top" align="center"><bold>III</bold></th>
<th valign="top" align="center"><bold>III</bold></th>
<th valign="top" align="center"><bold>III</bold></th>
<th valign="top" align="center"><bold>IV</bold></th>
<th valign="top" align="center"><bold>IV</bold></th>
<th valign="top" align="center"><bold>IV</bold></th>
</tr>
<tr>
<th/>
<th/>
<th valign="top" align="center"><bold>A</bold></th>
<th valign="top" align="center"><bold>B</bold></th>
<th valign="top" align="center"><bold>A</bold></th>
<th valign="top" align="center"><bold>B</bold></th>
<th valign="top" align="center"><bold>C</bold></th>
<th valign="top" align="center"><bold>D</bold></th>
<th valign="top" align="center"><bold>A</bold></th>
<th valign="top" align="center"><bold>B</bold></th>
<th valign="top" align="center"><bold>C</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>Amaranthus hypochondriacus</italic></td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Amborella trichopoda</italic></td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ananas comosus</italic></td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Aquilegia coerulea</italic></td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis halleri</italic></td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">12</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis lyrata</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">12</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Arabidopsis thaliana</italic></td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">14</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brachypodium distachyon</italic></td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brachypodium stacei</italic></td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Brassica rapa</italic></td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Capsella grandiflora</italic></td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Capsella rubella</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">13</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Carica papaya</italic></td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Chlamydomonas reinhardtii</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Citrus clementina</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Citrus sinensis</italic></td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Coccomyxa subellipsoidea C-169</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Cucumis sativus</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Eucalyptus grandis</italic></td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Eutrema salsugineum</italic></td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">13</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Fragaria vesca</italic></td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Glycine max</italic></td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Gossypium raimondii</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Linum usitatissimum</italic></td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Malus domestica</italic></td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Medicago truncatula</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Micromonas pusilla</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Mimulus guttatus</italic></td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Musa acuminata</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Oryza sativa</italic></td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ostreococcus lucimarinus</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Panicum hallii</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Panicum virgatum</italic></td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Phaseolus vulgaris</italic></td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Physcomitrella patens</italic></td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Populus trichocarpa</italic></td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">5</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Prunus persica</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Ricinus communis</italic></td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Salix purpurea</italic></td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Selaginella moellendorffii</italic></td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Setaria italica</italic></td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Setaria viridis</italic></td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Solanum lycopersicum</italic></td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Solanum tuberosum</italic></td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Sorghum bicolor</italic></td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Spirodela polyrhiza</italic></td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Theobroma cacao</italic></td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Vitis vinifera</italic></td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Volvox carteri</italic></td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Zea mays</italic></td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left"><italic>Zostera marina</italic></td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Pairwise-similarity approaches support the existence of distinct RALF clades</title>
<p>It has been established that using a large number of aligned sequences does not necessarily lead to an accurate phylogenetic tree (Philippe et al., <xref ref-type="bibr" rid="B43">2011</xref>). In order to assess the accuracy of our tree and to further investigate the relationship between the various clades, we sought an alternate method of visualizing protein relationships. CLANS performs all-against-all BLAST searches upon an unaligned dataset and returns a non-deterministic 2D or 3D map in which protein similarity can be visualized (Frickey and Lupas, <xref ref-type="bibr" rid="B13">2004</xref>). Figure <xref ref-type="fig" rid="F3">3</xref> shows a typical CLANS output of the 795 full-length RALF sequences, in which each protein is represented by a colored dot. The color of each dot denotes the sub-clade that the protein fell into within the phylogenetic tree (Figure <xref ref-type="fig" rid="F2">2B</xref>) and the distance between the proteins signifies their similarity. Of note is the high level of correspondence between CLANS and the phylogenetic tree, demonstrated by scarce intermixing of proteins from different major clades within the 2D space. Hence, CLANS independently finds that RALFs generally have more sequence similarity with other proteins from the same clade than they do with members of other clades. This confirms that the major splits of the phylogenetic tree (Figure <xref ref-type="fig" rid="F2">2B</xref>) represent detectable differences in the underlying amino acid sequences. Clade IV RALFs appear to be the most diverged, as they are mostly restricted to the periphery of the CLANS map. Clade I, II and III instead fall closely together, with the central mix of clade I and II proteins suggesting that some RALFs within these clades have very similar sequences.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>A CLANS analysis of RALF protein similarity</bold>. Each of the 795 full-length protein sequences is represented by a colored dot that relates to the placement of that protein within the phylogenetic tree clades and subclades. Proteins that are closer within the 2D space are considered to have more sequence similarity.</p></caption>
<graphic xlink:href="fpls-08-00037-g0003.tif"/>
</fig>
<p>Furthermore, RALFs consistently cluster within the CLANS plot (Figure <xref ref-type="fig" rid="F3">3</xref>) into their respective sub-clades, signifying that these sub-clades also represent distinguishable variations in sequence. Of interest is the separation of clades II(A) and II(B), with II(A) intermixing with clade I whilst II(B) clusters slightly further away. This is in contrast with the inferred phylogenetic tree and suggests that clades I and II(A) share more sequence similarity than either does with clade II(B).</p>
</sec>
<sec>
<title>The clades represent divergence in the mature peptide sequence</title>
<p>We questioned whether the separation of the 795 identified RALFs into four clear clades was due to variations within the mature, functional peptide sequence or because of residues outside of this region that are perhaps subject to less selection pressure. We removed the N-terminus region from the aligned sequences, leaving only the YISY motif and all downstream residues. Although the beginning of the peptide is thought to be a few residues upstream of the YISY motif (Matos et al., <xref ref-type="bibr" rid="B31">2008</xref>), variation within this region makes it difficult to accurately identify the start of the peptide and hence these residues were omitted for simplicity. A CLANS analysis of this peptide region (Figure <xref ref-type="fig" rid="F4">4A</xref>) produces a very similar distribution to the full-length preproprotein CLANS, suggesting that residues within the mature peptide have sufficiently diverged alongside those outside of this region. Of note, the RALFs found within clade IV on the full-length tree again fall to the periphery of the peptide-only CLANS plot, demonstrating substantial differences within their peptide sequence compared to the other clades. Conversely, there is a tight cluster of peptides from the full-length clades I, II, and III, highlighting the similarities of these peptides. In light of this, we aligned the mature peptide region from clades I, II, and III and created an additional approximate-maximum likelihood tree to see whether the peptides from these three clades could be distinguished. The tree (Figure <xref ref-type="fig" rid="F4">4B</xref>) reliably separated the clade III mature peptides from those in clades I and II, which appear to be mostly indistinguishable. The clear overlap between the phylogenetic analyses of the full-length preproprotein and mature peptide suggests that there have been divergences across the whole length of the protein, including the functional peptide region.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Divergence of the mature peptide region. (A)</bold> A CLANS sequence similarity analysis of the region downstream of the YISY motif from 795 RALF proteins. The colored dots represent the placement of each peptide within the full-length preproprotein tree (Figure <xref ref-type="fig" rid="F2">2</xref>), as denoted by the key. <bold>(B)</bold> An approximate-maximum likelihood tree of the mature RALF peptides placed within the full-length preproprotein clades I, II, and III. As a measure of the correspondence between the two trees, the numbers to the side of each clade show the percentage of RALFs that are placed within clade I/clade II/clade III on the full-length tree, respectively.</p></caption>
<graphic xlink:href="fpls-08-00037-g0004.tif"/>
</fig>
</sec>
<sec>
<title>Clade-specific variations in protein sequence</title>
<p>As both methods of assessing protein similarity broadly agreed that the RALF family has diverged into distinct groups, we carried out a more detailed analysis of the underlying protein sequences. We aligned individual clades in order to identify any distinguishing characteristics of each. Inspection of the underlying sequences reveals that the proteins of clades I and II(A) are indeed very similar. As shown in Figure <xref ref-type="fig" rid="F5">5A</xref>, the consensus mature peptide sequence of clade II(B) is much distinct from clades I and II(A), lacking a conserved YYNC motif whilst containing additional proline residues toward the C-terminus. Such differences are consistent with the relative placement of these three clade/sub-clades within the CLANS output (Figure <xref ref-type="fig" rid="F3">3</xref>).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Divergence of RALF protein sequences across the four major phylogenetic clades. (A)</bold> WebLogo3 plots to demonstrate residue conservation within the mature peptide region of the four major clades. Clade I and II (A) are shown together as the mature peptides of these sub-clades are very similar. <bold>(B)</bold> A schematic representation of the motif structure of RALF proteins from clades I, II and III in comparison to the shorter RALFs of clade IV. Proteins from both clades contain an N-terminal signal peptide, but the variable and acidic region downstream of this is much shorter and frequently absent within the clade IV RALFs. The di-basic mature peptide cleavage site is absent within clade IV, suggesting an alternate cleavage mechanism. Additionally, the YISY motif thought to be required for receptor binding is highly variable within clade IV RALFs and many proteins in this clade do not contain the second of the four typically conserved cysteine residues.</p></caption>
<graphic xlink:href="fpls-08-00037-g0005.tif"/>
</fig>
<p>The RALFs that occupy clade III are also very similar to those of clade I, in agreement with their close proximity in Figure <xref ref-type="fig" rid="F3">3</xref>. However, these proteins appear to have diversified somewhat from the remarkably well-conserved RALFs that belong to clades I and II(A). There are no clear characteristics that can distinguish clade III as a whole, demonstrated by the amount of variation at many residue positions of the mature peptide (Figure <xref ref-type="fig" rid="F5">5A</xref>). Instead, each of the four subclades has seemingly diversified differently, though there is noticeable diversification even within each subclade. The most distinguishing variations are found within the mature peptide region, as can be seen in Figure <xref ref-type="supplementary-material" rid="SM3">S3</xref>. Obvious examples include the insertion of a proline residue at position 29 within clade III(D), and the insertion of an alanine at position nine of clade III(C) RALFs. Additionally, the CRG motif that occupies the three terminal residues of almost all clade III(D) peptides is entirely unique to that sub-clade. Notably, clades III(A/B) commonly contain an additional di-basic site upstream of the YISY motif that is mostly absent from clade III(C) and entirely absent from clade III(D). It is possible that the presence of this di-basic site within close proximity to the di-basic RALF cleavage site (RRIL; Matos et al., <xref ref-type="bibr" rid="B31">2008</xref>) could affect the processing of these preproproteins.</p>
</sec>
<sec>
<title>Clade IV RALFs are distinct and divergent</title>
<p>Whereas clades I, II, and III all possess the conserved YISY motif that is thought to be responsible for the binding of the peptide to its receptor (Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>), remarkably this motif is rarely found within clade IV. Only 3/264 clade IV RALFs contain &#x0201C;YISY,&#x0201D; with the remainder showing a diverse range of substitutions, though many still contain an isoleucine and tyrosine at the second and fourth positions (XIXY). This lack of YISY conservation can be visualized in Figure <xref ref-type="fig" rid="F5">5A</xref>. Furthermore, many other typically-conserved residues are absent from this clade. The RRXL protease cleavage site, found upstream of YISY within the vast majority of clade I, II, and III RALFs, is almost entirely absent across the 264 clade IV proteins. This likely means that these RALFs are processed and cleaved through a different mechanism, if at all. Likewise, the acidic (glutamate/aspartate) region usually found between the signal peptide and the mature peptide is missing, with this probably impacting substantially upon the protein&#x00027;s structure and stability. In fact, only a minority of peptide residues are conserved within clade IV, with most residues being extremely variable (Figure <xref ref-type="fig" rid="F5">5A</xref>). The absence of these motifs and other residues results in the clade IV RALFs having a mean length of only 88 amino acids, in contrast to the other clades (Figure <xref ref-type="fig" rid="F5">5B</xref>), which contain RALF proteins with an average length of 125 amino acids. The missing regions likely explain the position of clade IV RALFs at the periphery of the CLANS 2D plot, away from the central zone occupied by the other three clades (Figure <xref ref-type="fig" rid="F3">3</xref>). The higher frequency of RALF proteins within the Brassicaceae is specifically due to an overrepresentation of clade IV RALFs, with this clade representing 56% of the RALF proteins within the Brassicaceae species analyzed, in comparison to the average representation of 34% across all 51 species.</p>
</sec>
<sec>
<title>Clade IV has distinct physico-chemical properties and expression patterns</title>
<p>We questioned whether the distinctive sequence patterns of the clade IV RALFs are likely to have any significant impact upon the physico-chemical properties of the translated protein. CleverMachine (Klus et al., <xref ref-type="bibr" rid="B26">2014</xref>) allows for the detection of protein properties that differ between two datasets and has been used to distinguish P-bodies and stress granules from other globular proteins (Marchese et al., <xref ref-type="bibr" rid="B30">2016</xref>) and to classify homo-repeat proteins (Yu Lobanov et al., <xref ref-type="bibr" rid="B53">2016</xref>). A multi-cleverMachine property prediction and comparison of the RALFs reveals that the clade IV proteins differ in a variety of physico-chemical properties from the other clades (Figure <xref ref-type="supplementary-material" rid="SM3">S4</xref>), such as a reduced disorder propensity. The analysis revealed that most properties could distinguish the clade IV proteins with high accuracy, as demonstrated by the typical area under the ROC curves being &#x0003E;0.9, a score typically considered to indicate a highly accurate test (Greiner et al., <xref ref-type="bibr" rid="B19">2000</xref>).</p>
<p>To identify whether genes within clades might have a common function and hence share similar expression profiles, we analyzed the expression of each <italic>A. thaliana</italic> RALF gene across various publically-available RNAseq datasets using Genevestigator (Hruz et al., <xref ref-type="bibr" rid="B23">2008</xref>; Figure <xref ref-type="fig" rid="F6">6</xref>). We found that the expression of clade IV RALFs is almost exclusively restricted to inflorescence tissues, with only 1 of the 23 <italic>A. thaliana</italic> clade IV genes (AT4G14020) showing notable levels of expression within other anatomical regions such as the root. In contrast, genes from other clades exhibit a more widespread expression profile and expression is found within the root and shoot as well as flowers, with the specific exception of the subclade III(B) which has a very similar expression pattern to the clade IV genes. This data further suggests that there have been functional diversifications between clades and genes within a clade share expression patterns. We found similar expression patterns within <italic>Z. mays</italic> (Figure <xref ref-type="supplementary-material" rid="SM3">S5</xref>), with the clade IV genes again being restricted to the inflorescence tissues.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Clustered mRNA expression values of 37 <italic><bold>Arabidopsis thaliana</bold></italic> RALF genes across a variety of tissues</bold>. Each gene is colored according to its phylogenetic clade (see key). Asterisks indicate the five genes belonging to the sub-clade III(B).</p></caption>
<graphic xlink:href="fpls-08-00037-g0006.tif"/>
</fig>
</sec>
<sec>
<title>Diverse C-terminals of RALF peptides</title>
<p>Whereas most residues within the mature peptide are highly conserved, there exists a great deal of variation at the C-terminus. Although the majority of RALFs terminate with an RCRR motif, many have additional residues downstream. The composition of these residues is highly variable and each is usually restricted to a few closely related species, suggesting that these are relatively recent additions to the protein. A selection of these is shown in Figure <xref ref-type="supplementary-material" rid="SM3">S6</xref> to demonstrate their variability. The longest of these are found within <italic>L. usitatissimum</italic>, which contains two RALF proteins with lysine-rich C-terminal tails that are over 250 residues in length. We also identified a <italic>F. vesca</italic> gene (gene10567-v1.0-hybrid) which is a fusion of an N-terminus LOW PSII ACCUMULATION1 (LPA1) homolog to a C-terminus RALF protein. In Arabidopsis, LPA1 is known to be involved in the assembly of photosystem II (Peng et al., <xref ref-type="bibr" rid="B41">2006</xref>). Finally, whereas some CLAVATA3/ESR-related (CLE) and C-TERMINALLY ENCODED PEPTIDE (CEP) genes containing multiple peptide motifs have been reported (Oelkers et al., <xref ref-type="bibr" rid="B38">2008</xref>; Sawa et al., <xref ref-type="bibr" rid="B46">2008</xref>; Roberts et al., <xref ref-type="bibr" rid="B45">2013</xref>), we could find no evidence of RALF genes that contain more than one RALF peptide motif.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this study, we undertook a comprehensive identification and analysis of the RALF protein family across 51 plant species and more than 20 families. Previously published phylogenetic analyses of the RALF family (Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>; Sharma et al., <xref ref-type="bibr" rid="B47">2016</xref>) have been limited by low species numbers (six and four, respectively), which thereby restricts their inferred evolutionary history. By including a wider variety of species in the analysis, our data should provide a more accurate representation of RALF evolution with greater resolution. We found a widespread presence of RALFs across the land plants, with the eudicots containing more RALF members on average compared to the monocots, although this is partially due to differences in genome size. This is in accordance with a previous study that found cysteine-rich small peptides such as the RALFs have generally diversified more in eudicots than monocots (Silverstein et al., <xref ref-type="bibr" rid="B49">2007</xref>). Cao and Shi (<xref ref-type="bibr" rid="B5">2012</xref>) predicted that the most recent common ancestor of the monocots and eudicots contained two RALF proteins, based upon their identification of RALFs in <italic>A. thaliana</italic>, poplar, rice, and maize. However, we found that the early-diverging angiosperm <italic>A. trichopoda</italic> has nine RALFs, suggesting a much more widespread presence of RALF proteins within the early-diverging flowering plants than previously anticipated. We found marginally reduced estimated numbers of maize, poplar, and rice RALF proteins than Cao and Shi (<xref ref-type="bibr" rid="B5">2012</xref>) and Sharma et al. (<xref ref-type="bibr" rid="B47">2016</xref>). This is likely due to a more stringent cut-off point during our initial FIMO searches, with the benefit of our more conservative analysis being that our identified RALFs are highly likely to be genuine. Conversely, a number of the putative RALFs identified by Sharma et al. (<xref ref-type="bibr" rid="B47">2016</xref>) using BLAST, such as Os04g28520, appear to have very little sequence similarity to typical RALFs, to the extent that they are unlikely to be actual members of the RALF family. Additionally, a Pfam database entry exists for the RALF family (PF05498). Where the same species were analyzed by both methods, we found good correspondence between the Pfam and MEME/FIMO datasets (see Table <xref ref-type="supplementary-material" rid="SM2">S2</xref>). We found that a small number of highly diverged proteins were identified as RALFs by Pfam that were below the cut-off threshold of our study. Conversely, a similar number of proteins were identified by our method that Pfam did not identify, including an additional <italic>A. thaliana</italic> RALF. For the 29 species common to both methods, 511 RALFs were identified by Pfam in comparison to the 500 identified by MEME/FIMO, representing a minor 2% difference, suggesting that the two methods are broadly comparable in this instance. This validation of our approach allowed us to confidently apply our method across the wider range of plant species found in the 51 genomes available in Phytozome. Very recently an analogous study for the CLV3/ESR-related (CLE) family was published, in which CLE proteins were detected across all species available in Phytozome and distinct groups identified using CLANS (Goad et al., <xref ref-type="bibr" rid="B16">2016</xref>). Their methods differed from ours in that Hidden-Markov Models (HMMs) were used by Goad et al. (<xref ref-type="bibr" rid="B16">2016</xref>) for initial peptide identification, as opposed to MEME/FIMO. However, as Pfam also uses HMMs for protein detection and our results are in good agreement with Pfam, it would seem that both methods represent valid approaches for the identification of small secretory peptides with similar accuracy.</p>
<p>Our inability to detect RALFs within the chlorophytes (green algae) is consistent with a previous study that could not identify small secretory peptides within these organisms (Ghorbani et al., <xref ref-type="bibr" rid="B15">2015</xref>). This means that the earliest origins of the RALF family occurred after the evolution of the embryophytes (land plants). It may be that the evolution of secreted extracellular peptides allowed for the more complex, larger body plans found within the land plants. Secreted peptides are commonly associated with the local communication and control of cell proliferation, growth and differentiation (Meng et al., <xref ref-type="bibr" rid="B33">2012</xref>), and the relative simplicity of the chlorophytes seemingly does not require such signaling. The recent identification of RALFs within fungi (Thynne et al., <xref ref-type="bibr" rid="B51">2016</xref>) suggests that RALF pathways can be hijacked for the benefit of pathogens, further demonstrating their general importance within plant development. Whether these fungal RALFs originated through horizontal gene transfer or co-evolution is not yet clear. In contrast, the ubiquitous conservation of RALFs within every embryophyte species analyzed to date, in combination with experimentally verified roles in diverse processes such as root growth and pollen germination (reviewed by Murphy and De Smet, <xref ref-type="bibr" rid="B36">2014</xref>), would suggest that the RALF family are core regulators of land plant growth and development. On the other hand, the widespread presence of RALF proteins within early-diverging lineages suggests that RALFs did not first emerge alongside any core aspects of plant development, such as pollination, with the role of RALFs within such processes probably coming later through gene duplication. Our data suggests that these duplications have occurred more rapidly within the Brassicaceae than the other species analyzed.</p>
<p>The inferred RALF phylogenetic trees presented by Cao and Shi (<xref ref-type="bibr" rid="B5">2012</xref>) and Sharma et al. (<xref ref-type="bibr" rid="B47">2016</xref>) frequently showed low local support values for splits, indicating that the algorithms used struggled to reliably separate the RALF family into groups. Our larger dataset allowed for an inferred tree with very high split support, which, in combination with the CLANS analysis and a detailed study of the individual protein sequences, suggests the presence of distinct RALF groups. Our phylogenetic analysis found that the RALF family has diverged into four clades. Two of these, clades I and II, can be considered the basal RALFs and proteins belonging to these clades are very well-conserved. Clade III RALFs share many similarities with those of clades I and II. Although they show some level of diversification, clades I, II, and III contain all of the features previously described to be characteristic of the RALF family, including the N-terminal signal peptide cleavage site (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>), C-terminal cysteines that form di-sulfide bridges (Pearce et al., <xref ref-type="bibr" rid="B39">2001</xref>), the mature peptide YISY motif (Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>), and the RRXL di-basic site (Matos et al., <xref ref-type="bibr" rid="B31">2008</xref>). However, clade IV, that represents a third of the RALF family, does not contain all characteristic RALF features. Almost all clade IV RALFs lack the RRXL motif, exhibit much more variation within the YISY motif than clades I, II, and III and are much shorter and variable than the other clades. The YISY motif has been previously shown to be required for binding of AtRALF1 to its receptor (Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>). Although most clade IV RALFs do possess the isoleucine residue known to be the most important within the YISY motif (Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>), the widespread conservation of the four residues outside of clade IV suggests that they also have a functional role. We question whether the clade IV RALFs should be considered as a separate group from those in other clades, as such dramatic differences within their protein sequences are likely to alter their structure and processing. We therefore propose that the members of this clade are not true RALFs. A similar nomenclature has been applied to the CLE/CLE-like peptide families (Meng et al., <xref ref-type="bibr" rid="B33">2012</xref>). In this case, however, RALF and RALF-like are already used by different authors to describe the same gene. Consequently, to avoid further confusion we suggest that members of clade IV should instead be referred to as RALF-related proteins. Other authors have noted that not all RALFs contain the YISY and RRXL motifs (Srivastava et al., <xref ref-type="bibr" rid="B50">2009</xref>; Pearce et al., <xref ref-type="bibr" rid="B40">2010</xref>; Cao and Shi, <xref ref-type="bibr" rid="B5">2012</xref>; Murphy and De Smet, <xref ref-type="bibr" rid="B36">2014</xref>) but here, we provide more insight into these differences within a wider phylogenetic context.</p>
<p>Until now, most experimental <italic>in planta</italic> studies assessing RALF function have focused on a minority of family members. FER is the only experimentally verified RALF receptor at this time (Haruta et al., <xref ref-type="bibr" rid="B22">2014</xref>) and much work is needed to be done on the relationship between RALFs and their receptors. For instance, FER is known to control crucial fertility events such as pollen tube-ovule interactions (Huck et al., <xref ref-type="bibr" rid="B24">2003</xref>), and there is also evidence for the role of RALF peptides in pollen development (Covey et al., <xref ref-type="bibr" rid="B7">2010</xref>). However, we do not know how these are linked and it is not yet clear if the binding of different RALFs to FER is responsible for the extensive influence that FER has upon many aspects of development. The widespread conservation of the YISY motif across clade I-III RALFs suggests that these peptides bind to the same receptor.</p>
<p>In Arabidopsis, less than a third of the RALFs have been studied in any detail (Murphy and De Smet, <xref ref-type="bibr" rid="B36">2014</xref>). Of these, only AtRALF8 belongs to clade IV and this is the only Arabidopsis RALF to have a proven role in regulating the response to biotic and abiotic stresses thus far (Atkinson et al., <xref ref-type="bibr" rid="B3">2013</xref>). This also indicates that the clade IV RALFs are indeed functional. Furthermore, there is evidence that other receptors exist. AtRALFL4, here placed within clade III(B), actually increased in alkalinization activity in the presence of the suramin, a general inhibitor of peptide-ligand-receptor interactions, in contrast to the decreased activity of AtRALFL1, 19, 22, 23, 24, 31, 33, and 34 (Morato do Canto et al., <xref ref-type="bibr" rid="B35">2014</xref>). This would suggest that some RALFs instead bind to other receptors that are not susceptible to suramin. The binding to these other receptors may or may not depend upon the YISY motif and it may be that the clade IV RALFs with their more variable motif bind to different receptors to those of other clades. As these proteins are missing the RRXL di-basic site, their cleavage and processing may also occur in an alternate manner. One other possibility is that the RALF-related peptides bind to the same receptors but act synergistically or antagonistically to the true RALFs. The presence of such antagonistic interactions between closely related peptides has recently become apparent in plants for the first time in stomatal patterning. STOMAGEN and EFP2, members of the same peptide family, competitively bind to the ERECTA receptor kinase to promote or inhibit stomatal development, respectively (Lee et al., <xref ref-type="bibr" rid="B28">2015</xref>). Future experimental work could investigate whether such interactions exist between the clade I&#x02013;III and the clade IV RALF peptides described here.</p>
<p>It is not yet known to what extent redundancy exists across the RALF family. It has been shown for other types of small peptide that genes with similar peptide motifs are more likely to have redundant and overlapping functions (Ito et al., <xref ref-type="bibr" rid="B25">2006</xref>; Meng et al., <xref ref-type="bibr" rid="B34">2010</xref>). As an example, CLE41, CLE42, and CLE44 are functionally redundant within vascular cell differentiation and have almost identical CLE motif sequences (Ito et al., <xref ref-type="bibr" rid="B25">2006</xref>). Additionally, domain-swap experiments have demonstrated that the CLE motif itself is largely responsible for specifying the overall function of the gene, rather than the sequences outside of the motif, such as the signal peptide (Meng et al., <xref ref-type="bibr" rid="B34">2010</xref>). No equivalent data exists for the RALF family, although such experiments could help to identify whether the variations that we are described here within the mature peptide region relate to their function.</p>
</sec>
<sec id="s5">
<title>Author contributions</title>
<p>LC carried out the analysis. LC and ST devised the analysis and wrote the manuscript</p>
</sec>
<sec id="s6">
<title>Funding</title>
<p>The author wish to acknowledge the BBSRC who was supported by an award for the BBSRC Doctoral Training Partnership programme (BB/J014478/1) awarded to the University of Manchester.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ack><p>We would like to thank Professor Sam Griffiths-Jones, Dr. Manoj Kumar, and Dr. Matthew Cooper for their critical reading of the manuscript. LC is funded through the BBSRC Doctoral Training Partnership programme (BB/J014478/1).</p>
</ack>
<sec sec-type="supplementary-material" id="s7">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fpls.2017.00037/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fpls.2017.00037/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S1</label>
<caption><p><bold>An annotated list of the 795 identified RALFs and the sources of the genomes used in this study</bold>.</p></caption></supplementary-material>
<supplementary-material xlink:href="Table2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S2</label>
<caption><p><bold>A comparison of the RALF identification methods used by our study and Pfam</bold>.</p></caption></supplementary-material>
<supplementary-material xlink:href="DataSheet1.DOCX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname> <given-names>S. F.</given-names></name> <name><surname>Gish</surname> <given-names>W.</given-names></name> <name><surname>Miller</surname> <given-names>W.</given-names></name> <name><surname>Myers</surname> <given-names>E. W.</given-names></name> <name><surname>Lipman</surname> <given-names>D. J.</given-names></name></person-group> (<year>1990</year>). <article-title>Basic local alignment search tool</article-title>. <source>J. Mol. Biol.</source> <volume>215</volume>, <fpage>403</fpage>&#x02013;<lpage>410</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-2836(05)80360-2</pub-id><pub-id pub-id-type="pmid">2231712</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alva</surname> <given-names>V.</given-names></name> <name><surname>Nam</surname> <given-names>S.-Z.</given-names></name> <name><surname>S&#x000F6;ding</surname> <given-names>J.</given-names></name> <name><surname>Lupas</surname> <given-names>A. N.</given-names></name></person-group> (<year>2016</year>). <article-title>The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis</article-title>. <source>Nucleic Acids Res.</source> <volume>44</volume>, <fpage>410</fpage>&#x02013;<lpage>415</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw348</pub-id><pub-id pub-id-type="pmid">27131380</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Atkinson</surname> <given-names>N. J.</given-names></name> <name><surname>Lilley</surname> <given-names>C. J.</given-names></name> <name><surname>Urwin</surname> <given-names>P. E.</given-names></name></person-group> (<year>2013</year>). <article-title>Identification of genes involved in the response of Arabidopsis to simultaneous biotic and abiotic stresses</article-title>. <source>Plant Physiol.</source> <volume>162</volume>, <fpage>2028</fpage>&#x02013;<lpage>2041</lpage>. <pub-id pub-id-type="doi">10.1104/pp.113.222372</pub-id><pub-id pub-id-type="pmid">23800991</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bailey</surname> <given-names>T. L.</given-names></name> <name><surname>Elkan</surname> <given-names>C.</given-names></name></person-group> (<year>1994</year>). <article-title>Fitting a mixture model by expectation maximization to discover motifs in bipolymers</article-title>. <source>Proc. Int. Conf. Intell. Syst. Mol. Biol</source>. <volume>2</volume>, <fpage>28</fpage>&#x02013;<lpage>36</lpage>. <pub-id pub-id-type="pmid">7584402</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cao</surname> <given-names>J.</given-names></name> <name><surname>Shi</surname> <given-names>F.</given-names></name></person-group> (<year>2012</year>). <article-title>Evolution of the RALF gene family in plants: gene duplication and selection patterns</article-title>. <source>Evol. Bioinform.</source> <volume>8</volume>, <fpage>271</fpage>&#x02013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.4137/ebo.s9652</pub-id><pub-id pub-id-type="pmid">22745530</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Combier</surname> <given-names>J.</given-names></name> <name><surname>K&#x000FC;ster</surname> <given-names>H.</given-names></name> <name><surname>Journet</surname> <given-names>E. P.</given-names></name> <name><surname>Hohnjec</surname> <given-names>N.</given-names></name> <name><surname>Gamas</surname> <given-names>P.</given-names></name> <name><surname>Niebel</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Evidence for the involvement in nodulation of the two small putative regulatory peptide-encoding genes MtRALFL1 and MtDVL1</article-title>. <source>Mol. Plant Microbe Interact.</source> <volume>21</volume>, <fpage>1118</fpage>&#x02013;<lpage>1127</lpage>. <pub-id pub-id-type="doi">10.1094/MPMI-21-8-1118</pub-id><pub-id pub-id-type="pmid">18616408</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Covey</surname> <given-names>P. A.</given-names></name> <name><surname>Subbaiah</surname> <given-names>C. C.</given-names></name> <name><surname>Parsons</surname> <given-names>R. L.</given-names></name> <name><surname>Pearce</surname> <given-names>G.</given-names></name> <name><surname>Lay</surname> <given-names>F. T.</given-names></name> <name><surname>Anderson</surname> <given-names>M. A.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>A pollen-specific RALF from tomato that regulates pollen tube elongation</article-title>. <source>Plant Physiol.</source> <volume>153</volume>, <fpage>703</fpage>&#x02013;<lpage>715</lpage>. <pub-id pub-id-type="doi">10.1104/pp.110.155457</pub-id><pub-id pub-id-type="pmid">20388667</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crooks</surname> <given-names>G. E.</given-names></name> <name><surname>Hon</surname> <given-names>G.</given-names></name> <name><surname>Chandonia</surname> <given-names>J. M.</given-names></name> <name><surname>Brenner</surname> <given-names>S. E.</given-names></name></person-group> (<year>2004</year>). <article-title>WebLogo: a sequence logo generator</article-title>. <source>Genome Res.</source> <volume>14</volume>, <fpage>1188</fpage>&#x02013;<lpage>1190</lpage>. <pub-id pub-id-type="doi">10.1101/gr.849004</pub-id><pub-id pub-id-type="pmid">15173120</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Czyzewicz</surname> <given-names>N.</given-names></name> <name><surname>Yue</surname> <given-names>K.</given-names></name> <name><surname>Beeckman</surname> <given-names>T.</given-names></name> <name><surname>De Smet</surname> <given-names>I.</given-names></name></person-group> (<year>2013</year>). <article-title>Message in a bottle: small signalling peptide outputs during growth and development</article-title>. <source>J. Exp. Bot.</source> <volume>64</volume>, <fpage>5281</fpage>&#x02013;<lpage>5296</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/ert283</pub-id><pub-id pub-id-type="pmid">24014870</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delay</surname> <given-names>C.</given-names></name> <name><surname>Imin</surname> <given-names>N.</given-names></name> <name><surname>Djordjevic</surname> <given-names>M. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Regulation of Arabidopsis root development by small signaling peptides</article-title>. <source>Front. Plant Sci.</source> <volume>4</volume>:<fpage>352</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2013.00352</pub-id><pub-id pub-id-type="pmid">24046775</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Du</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>B.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Receptor kinase complex transmits RALF peptide signal to inhibit root growth in Arabidopsis</article-title>. <source>Proc. Nat. Acad. Sci. U.S.A.</source> <volume>13</volume>, <fpage>8326</fpage>&#x02013;<lpage>8334</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1609626113</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2004</year>). <article-title>MUSCLE: multiple sequence alignment with high accuracy and high throughput</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume>, <fpage>1792</fpage>&#x02013;<lpage>1797</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh340</pub-id><pub-id pub-id-type="pmid">15034147</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frickey</surname> <given-names>T.</given-names></name> <name><surname>Lupas</surname> <given-names>A.</given-names></name></person-group> (<year>2004</year>). <article-title>CLANS: a Java application for visualizing protein families based on pairwise similarity</article-title>. <source>Bioinformatics</source> <volume>20</volume>, <fpage>3702</fpage>&#x02013;<lpage>3704</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bth444</pub-id><pub-id pub-id-type="pmid">15284097</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Germain</surname> <given-names>H.</given-names></name> <name><surname>Chevalier</surname> <given-names>&#x000C9;.</given-names></name> <name><surname>Caron</surname> <given-names>S.</given-names></name> <name><surname>Matton</surname> <given-names>D. P.</given-names></name></person-group> (<year>2005</year>). <article-title>Characterization of five RALF-like genes from Solanum chacoense provides support for a developmental role in plants</article-title>. <source>Planta</source> <volume>220</volume>, <fpage>447</fpage>&#x02013;<lpage>454</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-004-1352-0</pub-id><pub-id pub-id-type="pmid">15293049</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghorbani</surname> <given-names>S.</given-names></name> <name><surname>Lin</surname> <given-names>Y.-C.</given-names></name> <name><surname>Parizot</surname> <given-names>B.</given-names></name> <name><surname>Fernandez</surname> <given-names>A.</given-names></name> <name><surname>Njo</surname> <given-names>M. F.</given-names></name> <name><surname>Van de Peer</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Expanding the repertoire of secretory peptides controlling root development with comparative genome analysis and functional assays</article-title>. <source>J. Exp. Bot.</source> <volume>66</volume>, <fpage>5257</fpage>&#x02013;<lpage>5269</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/erv346</pub-id><pub-id pub-id-type="pmid">26195730</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goad</surname> <given-names>D.</given-names></name> <name><surname>Zhu</surname> <given-names>C.</given-names></name> <name><surname>Kellogg</surname> <given-names>E.</given-names></name></person-group> (<year>2016</year>). <article-title>Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function</article-title>. <source>New Phytol</source>. [Epub ahead of print]. <pub-id pub-id-type="doi">10.1111/nph.14348</pub-id><pub-id pub-id-type="pmid">27911469</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goodstein</surname> <given-names>D. M.</given-names></name> <name><surname>Shu</surname> <given-names>S.</given-names></name> <name><surname>Howson</surname> <given-names>R.</given-names></name> <name><surname>Neupane</surname> <given-names>R.</given-names></name> <name><surname>Hayes</surname> <given-names>R. D.</given-names></name> <name><surname>Fazo</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Phytozome: a comparative platform for green plant genomics</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>, <fpage>D1178</fpage>&#x02013;<lpage>D1186</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr944</pub-id><pub-id pub-id-type="pmid">22110026</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grant</surname> <given-names>C. E.</given-names></name> <name><surname>Bailey</surname> <given-names>T. L.</given-names></name> <name><surname>Noble</surname> <given-names>W. S.</given-names></name></person-group> (<year>2011</year>) <article-title>FIMO: Scanning for occurrences of a given motif</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>1017</fpage>&#x02013;<lpage>1018</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr064</pub-id><pub-id pub-id-type="pmid">21330290</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Greiner</surname> <given-names>M.</given-names></name> <name><surname>Pfeiffer</surname> <given-names>D.</given-names></name> <name><surname>Smith</surname> <given-names>R. D.</given-names></name></person-group> (<year>2000</year>). <article-title>Principles and practical application of the receiver-operating characteristic analysis for diagnostic tests</article-title>. <source>Prev. Vet. Med.</source> <volume>45</volume>, <fpage>23</fpage>&#x02013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-5877(00)00115-X</pub-id><pub-id pub-id-type="pmid">10802332</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Han</surname> <given-names>M. V.</given-names></name> <name><surname>Zmasek</surname> <given-names>C. M.</given-names></name></person-group> (<year>2009</year>). <article-title>phyloXML: XML for evolutionary biology and comparative genomics</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>:<fpage>356</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-356</pub-id><pub-id pub-id-type="pmid">19860910</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haruta</surname> <given-names>M.</given-names></name> <name><surname>Constabel</surname> <given-names>C. P.</given-names></name></person-group> (<year>2003</year>). <article-title>Rapid alkalinization factors in poplar cell cultures. Peptide isolation, cDNA cloning, and differential expression in leaves and methyl jasmonate-treated cells</article-title>. <source>Plant Physiol.</source> <volume>131</volume>, <fpage>814</fpage>&#x02013;<lpage>823</lpage>. <pub-id pub-id-type="doi">10.1104/pp.014597</pub-id><pub-id pub-id-type="pmid">12586905</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haruta</surname> <given-names>M.</given-names></name> <name><surname>Sabat</surname> <given-names>G.</given-names></name> <name><surname>Stecker</surname> <given-names>K.</given-names></name> <name><surname>Minkoff</surname> <given-names>B. B.</given-names></name> <name><surname>Sussman</surname> <given-names>M. R.</given-names></name></person-group> (<year>2014</year>). <article-title>A peptide hormone and its receptor protein kinase regulate plant cell expansion</article-title>. <source>Science</source> <volume>343</volume>, <fpage>408</fpage>&#x02013;<lpage>411</lpage>. <pub-id pub-id-type="doi">10.1126/science.1244454</pub-id><pub-id pub-id-type="pmid">24458638</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hruz</surname> <given-names>T.</given-names></name> <name><surname>Laule</surname> <given-names>O.</given-names></name> <name><surname>Szabo</surname> <given-names>G.</given-names></name> <name><surname>Wessendorp</surname> <given-names>F.</given-names></name> <name><surname>Bleuler</surname> <given-names>S.</given-names></name> <name><surname>Oertle</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Genevestigator v3: a reference expression database for the meta-analysis of transcriptomes</article-title>. <source>Adv. Bioinformatics</source> <volume>2008</volume>:<fpage>420747</fpage>. <pub-id pub-id-type="doi">10.1155/2008/420747</pub-id><pub-id pub-id-type="pmid">19956698</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huck</surname> <given-names>N.</given-names></name> <name><surname>Moore</surname> <given-names>J. M.</given-names></name> <name><surname>Federer</surname> <given-names>M.</given-names></name> <name><surname>Grossniklaus</surname> <given-names>U.</given-names></name></person-group> (<year>2003</year>). <article-title>The Arabidopsis mutant feronia disrupts the female gametophytic control of pollen tube reception</article-title>. <source>Development</source> <volume>130</volume>, <fpage>2149</fpage>&#x02013;<lpage>2159</lpage>. <pub-id pub-id-type="doi">10.1242/dev.00458</pub-id><pub-id pub-id-type="pmid">12668629</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ito</surname> <given-names>Y.</given-names></name> <name><surname>Nakanomyo</surname> <given-names>I.</given-names></name> <name><surname>Motose</surname> <given-names>H.</given-names></name> <name><surname>Iwamoto</surname> <given-names>K.</given-names></name> <name><surname>Sawa</surname> <given-names>S.</given-names></name> <name><surname>Dohmae</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Dodeca-CLE peptides as suppressors of plant stem cell differentiation</article-title>. <source>Science</source> <volume>313</volume>, <fpage>842</fpage>&#x02013;<lpage>845</lpage>. <pub-id pub-id-type="doi">10.1126/science.1128436</pub-id><pub-id pub-id-type="pmid">16902140</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klus</surname> <given-names>P.</given-names></name> <name><surname>Bolognesi</surname> <given-names>B.</given-names></name> <name><surname>Agostini</surname> <given-names>F.</given-names></name> <name><surname>Marchese</surname> <given-names>D.</given-names></name> <name><surname>Zanzoni</surname> <given-names>A.</given-names></name> <name><surname>Tartaglia</surname> <given-names>G. G.</given-names></name></person-group> (<year>2014</year>). <article-title>The cleverSuite approach for protein characterization: predictions of structural properties, solubility, chaperone requirements and RNA-binding abilities</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>1601</fpage>&#x02013;<lpage>1608</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu074</pub-id><pub-id pub-id-type="pmid">24493033</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larsson</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>AliView: a fast and lightweight alignment viewer and editor for large datasets</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>3276</fpage>&#x02013;<lpage>3278</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu531</pub-id><pub-id pub-id-type="pmid">25095880</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J. S.</given-names></name> <name><surname>Hnilova</surname> <given-names>M.</given-names></name> <name><surname>Maes</surname> <given-names>M.</given-names></name> <name><surname>Lin</surname> <given-names>Y.-C. L.</given-names></name> <name><surname>Putarjunan</surname> <given-names>A.</given-names></name> <name><surname>Han</surname> <given-names>S.-K.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Competitive binding of antagonistic peptides fine-tunes stomatal patterning</article-title>. <source>Nature</source> <volume>522</volume>, <fpage>439</fpage>&#x02013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1038/nature14561</pub-id><pub-id pub-id-type="pmid">26083750</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindner</surname> <given-names>H.</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>L. M.</given-names></name> <name><surname>Boisson-Dernier</surname> <given-names>A.</given-names></name> <name><surname>Grossniklaus</surname> <given-names>U.</given-names></name></person-group> (<year>2012</year>). <article-title>CrRLK1L receptor-like kinases: not just another brick in the wall</article-title>. <source>Curr. Opin. Plant Biol.</source> <volume>15</volume>, <fpage>659</fpage>&#x02013;<lpage>669</lpage>. <pub-id pub-id-type="doi">10.1016/j.pbi.2012.07.003</pub-id><pub-id pub-id-type="pmid">22884521</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marchese</surname> <given-names>D.</given-names></name> <name><surname>de Groot</surname> <given-names>N. S.</given-names></name> <name><surname>Lorenzo Gotor</surname> <given-names>N.</given-names></name> <name><surname>Livi</surname> <given-names>C. M.</given-names></name> <name><surname>Tartaglia</surname> <given-names>G. G.</given-names></name></person-group> (<year>2016</year>). <article-title>Advances in the characterization of RNA-binding proteins</article-title>. <source>Wiley Interdiscip. Rev. RNA.</source> <volume>7</volume>, <fpage>793</fpage>&#x02013;<lpage>810</lpage>. <pub-id pub-id-type="doi">10.1002/wrna.1378</pub-id><pub-id pub-id-type="pmid">27503141</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matos</surname> <given-names>J. L.</given-names></name> <name><surname>Fiori</surname> <given-names>C. S.</given-names></name> <name><surname>Silva-Filho</surname> <given-names>M. C.</given-names></name> <name><surname>Moura</surname> <given-names>D. S.</given-names></name></person-group> (<year>2008</year>). <article-title>A conserved dibasic site is essential for correct processing of the peptide hormone AtRALF1 in <italic>Arabidopsis thaliana</italic></article-title>. <source>FEBS Lett.</source> <volume>582</volume>, <fpage>3343</fpage>&#x02013;<lpage>3347</lpage>. <pub-id pub-id-type="doi">10.1016/j.febslet.2008.08.025</pub-id><pub-id pub-id-type="pmid">18775699</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsubayashi</surname> <given-names>Y.</given-names></name></person-group> (<year>2014</year>). <article-title>Posttranslationally modified small-peptide signals in plants</article-title>. <source>Annu. Rev. Plant Biol</source>. <volume>65</volume>, <fpage>385</fpage>&#x02013;<lpage>413</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-arplant-050312-120122</pub-id><pub-id pub-id-type="pmid">24779997</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>L.</given-names></name> <name><surname>Buchanan</surname> <given-names>B. B.</given-names></name> <name><surname>Feldman</surname> <given-names>L. J.</given-names></name> <name><surname>Luan</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>CLE-like (CLEL) peptides control the pattern of root growth and lateral root development in Arabidopsis</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>109</volume>, <fpage>1760</fpage>&#x02013;<lpage>1765</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1119864109</pub-id><pub-id pub-id-type="pmid">22307643</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>L.</given-names></name> <name><surname>Ruth</surname> <given-names>K. C.</given-names></name> <name><surname>Fletcher</surname> <given-names>J. C.</given-names></name> <name><surname>Feldman</surname> <given-names>L.</given-names></name> <name><surname>Brand</surname> <given-names>U.</given-names></name> <name><surname>Fletcher</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The roles of different CLE Domains in Arabidopsis CLE polypeptide activity and functional specificity</article-title>. <source>Mol. Plant</source> <volume>3</volume>, <fpage>760</fpage>&#x02013;<lpage>772</lpage>. <pub-id pub-id-type="doi">10.1093/mp/ssq021</pub-id><pub-id pub-id-type="pmid">20494950</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morato do Canto</surname> <given-names>A.</given-names></name> <name><surname>Ceciliato</surname> <given-names>P. H. O.</given-names></name> <name><surname>Ribeiro</surname> <given-names>B.</given-names></name> <name><surname>Ortiz Morea</surname> <given-names>F. A.</given-names></name> <name><surname>Franco Garcia</surname> <given-names>A. A.</given-names></name> <name><surname>Silva-Filho</surname> <given-names>M. C.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Biological activity of nine recombinant AtRALF peptides: implications for their perception and function in Arabidopsis</article-title>. <source>Plant Physiol. Biochem.</source> <volume>75</volume>, <fpage>45</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1016/j.plaphy.2013.12.005</pub-id><pub-id pub-id-type="pmid">24368323</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>E.</given-names></name> <name><surname>De Smet</surname> <given-names>I.</given-names></name></person-group> (<year>2014</year>). <article-title>Understanding the RALF family: a tale of many species</article-title>. <source>Trends Plant Sci.</source> <volume>19</volume>, <fpage>664</fpage>&#x02013;<lpage>671</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2014.06.005</pub-id><pub-id pub-id-type="pmid">24999241</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>E.</given-names></name> <name><surname>Vu</surname> <given-names>L. D.</given-names></name> <name><surname>Van den Broeck</surname> <given-names>L.</given-names></name> <name><surname>Lin</surname> <given-names>Z.</given-names></name> <name><surname>Ramakrishna</surname> <given-names>P.</given-names></name> <name><surname>van de Cotte</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>RALFL34 regulates formative cell divisions in Arabidopsis pericycle during lateral root initiation</article-title>. <source>J. Exp. Bot.</source> <volume>67</volume>, <fpage>4863</fpage>&#x02013;<lpage>4875</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/erw281</pub-id><pub-id pub-id-type="pmid">27521602</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oelkers</surname> <given-names>K.</given-names></name> <name><surname>Goffard</surname> <given-names>N.</given-names></name> <name><surname>Weiller</surname> <given-names>G. F.</given-names></name> <name><surname>Gresshoff</surname> <given-names>P. M.</given-names></name> <name><surname>Mathesius</surname> <given-names>U.</given-names></name> <name><surname>Frickey</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Bioinformatic analysis of the CLE signaling peptide family</article-title>. <source>BMC Plant Biol.</source> <volume>8</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2229-8-1</pub-id><pub-id pub-id-type="pmid">18171480</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pearce</surname> <given-names>G.</given-names></name> <name><surname>Moura</surname> <given-names>D. S.</given-names></name> <name><surname>Stratmann</surname> <given-names>J.</given-names></name> <name><surname>Ryan</surname> <given-names>C. A.</given-names></name></person-group> (<year>2001</year>). <article-title>RALF, a 5-kDa ubiquitous polypeptide in plants, arrests root growth and development</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>98</volume>, <fpage>12843</fpage>&#x02013;<lpage>12847</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.201416998</pub-id><pub-id pub-id-type="pmid">11675511</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pearce</surname> <given-names>G.</given-names></name> <name><surname>Yamaguchi</surname> <given-names>Y.</given-names></name> <name><surname>Munske</surname> <given-names>G.</given-names></name> <name><surname>Ryan</surname> <given-names>C. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Structure-activity studies of RALF, Rapid Alkalinization Factor, reveal an essential - YISY - motif</article-title>. <source>Peptides</source> <volume>31</volume>, <fpage>1973</fpage>&#x02013;<lpage>1977</lpage>. <pub-id pub-id-type="doi">10.1016/j.peptides.2010.08.012</pub-id><pub-id pub-id-type="pmid">20800638</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>L.</given-names></name> <name><surname>Ma</surname> <given-names>J.</given-names></name> <name><surname>Chi</surname> <given-names>W.</given-names></name> <name><surname>Guo</surname> <given-names>J.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name> <name><surname>Lu</surname> <given-names>Q.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>LOW PSII ACCUMULATION1 is involved in efficient assembly of photosystem II in <italic>Arabidopsis thaliana</italic></article-title>. <source>Plant Cell</source> <volume>18</volume>, <fpage>955</fpage>&#x02013;<lpage>969</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.105.037689</pub-id><pub-id pub-id-type="pmid">16531500</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Price</surname> <given-names>M. N.</given-names></name> <name><surname>Dehal</surname> <given-names>P. S.</given-names></name> <name><surname>Arkin</surname> <given-names>A. P.</given-names></name></person-group> (<year>2010</year>). <article-title>FastTree 2 - Approximately maximum-likelihood trees for large alignments</article-title>. <source>PLoS ONE</source> <volume>5</volume>:<fpage>e9490</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0009490</pub-id><pub-id pub-id-type="pmid">20224823</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Philippe</surname> <given-names>H.</given-names></name> <name><surname>Brinkmann</surname> <given-names>H.</given-names></name> <name><surname>Lavrov</surname> <given-names>D. V.</given-names></name> <name><surname>Littlewood</surname> <given-names>D. T. J.</given-names></name> <name><surname>Manuel</surname> <given-names>M.</given-names></name> <name><surname>W&#x000F6;rheide</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Resolving difficult phylogenetic questions: why more sequences are not enough</article-title>. <source>PLoS Biol.</source> <volume>9</volume>:<fpage>e1000602</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pbio.1000602</pub-id><pub-id pub-id-type="pmid">21423652</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Piel</surname> <given-names>W. H.</given-names></name> <name><surname>Donoghue</surname> <given-names>M.</given-names></name> <name><surname>Sanderson</surname> <given-names>M.</given-names></name></person-group> (<year>2002</year>). <article-title>TreeBASE: a database of phylogenetic information</article-title>, in <source>Proceedings of the 2nd International Workshop of Species 2000</source>. <publisher-loc>Tsukuba</publisher-loc>.</citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roberts</surname> <given-names>I.</given-names></name> <name><surname>Smith</surname> <given-names>S.</given-names></name> <name><surname>Rybel</surname> <given-names>B.</given-names></name> <name><surname>De</surname> <given-names>Van Den Broeke, J.</given-names></name> <name><surname>Smet</surname> <given-names>W.</given-names></name> <name><surname>De Cokere</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>The CEP family in land plants: evolutionary analyses, expression studies, and role in Arabidopsis shoot development</article-title>. <source>J. Exp. Bot.</source> <volume>64</volume>, <fpage>5371</fpage>&#x02013;<lpage>5381</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/ert331</pub-id><pub-id pub-id-type="pmid">24179095</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sawa</surname> <given-names>S.</given-names></name> <name><surname>Kinoshita</surname> <given-names>A.</given-names></name> <name><surname>Betsuyaku</surname> <given-names>S.</given-names></name> <name><surname>Fukuda</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>A large family of genes that share homology with CLE domain in Arabidopsis and rice</article-title>. <source>Plant Signal. Behav.</source> <volume>3</volume>, <fpage>337</fpage>&#x02013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.4161/psb.3.5.5344</pub-id><pub-id pub-id-type="pmid">19841664</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>A.</given-names></name> <name><surname>Hussain</surname> <given-names>A.</given-names></name> <name><surname>Mun</surname> <given-names>B.-G.</given-names></name> <name><surname>Imran</surname> <given-names>Q. M.</given-names></name> <name><surname>Falak</surname> <given-names>N.</given-names></name> <name><surname>Lee</surname> <given-names>S.-U.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Comprehensive analysis of plant rapid alkalization factor (RALF) genes</article-title>. <source>Plant Physiol. Biochem.</source> <volume>106</volume>, <fpage>82</fpage>&#x02013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1016/j.plaphy.2016.03.037</pub-id><pub-id pub-id-type="pmid">27155375</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shimodaira</surname> <given-names>H.</given-names></name> <name><surname>Hasegawa</surname> <given-names>M.</given-names></name></person-group> (<year>1999</year>). <article-title>Multiple comparisons of log-likelihoods with applications to phylogenetic inference</article-title>. <source>Mol. Biol. Evol.</source> <volume>16</volume>, <fpage>1114</fpage>&#x02013;<lpage>1116</lpage>. <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a026201</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silverstein</surname> <given-names>K. A. T.</given-names></name> <name><surname>Moskal</surname> <given-names>W. A.</given-names></name> <name><surname>Wu</surname> <given-names>H. C.</given-names></name> <name><surname>Underwood</surname> <given-names>B. A.</given-names></name> <name><surname>Graham</surname> <given-names>M. A.</given-names></name> <name><surname>Town</surname> <given-names>C. D.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants</article-title>. <source>Plant J.</source> <volume>51</volume>, <fpage>262</fpage>&#x02013;<lpage>280</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2007.03136.x</pub-id><pub-id pub-id-type="pmid">17565583</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Srivastava</surname> <given-names>R.</given-names></name> <name><surname>Liu</surname> <given-names>J. X.</given-names></name> <name><surname>Guo</surname> <given-names>H.</given-names></name> <name><surname>Yin</surname> <given-names>Y.</given-names></name> <name><surname>Howell</surname> <given-names>S. H.</given-names></name></person-group> (<year>2009</year>). <article-title>Regulation and processing of a plant peptide hormone, AtRALF23, in Arabidopsis</article-title>. <source>Plant J.</source> <volume>59</volume>, <fpage>930</fpage>&#x02013;<lpage>939</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2009.03926.x</pub-id><pub-id pub-id-type="pmid">19473327</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thynne</surname> <given-names>E.</given-names></name> <name><surname>Saur</surname> <given-names>I. M. L.</given-names></name> <name><surname>Simbaqueba</surname> <given-names>J.</given-names></name> <name><surname>Ogilvie</surname> <given-names>H. A.</given-names></name> <name><surname>Gonzalez-Cendales</surname> <given-names>Y.</given-names></name> <name><surname>Mead</surname> <given-names>O.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Fungal phytopathogens encode functional homologues of plant rapid alkalinisation factor (RALF) peptides</article-title>. <source>Mol. Plant Pathol</source>. [Epub ahead of print]. <pub-id pub-id-type="doi">10.1111/mpp.12444</pub-id><pub-id pub-id-type="pmid">27291634</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>J.</given-names></name> <name><surname>Kurten</surname> <given-names>E. L.</given-names></name> <name><surname>Monshausen</surname> <given-names>G.</given-names></name> <name><surname>Hummel</surname> <given-names>G. M.</given-names></name> <name><surname>Gilroy</surname> <given-names>S.</given-names></name> <name><surname>Baldwin</surname> <given-names>I. T.</given-names></name></person-group> (<year>2007</year>). <article-title>NaRALF, a peptide signal essential for the regulation of root hair tip apoplastic pH in <italic>Nicotiana attenuata</italic>, is required for root hair development and plant growth in native soils</article-title>. <source>Plant J.</source> <volume>52</volume>, <fpage>877</fpage>&#x02013;<lpage>890</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2007.03289.x</pub-id><pub-id pub-id-type="pmid">17916115</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu Lobanov</surname> <given-names>M.</given-names></name> <name><surname>Klus</surname> <given-names>P.</given-names></name> <name><surname>Sokolovsky</surname> <given-names>I. V.</given-names></name> <name><surname>Gaetano Tartaglia</surname> <given-names>G.</given-names></name> <name><surname>Galzitskaya</surname> <given-names>O. V.</given-names></name></person-group> (<year>2016</year>). <article-title>Non-random distribution of homo- repeats: links with biological functions and human diseases</article-title>. <source>Sci. Rep.</source> <volume>6</volume>:<fpage>26941</fpage>. <pub-id pub-id-type="doi">10.1038/srep26941</pub-id><pub-id pub-id-type="pmid">27256590</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zuccolo</surname> <given-names>A.</given-names></name> <name><surname>Bowers</surname> <given-names>J. E.</given-names></name> <name><surname>Estill</surname> <given-names>J. C.</given-names></name> <name><surname>Xiong</surname> <given-names>Z.</given-names></name> <name><surname>Luo</surname> <given-names>M.</given-names></name> <name><surname>Sebastian</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2011</year>) <article-title>A physical map for the Amborella trichopoda genome sheds light on the evolution of angiosperm genome structure</article-title>. <source>Genome Biol.</source> <volume>12</volume>:<fpage>R48</fpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://genomebiology.com/2011/12/5/R48">http://genomebiology.com/2011/12/5/R48</ext-link> <pub-id pub-id-type="pmid">21619600</pub-id></citation>
</ref>
</ref-list>
</back>
</article>