<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2022.870818</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Genomic Architecture and Evolution of the <italic>Cellulose synthase</italic> Gene Superfamily as Revealed by Phylogenomic Analysis</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Pancaldi</surname> <given-names>Francesco</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/700376/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>van Loo</surname> <given-names>Eibertus N.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/308907/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Schranz</surname> <given-names>M. Eric</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/190885/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Trindade</surname> <given-names>Luisa M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/65848/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Plant Breeding, Wageningen University &#x0026; Research</institution>, <addr-line>Wageningen</addr-line>, <country>Netherlands</country></aff>
<aff id="aff2"><sup>2</sup><institution>Biosystematics group, Wageningen University &#x0026; Research</institution>, <addr-line>Wageningen</addr-line>, <country>Netherlands</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Peter Ulvskov, University of Copenhagen, Denmark</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Monika S. Doblin, La Trobe University, Australia; Jesper Harholt, Carlsberg Research Laboratory, Denmark</p></fn>
<corresp id="c001">&#x002A;Correspondence: Luisa M. Trindade, <email>luisa.trindade@wur.nl</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Plant Systematics and Evolution, a section of the journal Frontiers in Plant Science</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>18</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>870818</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>15</day>
<month>03</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Pancaldi, van Loo, Schranz and Trindade.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Pancaldi, van Loo, Schranz and Trindade</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The <italic>Cellulose synthase</italic> superfamily synthesizes cellulose and different hemicellulosic polysaccharides in plant cell walls. While much has been discovered about the evolution and function of these genes, their genomic architecture and relationship with gene (sub-)functionalization and evolution remains unclear. By using 242 genomes covering plant evolution from green algae to eudicots, we performed a large-scale analysis of synteny, phylogenetic, and functional data of the <italic>CesA</italic> superfamily. Results revealed considerable gene copy number variation across species and gene families, and also two patterns &#x2013; singletons vs. tandem arrays &#x2013; in chromosomic gene arrangement. Synteny analysis revealed exceptional conservation of gene architecture across species, but also lineage-specific patterns across gene (sub-)families. Synteny patterns correlated with gene sub-functionalization into primary and secondary <italic>CesAs</italic> and distinct CslD functional isoforms. Furthermore, a genomic context shift of a group of cotton secondary <italic>CesAs</italic> was associated with peculiar properties of cotton fiber synthesis. Finally, phylogenetics suggested that primary <italic>CesA</italic> sequences appeared before the secondary <italic>CesAs</italic>, while phylogenomic analyses unveiled the genomic trace of the <italic>CslD</italic> duplication that initiated the <italic>CslF</italic> family. Our results describe in detail the genomic architecture of the <italic>CesA</italic> superfamily in plants, highlighting its crucial relevance for gene diversification and sub-functionalization, and for understanding their evolution.</p>
</abstract>
<kwd-group>
<kwd><italic>cellulose synthase</italic> superfamily</kwd>
<kwd>CesA</kwd>
<kwd>CslD</kwd>
<kwd>CslF</kwd>
<kwd>evolution</kwd>
<kwd>phylogenomics</kwd>
<kwd>plant cell walls</kwd>
<kwd>synteny analysis</kwd>
</kwd-group>
<contract-sponsor id="cn001">European Commission<named-content content-type="fundref-id">10.13039/501100000780</named-content></contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="69"/>
<page-count count="15"/>
<word-count count="10770"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Plant cell walls (CWs) are versatile structures that surround plant cells and fulfil key plant functions, including providing tensile strength and mediating plant-environment interactions (<xref ref-type="bibr" rid="B50">Sarkar et al., 2009</xref>). About 60-90% of CWs&#x2019; dry weight is constituted by cellulose and hemicellulosic polymers (<xref ref-type="bibr" rid="B46">Pettolino et al., 2012</xref>), synthesized by the enzymes coded by <italic>Cellulose synthase</italic> (<italic>Ces) A</italic> gene superfamily (<xref ref-type="bibr" rid="B49">Richmond and Somerville, 2000</xref>). The genes of this superfamily are present in all land plants and globally divided into 12 gene families (<xref ref-type="bibr" rid="B49">Richmond and Somerville, 2000</xref>; <xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). Among these families, <italic>CesA</italic> genes were the first discovered members of the <italic>CesA</italic> superfamily and are involved in cellulose synthesis (<xref ref-type="bibr" rid="B45">Pear et al., 1996</xref>; <xref ref-type="bibr" rid="B57">Turner and Somerville, 1997</xref>; <xref ref-type="bibr" rid="B2">Arioli et al., 1998</xref>). These genes are a monophyletic group that the algal ancestors of land plants acquired through horizontal transfer from cyanobacteria that expanded, diversified, and sub-functionalized during plant evolution (<xref ref-type="bibr" rid="B3">Banasiak, 2014</xref>, <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>). Specifically, angiosperms typically contain 10-20 <italic>CesA</italic> genes (<xref ref-type="bibr" rid="B49">Richmond and Somerville, 2000</xref>; <xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>), and different CesA isoforms mediate cellulose synthesis in either primary or secondary cell walls (<xref ref-type="bibr" rid="B26">Kalluri and Joshi, 2004</xref>; <xref ref-type="bibr" rid="B6">Burton et al., 2004</xref>; <xref ref-type="bibr" rid="B47">Ranik and Myburg, 2006</xref>; <xref ref-type="bibr" rid="B61">Wang et al., 2010</xref>; <xref ref-type="bibr" rid="B10">Carroll et al., 2012</xref>; <xref ref-type="bibr" rid="B43">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="B28">Kaur et al., 2016</xref>). Moreover, different CesA members assemble into functional hetero-multimeric cellulose synthesis complexes that accomplish the actual cellulose synthesis (<xref ref-type="bibr" rid="B22">Haigler and Roberts, 2019</xref>; <xref ref-type="bibr" rid="B10">Carroll et al., 2012</xref>). Despite the recent advances in the knowledge of <italic>CesA</italic> biology, neither the evolutionary trajectories that led to the current <italic>CesA</italic> diversity nor the genomic architecture of this family and its relationship with CW biology are fully understood (<xref ref-type="bibr" rid="B42">Little et al., 2018</xref>).</p>
<p>The other <italic>CesA</italic> superfamily genes besides the <italic>CesA</italic> family are termed <italic>Cellulose synthase-like</italic> (<italic>Csl</italic>) genes. <italic>Csl</italic> genes participate in the synthesis of different hemicellulosic polysaccharides (<xref ref-type="bibr" rid="B49">Richmond and Somerville, 2000</xref>; <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>, <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>) and include 11 families, termed <italic>CslA</italic> to <italic>CslM</italic> (<xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). The <italic>CslB</italic>/<italic>D</italic>/<italic>E</italic>/<italic>F</italic>/<italic>G</italic>/<italic>H</italic>/<italic>J</italic>/<italic>M</italic> families belong to the same monophyletic lineage of the <italic>CesA</italic> genes (<xref ref-type="bibr" rid="B3">Banasiak, 2014</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>), while the <italic>CslA</italic>/<italic>C</italic>/<italic>K</italic> families form a different clade that originated through an independent endosymbiosis in green algae (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B3">Banasiak, 2014</xref>, <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). Within the <italic>CslB</italic>/<italic>D</italic>/<italic>E</italic>/<italic>F</italic>/<italic>G</italic>/<italic>H</italic>/<italic>J</italic>/<italic>M</italic> families, the <italic>CslD</italic> and <italic>CslF</italic> clades are phylogenetically closer to <italic>CesA</italic> genes than the others (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>). Specifically, <italic>CslDs</italic> form a sister clade <italic>CesAs</italic>, while <italic>CslFs</italic> form a Poaceae-specific sister clade of <italic>CslDs</italic>. While phylogenetic analyses suggest that <italic>CslF</italic> genes originated from <italic>CslD</italic> duplication (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>), it is unclear if such duplication took place after the divergence of the <italic>CslD</italic> clade, or in an ancestral clade of both the families (<xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). Similarly, the <italic>CslB</italic>/<italic>E</italic>/<italic>G</italic>/<italic>H</italic>/<italic>J</italic>/<italic>M</italic> families diverged independently of the <italic>CslD</italic> and <italic>CslF</italic> genes from the <italic>CesA</italic> family (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>), but the timing and modes of their evolution is largely undefined (<xref ref-type="bibr" rid="B3">Banasiak, 2014</xref>). Furthermore, the role of the genomic architecture of genes in the evolutionary trajectories of all these families is unclear.</p>
<p>Knowledge gaps are also currently open regarding the function of the <italic>Csl</italic> families. In this respect, the <italic>CslF</italic>, <italic>CslH</italic>, and <italic>CslJ</italic> families are known to be involved in the synthesis of (1,3;1,4)-&#x03B2;-glucans (or mixed-linkage glucans), a group of polymers mainly found in Poaceae (grass) CWs that consists of (1,3;1,4)-&#x03B2; linked glucosyl residues (<xref ref-type="bibr" rid="B7">Burton et al., 2006</xref>; <xref ref-type="bibr" rid="B18">Doblin et al., 2009</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). While the involvement of the <italic>CslF</italic>, <italic>CslH</italic>, and <italic>CslJ</italic> families in the synthesis of mixed-linkage glucans has been widely established over the last decades, two members of the barley <italic>CslF</italic> family &#x2013; <italic>HvCslF3</italic> and <italic>HvCslF10</italic> &#x2013; have recently been shown to synthesize (1,4)-&#x03B2;-linked glucoxylans (<xref ref-type="bibr" rid="B41">Little et al., 2019</xref>). This finding can challenge the concept that <italic>CslF</italic>, <italic>CslH</italic>, and <italic>CslJ</italic> families synthesize a single type of polysaccharide. Nevertheless, the <italic>CslF</italic>, <italic>CslH</italic>, and <italic>CslJ</italic> families remain the best studied <italic>Csl</italic> families, while the other <italic>Csl</italic> genes are much less characterized. Some <italic>CslA</italic> genes synthesize mannans and glucomannans (<xref ref-type="bibr" rid="B16">Dhugga et al., 2004</xref>; <xref ref-type="bibr" rid="B40">Liepman et al., 2005</xref>), while certain CslC isoforms mediate xyloglucan biosynthesis (<xref ref-type="bibr" rid="B12">Cocuron et al., 2007</xref>; <xref ref-type="bibr" rid="B31">Kim et al., 2020</xref>). However, the size and diversity of these families suggests that they could also synthesize other hemicellulosic polysaccharides or different forms of mannans and xyloglucans (<xref ref-type="bibr" rid="B39">Liepman and Cavalier, 2012</xref>). <italic>CslD</italic> genes are believed to synthesize the non-crystalline single chains of cellulose in root hairs and pollen tubes (<xref ref-type="bibr" rid="B17">Doblin et al., 2001</xref>; <xref ref-type="bibr" rid="B30">Kim et al., 2007</xref>; <xref ref-type="bibr" rid="B4">Bernal et al., 2008</xref>), but their involvement in mannan synthesis has also been proposed (<xref ref-type="bibr" rid="B59">Verhertbruggen et al., 2011</xref>). In addition, <italic>CslD</italic> mutations affect pollen development (<xref ref-type="bibr" rid="B4">Bernal et al., 2008</xref>), root morphology (<xref ref-type="bibr" rid="B62">Wang et al., 2001</xref>; <xref ref-type="bibr" rid="B30">Kim et al., 2007</xref>; <xref ref-type="bibr" rid="B4">Bernal et al., 2008</xref>; <xref ref-type="bibr" rid="B23">Hu et al., 2018</xref>), and vegetative organ size (<xref ref-type="bibr" rid="B37">Li et al., 2009</xref>; <xref ref-type="bibr" rid="B44">Luan et al., 2011</xref>; <xref ref-type="bibr" rid="B24">Hunter et al., 2012</xref>; <xref ref-type="bibr" rid="B38">Li et al., 2018</xref>) in several plants, but the molecular basis of these effects is unclear. Finally, the function of the evolutionarily-related <italic>CslB</italic>/<italic>E</italic>/<italic>G</italic>/<italic>M</italic> families is currently unknown (<xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). Moreover, as for the <italic>CesA</italic> genes, the role of the genetic architecture of the <italic>Csl</italic> families in determining gene function has not yet been investigated.</p>
<p>The study of the genomic architecture of the <italic>CesA</italic> superfamily in plants and its relationship with the evolution and function of these genes is the focus of this research. To this aim, we performed a combined phylogenetic and synteny analysis (phylogenomic analysis) of the <italic>CesA</italic> superfamily genes from 242 species covering plant evolution from green algae to eudicots. Large-scale phylogenomic analyses are a rather novel approach, but have turned powerful in studying the genetics and the evolution of complex gene families (<xref ref-type="bibr" rid="B69">Zhao et al., 2017</xref>; <xref ref-type="bibr" rid="B29">Kerstens et al., 2020</xref>). Moreover, if coupled with relevant phenotypic and functional data, these analyses can reveal relationships between the genomic architecture of target gene families and phenotypic adaptations of plants (<xref ref-type="bibr" rid="B69">Zhao et al., 2017</xref>; <xref ref-type="bibr" rid="B29">Kerstens et al., 2020</xref>). Large-scale phylogenomic analyses have become feasible thanks to the increasing availability of sequenced plant genomes and the development of bioinformatic tools for their analysis, like network approaches for large-scale synteny computation (<xref ref-type="bibr" rid="B67">Zhao and Schranz, 2017</xref>). The application of these tools to the <italic>CesA</italic> superfamily highlighted interesting patterns in the genomic arrangement of these genes across plants and relevant associations between phylogenomic patterns and key events in the evolution and sub-functionalization of different gene families, including <italic>CesA</italic>, <italic>CslD</italic>, and <italic>CslF</italic> genes. This also led to the formulation of novel hypotheses on the evolution of different <italic>CesA</italic> and <italic>Csl</italic> families.</p>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Genomic Data Sources</title>
<p>All the plant genomes (<italic>sensu lato</italic>) sequenced and published by 2018 and available with a scaffold-level assembly were searched in online databases (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). For each genome, a general feature format (GFF)/browser extensible data (BED) file of gene positions and a protein FASTA file reporting the main protein coded by each gene were retrieved. Moreover, genomes were analysed for completeness and fragmentation by using the BUSCO Viridiplantae gene set (<xref ref-type="bibr" rid="B52">Seppey et al., 2019</xref>) and by assessing the number of scaffolds and the N50 statistics. In total, 242 genomes from 212 species were collected (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). For each species with an available genome, information on its ploidy level and the number of genome duplications were searched online and on scientific literature [see especially <xref ref-type="bibr" rid="B58">Van de Peer et al. (2017)</xref> for information about genome duplications].</p>
</sec>
<sec id="S2.SS2">
<title>Identification of CesA and Csl Genes</title>
<p>A group of 445 protein sequences of known <italic>CesA</italic>/<italic>Csl</italic> genes from 13 species spanning plant diversity were retrieved from literature and used as BLAST queries (<xref ref-type="bibr" rid="B1">Altschul et al., 1990</xref>) against the 242 proteomes of the study (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 2</xref>). In parallel, all the proteins of the 242 genomes were screened for the characteristic PFAM domains of the <italic>CesA</italic> superfamily &#x2013; PF03553 and PF00535 (<xref ref-type="bibr" rid="B42">Little et al., 2018</xref>) &#x2013; using HMMER (<xref ref-type="bibr" rid="B19">Eddy, 2009</xref>; <xref ref-type="bibr" rid="B20">El-Gebali et al., 2019</xref>). The outputs of the BLAST and HMMER searches were merged and filtered to exclude partial sequences not starting with Methionine and/or shorter than the residue length of the PFAM domains annotated onto them (total residues spanned by each PFAM; 170 AAs for PF00535 and 722 AAs for PF03552). The remaining sequences were assigned to <italic>specific CesA</italic>/<italic>Csl</italic> families through a second BLAST against the <italic>CesA</italic>/<italic>Csl</italic> genes from the initial 13 species for which a <italic>specific CesA</italic>/<italic>Csl</italic> function was and by plotting them in a phylogenetic tree to identify wrongly annotated and spurious sequences (R, custom script).</p>
</sec>
<sec id="S2.SS3">
<title>Identification of Primary and Secondary CesA Genes</title>
<p>For the phylogenomic analysis of primary and secondary <italic>CesA</italic> genes, a set of 49 experimentally validated primary and secondary <italic>CesA</italic> sequences was retrieved from literature (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 3</xref>) and used in a BLAST search (<xref ref-type="bibr" rid="B1">Altschul et al., 1990</xref>) against all the <italic>CesA</italic> genes found in the genomes of the study. BLAST results were used to preliminarily categorize all <italic>CesA</italic> genes as primary or secondary <italic>CesAs</italic>. This initial assignment was refined by checking the simultaneous presence of two motifs (CQIC and SVICEXWFA) previously shown to characterize only primary <italic>CesA</italic> proteins in a wide range of plants (<xref ref-type="bibr" rid="B28">Kaur et al., 2016</xref>). Moreover, the position of each <italic>CesA</italic> gene found in the BLAST search, relative to the clades where the initial 49 primary and secondary <italic>CesAs</italic> were placed, was manually inspected in each phylogenetic tree of this study to further help the categorization of primary and secondary <italic>CesA</italic> genes following the BLAST search.</p>
</sec>
<sec id="S2.SS4">
<title>Synteny Network Construction</title>
<p>The synteny network of the <italic>CesA</italic> superfamily was built by following the methodology of <xref ref-type="bibr" rid="B67">Zhao and Schranz (2017)</xref>. In brief, we used Diamond (<xref ref-type="bibr" rid="B5">Buchfink et al., 2015</xref>) to perform BLAST-like alignments of all the proteins of each genome against all the other proteins of that genome and all the proteins of every other genome (Evalue = 1E&#x2013;3). In this way, we identified homologous genes between different species. Subsequently, MCScanX (<xref ref-type="bibr" rid="B63">Wang et al., 2012</xref>) was run with default parameters (except -s &#x2013; the number of colinear genes to claim a syntenic block &#x2013; set to 3) to detect gene synteny (i.e., conserved gene order across multiple genomes). The results of MCScanX were organized in a synteny network, in which each node is a gene and edges represent syntenic connections between genes. The set of edges in which at least one node was a <italic>CesA</italic>/<italic>Csl</italic> gene was extracted from the overall synteny network and represents the <italic>CesA</italic>/<italic>Csl</italic> synteny network (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 4</xref>).</p>
</sec>
<sec id="S2.SS5">
<title>Analysis of Syntenic Communities</title>
<p>The R package igraph and the Multi-level algorithm (<xref ref-type="bibr" rid="B64">Yang et al., 2016</xref>) were used to detect syntenic communities within the <italic>CesA</italic>/<italic>Csl</italic> network (i.e., groups of <italic>CesA</italic>/<italic>Csl</italic> genes displaying a higher degree of synteny with each other than with the rest of the network) formed by at least four nodes. Communities were profiled to assess the type(s) of <italic>CesA</italic>/<italic>Csl</italic> families and species included in each of them, and the gene copy number of each <italic>CesA</italic>/<italic>Csl</italic>-taxa combination was found. Finally, communities were hierarchically clustered based on <italic>CesA</italic>/<italic>Csl</italic> copy number per species (R, hclust function, ward.2 algorithm).</p>
</sec>
<sec id="S2.SS6">
<title>Multiple Sequence Alignment and Phylogenetic Analysis</title>
<p>Phylogenetic trees were built for relevant groups of <italic>CesA</italic>/<italic>Csl</italic> genes. For each tree, full-length CesA/Csl protein sequences were aligned with MAFFT v7.453 (FFT-NS-2 algorithm) (<xref ref-type="bibr" rid="B27">Katoh and Standley, 2013</xref>), with default parameters except gap opening penalty, at 1.0. MAFFT alignments were set and trimmed with TrimAl v1.2 (<xref ref-type="bibr" rid="B8">Capella-Guti&#x00E9;rrez et al., 2009</xref>), with manual optimization of the -gt and -cons flags to obtain alignments lengths similar to the median lengths of the initial proteins included in each tree. Phylogenetic trees were built from trimmed alignments using RAxML v8.2.9 (PROTCATBLOSUM62 substitution matrix; 100 bootstraps) (<xref ref-type="bibr" rid="B55">Stamatakis, 2014</xref>), and plotted and annotated using iTOL (<xref ref-type="bibr" rid="B34">Letunic and Bork, 2019</xref>).</p>
</sec>
<sec id="S2.SS7">
<title>Statistical Analyses</title>
<p>All the statistical analyses (<italic>t</italic>-tests, ANOVAs, LSDs, correlations) described in this research were performed with the Statistical Package for the Social Sciences (SPSS; IBM Corp., Armonk, NY, United States; Version 26.0).</p>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<sec id="S3.SS1">
<title>Genomic Architecture of the CesA Superfamily</title>
<sec id="S3.SS1.SSS1">
<title>Gene Copy Number</title>
<p>We used 242 plant genomes, covering plant evolution from green algae to eudicots (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>), to perform an extensive genomic and synteny analysis of the <italic>CesA</italic> superfamily in plants, of which the first step was the study of its genomic architecture. The 222 plant genomes with a BUSCO representation &#x2265;75% contained 7,997 <italic>CesA</italic> superfamily genes, with an average of 36 members per species (CV = 48.6%; <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 5</xref>; CV: Coefficient of Variation). The copy number of <italic>CesA</italic>/<italic>Csl</italic> genes correlates with both the ploidy level and the number of genome duplications of each species (&#x03C1; = 0.57 and &#x03C1; = 0.55, respectively; <italic>p</italic> &#x003C; 0.001 for both; <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>). Moreover, the <italic>CesA</italic> superfamily size increased considerably during plant evolution, with an acceleration at the rise of angiosperms. In fact, while charophytes, bryophytes, lycophytes, ferns, and gymnosperms contain a relatively similar number of <italic>CesA</italic>/<italic>Csl</italic> genes (&#x223C;18 per species, CV = 36.9%), this figure doubles in angiosperms despite similar inter-species variability (&#x223C;40 <italic>CesA</italic>/<italic>Csl</italic> genes per species, CV = 35.9%, <italic>p</italic> &#x003C; .001; <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 2</xref>).</p>
<p><italic>Cellulose synthase A</italic> is by far the most abundant <italic>CesA</italic>/<italic>Csl</italic> family, with 12 genes per species on average (CV = 43.1%; <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 3</xref>). This is roughly two times the size of the <italic>CslD</italic>, <italic>CslA</italic>, and <italic>CslC</italic> families (<italic>p</italic> &#x003C; 0.001), which all display &#x223C;6 genes per species. These three families are in turn significantly larger than the <italic>CslB</italic>, <italic>CslE</italic>, <italic>CslG</italic>, and <italic>CslM</italic> families (1-3 genes per species each; <italic>p</italic> &#x003C; 0.001). Finally, the monocot-specific <italic>CslF</italic> and <italic>CslH</italic> families typically display 7 and 1 genes per species on average, respectively. Overall, the copy number of all the <italic>CesA</italic>/<italic>Csl</italic> families varies extensively across plant families, with standard deviations often equal to the mean size of gene families across species.</p>
</sec>
<sec id="S3.SS1.SSS2">
<title>Gene Distribution Along Genomes</title>
<p>The relative positions of the <italic>CesA/Csl</italic> genes within each genome of the study were also assessed, highlighting two main patterns (<xref ref-type="supplementary-material" rid="TS1">Supplementary Tables 6</xref>&#x2013;<xref ref-type="supplementary-material" rid="TS1">8</xref>). On the one hand, the <italic>CesA</italic>, <italic>CslA</italic>, <italic>CslC</italic>, and <italic>CslD</italic> families display a scattered genomic distribution among multiple singleton loci. On the other hand, the <italic>CslB</italic>, <italic>CslE</italic>, <italic>CslG</italic>, and <italic>CslM</italic> clades are usually organized in tandem arrays of 2&#x2013;4 genes. These arrays were detected in both monocots and dicots, and synteny analysis showed their extensive conservation across species (see section &#x201C;Gene Synteny&#x201D;). A large conserved gene array was also found for the <italic>CslF</italic> family in all the Poaceae species evaluated [as previously reported by <xref ref-type="bibr" rid="B51">Schwerdt et al. (2015)</xref> for rice, sorghum, Brachypodium and barley only]. However, we also found 20&#x2013;30% of the <italic>CslF</italic> genes of each Poaceae species organized as conserved singletons or tandems at separated loci. Interestingly, this second group of <italic>CslF</italic> genes was syntenic with <italic>CslD</italic> members of several eudicot species (see section &#x201C;Gene synteny&#x201D; and &#x201C;Synteny Reveals the Genomic Trace of CslF Evolution&#x201D;). Finally, the <italic>CslH</italic> and <italic>CslJ</italic> families displayed both configurations &#x2013; tandem arrays and singleton loci. However, these families are present at too low frequencies and in too few species to allow generalizations.</p>
</sec>
<sec id="S3.SS1.SSS3">
<title>Gene Synteny</title>
<p>The syntenic conservation of the <italic>CesA</italic> superfamily was also assessed in detail by using network synteny analysis. Results indicated that <italic>CesA</italic> superfamily genes are highly syntenic across diverse plants, more than what observed for other plant gene families, including regulatory genes. In fact, of the 7,005 <italic>CesA</italic> superfamily genes from the 193 genomes with a BUSCO representation &#x2265;75% and at least five genes per scaffold on average, 6,262 (89%) were included into the <italic>CesA</italic>/<italic>Csl</italic> synteny network (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 9</xref>). This is a remarkably high proportion, 9 and 24% higher than what is found in comparable studies on the <italic>MADS-box</italic> and the <italic>APETALA2</italic> gene families, respectively (<xref ref-type="bibr" rid="B69">Zhao et al., 2017</xref>; <xref ref-type="bibr" rid="B29">Kerstens et al., 2020</xref>). Furthermore, the synteny of the <italic>CesA</italic>/<italic>Csl</italic> genes is dense and extensive across diverse plant species and families. In fact, each <italic>CesA</italic>/<italic>Csl</italic> gene is syntenic to another 69 homologs from 46 different plant species on average. Moreover, the degree of gene synteny does not significantly differ between intra- and inter-family syntenic comparisons (<italic>p</italic> = 0.054, <xref ref-type="fig" rid="F1">Figure 1A</xref>). Remarkably, for angiosperms only, syntenic conservation is even higher between than within plant families (<italic>p</italic> &#x003C; 0.001, <xref ref-type="fig" rid="F1">Figure 1B</xref>) and crosses the boundaries of monocots and dicots. This trend is clearly divergent from the patterns commonly observed for plant genes (<xref ref-type="bibr" rid="B68">Zhao and Schranz, 2019</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>The degree and structure of <italic>Cellulose synthase/Cellulose synthase-like</italic> (<italic>CesA</italic>/<italic>Csl)</italic> gene synteny. Synteny is, overall, high and extensive across plant families, but arranged differently across gene families. <bold>(A)</bold> Distribution of the coefficients representing the average number of syntenic connections per gene in the <italic>CesA</italic>/<italic>Csl</italic> synteny network with other genes from the same (intrafamily) or from different (interfamily) plant species. Coefficients were calculated using data from all the plant families included in the analysis. <bold>(B)</bold> Distribution of the coefficients representing the average number of syntenic connections per gene in the <italic>CesA</italic>/<italic>Csl</italic> synteny network with other genes from the same (intrafamily) or from different (interfamily) plant species. Coefficients were calculated using data from only angiosperm families. <bold>(C)</bold> Total number of syntenic connections detected between gene families within the <italic>CesA</italic> superfamily.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g001.tif"/>
</fig>
<p>While synteny is dense and extensive across different plants, the same does not hold across different <italic>CesA</italic>/<italic>Csl</italic> families, which are organized in separate conserved genomic contexts. Accordingly, 99% of syntenic connections within the <italic>CesA</italic>/<italic>Csl</italic> synteny network takes place between genes from the same <italic>CesA</italic>/<italic>Csl</italic> family (<xref ref-type="fig" rid="F1">Figure 1C</xref>). Moreover, each of the 48 syntenic communities identified by decomposing the <italic>CesA</italic>/<italic>Csl</italic> synteny network in groups of highly syntenic genes typically contains genes from only one <italic>CesA</italic>/<italic>Csl</italic> family (<xref ref-type="fig" rid="F2">Figure 2</xref>). Furthermore, multiple communities were detected for most of the <italic>CesA</italic>/<italic>Csl</italic> families, revealing distinct conserved genomic contexts even at the level of gene subclades within <italic>CesA</italic>/<italic>Csl</italic> families (<xref ref-type="fig" rid="F2">Figure 2</xref>). In this respect, the marked differentiation in the genomic organization of the <italic>CslA</italic> genes between monocots (especially Poaceae) and dicots is noteworthy (group B and parts of Groups A and C, <xref ref-type="fig" rid="F2">Figure 2</xref>). Moreover, syntenic communities for specific plant families were detected for some <italic>CesA</italic> and <italic>CslA</italic> genes within Salicaceae, Fabaceae, and Brassicaceae (Group A of <xref ref-type="fig" rid="F2">Figure 2</xref>). Next to these examples of intra-family sub-organization of gene architecture, we also found communities that cross the boundaries of single <italic>CesA</italic>/<italic>Csl</italic> gene families (Group C of <xref ref-type="fig" rid="F2">Figure 2</xref>). Specifically, two communities, including 580 syntenic connections between <italic>CslF</italic> and <italic>CslD</italic> genes, appeared particularly relevant as they grouped &#x223C;40% of all the syntenic connections involving <italic>CslF</italic> genes (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>The 48 syntenic communities detected within the <italic>CesA/Csl</italic> synteny network. Syntenic communities are groups of genes displaying higher degrees of synteny with each other than with the rest of the network. Therefore, they constitute conserved architectural configurations of genes across genomes of different species. The figure highlights that <italic>CesA/Csl</italic> communities typically harbour genes belonging to the same <italic>CesA/Csl</italic> family, except for two communities grouping <italic>CslF</italic> and <italic>CslD</italic> genes (green dots on the left of the figure). <italic>CesA/Csl</italic> syntenic communities are also divided into three main clusters based on the spread of their genes across the plant kingdom (<bold>A&#x2013;C</bold> panels and dendrogram at the right of the figure). In the figure, rows represent communities while columns represent species. Cells are coloured according to the number of syntenic genes harboured by each community and each species within communities. Row headers on the left side indicate the predominant gene family harboured by each community (&#x003E;90% of the community members).</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g002.tif"/>
</fig>
</sec>
</sec>
<sec id="S3.SS2">
<title>Relationship Between Genomic Architecture and Gene Properties</title>
<sec id="S3.SS2.SSS1">
<title>Distinct Phylogenomic Features for Distinct <italic>CesA</italic> Isoforms</title>
<p>To further characterize the genomic patterns described above and to study their relevance for the evolution and sub-functionalization of the <italic>CesA</italic>/<italic>Csl</italic> genes, we investigated how such patterns relate to phylogenetic and functional gene data. For the <italic>CesA</italic> family, our results revealed striking correspondence between the genomic organization of genes, the syntenic conservation of gene architecture, the phylogenetic relationships between genes, and the diversification of genes into distinct isoforms with different functional properties. Specifically, the <italic>CesA</italic> phylogeny was divided into six main clades supported by moderate-to-high bootstraps, corresponding to the six main groups of CesA isoforms known in <italic>Arabidopsis thaliana</italic> and <italic>Oryza sativa</italic> (<xref ref-type="fig" rid="F3">Figures 3A</xref>, <xref ref-type="fig" rid="F5">5</xref>). These six clades are grouped into two separated super-groups of three clades each, corresponding to the subdivision into primary and secondary CW <italic>CesA</italic> genes (<xref ref-type="fig" rid="F3">Figures 3A</xref>, <xref ref-type="fig" rid="F5">5</xref> and <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 3</xref>). Interestingly, each of the six phylogenetic clades is differentially organized and differentially conserved at the genomic level by spanning only one of the six largest <italic>CesA</italic> syntenic communities found in the <italic>CesA</italic>/<italic>Csl</italic> synteny network (<xref ref-type="fig" rid="F3">Figures 3B,D</xref>). Overall, these six clades and communities span 84% of the syntenic <italic>CesA</italic> genes and cover 93% of the angiosperm families included in the synteny analysis. The correspondence between <italic>CesA</italic> phylogeny and synteny is therefore striking, with only one major exception found for clade 1 of <xref ref-type="fig" rid="F3">Figures 3A,B</xref>, which is subdivided into a main syntenic community largely dicot-specific (community 19 of <xref ref-type="fig" rid="F3">Figure 3B</xref>) and other two smaller communities, of which one is specific to monocots (community 42 of <xref ref-type="fig" rid="F3">Figure 3B</xref>) and one is a group of diverse <italic>CesA</italic> members (community 21 of <xref ref-type="fig" rid="F3">Figure 3B</xref>). Besides this clade, only few other minor groups of genes specific to certain plant families deviate from the common phylogenetic and syntenic structure described above by being organized in distinct genomic contexts. Examples are a Fabaceae-specific community from clade 6 of <xref ref-type="fig" rid="F3">Figure 3B</xref> (community 22 of <xref ref-type="fig" rid="F3">Figure 3B</xref>), a Brassicaceae-specific community from clade 3 of <xref ref-type="fig" rid="F3">Figure 3B</xref> (community 13 of <xref ref-type="fig" rid="F3">Figure 3B</xref>), and a community from clade 4 of <xref ref-type="fig" rid="F3">Figure 3B</xref> specific to Brassicaceae and Malvaceae (community 7 of <xref ref-type="fig" rid="F3">Figure 3B</xref>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>The phylogenomic structure of the <italic>CesA</italic> family. <bold>(A)</bold> The phylogenetic tree of all the <italic>CesA</italic> genes from the 193 genomes with a BUSCO representation &#x2265;75% and at least five genes per scaffold on average. The tree highlights the subdivision of the <italic>CesA</italic> sequences into six main phylogenetic groups corresponding to known primary and secondary cell wall <italic>CesA</italic> isoforms from different species. Rings A and B, respectively, represent the species taxonomy of the different genes (legend at the bottom of the figure) and the tree position of 49 experimentally validated primary (blue) and secondary (red) cell wall <italic>CesA</italic> genes. Circles around tree branches highlight the six major <italic>CesA</italic> clades detected in the tree, divided into three primary (blue) and three secondary (red) cell wall <italic>CesA</italic> clades. <bold>(B)</bold> The phylogenetic tree of all the <italic>CesA</italic> genes as in panel A, with highlighted the syntenic connections between genes and the different syntenic communities detected. Lines connecting tree leaves indicate syntenic connections between genes, with different colours highlighting different syntenic communities. Numbers in grey circles refer to gene clades as in panel A, while numbers on coloured syntenic connections indicate community numbers as referred in the article. The figure shows that the six major <italic>CesA</italic> clades of panel A are arranged into distinct conserved architectural genomic configurations (or genomic contexts). <bold>(C)</bold> Highlight of the syntenic connections between clades 2 (pink; primary cell wall <italic>CesA</italic> genes) and 4 (blue; secondary cell wall <italic>CesA</italic> genes). The pink links between the two phylogenetic clades indicate secondary cell wall <italic>CesA</italic> genes placed in genomic contexts typical of primary cell wall <italic>CesAs</italic>. These sequences come mostly from the Malvaceae family, and specifically from the three cotton species therein: <italic>G. hirsutum</italic>, <italic>G. arboreum</italic>, and <italic>G. raimondii</italic>. In <italic>G. hirsutum</italic>, the species involved in these connections were shown to be at the basis of the massive fibre deposition observed in cotton. <bold>(D)</bold> Total number of syntenic connections detected between the six clades of <italic>CesA</italic> genes as represented in the tree of panel A. Row and column headers refer to the six clades of the tree in panel A.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g003.tif"/>
</fig>
</sec>
<sec id="S3.SS2.SSS2">
<title>A Genomic Context Shift Associated to Specific <italic>CesA</italic> Properties in Cotton</title>
<p>Overall, the six different <italic>CesA</italic> phylogenomic clades are genomically independent from each other, with seldom inter-clade syntenic connections (<xref ref-type="fig" rid="F3">Figures 3B,D</xref>). An exception is represented by the 236 syntenic links between clades 2 (primary CW <italic>CesAs</italic>) and 4 (secondary CW <italic>CesAs</italic>) of <xref ref-type="fig" rid="F3">Figures 3B,C</xref>. A minor part of these connections is spread across different angiosperm families and can be regarded as background noise of the synteny analysis. However, 194 connections specifically involve the Malvaceae family and three cotton species therein: <italic>G. hirsutum</italic>, <italic>G. arboreum</italic>, and <italic>G. raimondii</italic>. These connections mainly involve three primary and five secondary cotton <italic>CesA</italic> genes from branches 2 and 4, respectively, which are syntenic to a total of 104 <italic>CesA</italic> genes from 78 different angiosperm genomes placed in opposite phylogenetic groups (branch 4 for the cotton <italic>CesAs</italic> from branch 2, and branch 2 for the cotton <italic>CesAs</italic> on branch 4) (<xref ref-type="fig" rid="F3">Figure 3C</xref>). This suggests that the cotton secondary CW <italic>CesA</italic> genes involved in these connections changed their genomic position relative to the other angiosperm genes from the same phylogenetic group, ending up in a genomic context typical of primary CW <italic>CesAs</italic> (clade 2 of <xref ref-type="fig" rid="F3">Figures 3B,C</xref>).</p>
<p>Remarkably, the <italic>CesA</italic> genes of <italic>G. hirsutum</italic> involved in the genomic inversion turned out to be associated with distinct functional properties at the basis of the massive fibre production of cotton. These are three different <italic>G. hirsutum CesA</italic> members: an already known <italic>CesA7</italic>, an already known <italic>CesA8</italic>, and a previously uncharacterized homolog of <italic>GhCesA8</italic> (99.6% sequence identity) (<xref ref-type="supplementary-material" rid="TS1">Supplementary Tables 10</xref>, <xref ref-type="supplementary-material" rid="TS1">11</xref>). Interestingly, these three isoforms have been proved to assemble together into functional rosettes for cellulose deposition in <italic>G. hirsutum</italic> fibre cells, with <italic>CesA8</italic> specifically acting as enhancer for massive fibre deposition (<xref ref-type="bibr" rid="B35">Li et al., 2016</xref>, <xref ref-type="bibr" rid="B36">2013</xref>). In addition, the biological mechanism of fibre synthesis involving <italic>G. hirsutum CesA7</italic> and <italic>CesA8</italic> appears conserved across different cotton species, including the ones included in our analysis and showing the same genomic inversion (<xref ref-type="bibr" rid="B36">Li et al., 2013</xref>). Accordingly, all the secondary CW CesA isoforms from <italic>G. arboreum</italic> and <italic>G. raimondii</italic> that are involved in the genomic context shift share 95.6% sequence identity with the <italic>G. hirsutum CesA7</italic> and <italic>CesA8</italic> for which functional data were available in scientific literature (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 11</xref>). Overall, these results suggest that the genomic positioning and organization of <italic>CesA</italic> genes might be critical for determining gene function, representing an important factor at the basis of cotton fibre deposition.</p>
</sec>
<sec id="S3.SS2.SSS3">
<title>Differential Genomic Contexts for Different Gene Functions in the <italic>CslD</italic> Family</title>
<p>Striking correspondence between genomic organization, syntenic conservation, phylogenetic relationships, and functional diversification was also found within the <italic>CslD</italic> family. This family is structured into four main phylogenetic clades (<xref ref-type="fig" rid="F4">Figure 4A</xref>). In addition, previous studies demonstrated the involvement of different <italic>CslD</italic> members into three main plant processes &#x2013; pollen development, determination of vegetative organ size, and root hair formation &#x2013; across several species (<xref ref-type="bibr" rid="B62">Wang et al., 2001</xref>; <xref ref-type="bibr" rid="B30">Kim et al., 2007</xref>; <xref ref-type="bibr" rid="B4">Bernal et al., 2008</xref>; <xref ref-type="bibr" rid="B37">Li et al., 2009</xref>; <xref ref-type="bibr" rid="B44">Luan et al., 2011</xref>; <xref ref-type="bibr" rid="B24">Hunter et al., 2012</xref>; <xref ref-type="bibr" rid="B23">Hu et al., 2018</xref>; <xref ref-type="bibr" rid="B38">Li et al., 2018</xref>). Strikingly, the functional subdivision of the <italic>CslD</italic> members overlapped with the main phylogenetic <italic>CslD</italic> clades (<xref ref-type="fig" rid="F4">Figure 4</xref>). In fact, known <italic>CslD</italic> genes involved in determination of vegetative organ size and root hair formation were only found within clades 1 and 4, respectively, while <italic>CslD</italic> members known to be involved in pollen development were only observed in clades 2 and 3 (<xref ref-type="fig" rid="F4">Figure 4</xref>). In turn, the different phylogenetic and functional <italic>CslD</italic> groups corresponded to distinct and highly conserved <italic>CslD</italic> syntenic communities, highlighting their independent organization and conservation across plants (<xref ref-type="fig" rid="F4">Figure 4B</xref>). However, while one syntenic community was found for each of the phylogenetic clades grouping <italic>CslD</italic> genes involved in organ size determination and root hair formation, respectively, multiple communities specific to different taxonomic groups were detected within the <italic>CslD</italic> branch associated to pollen development (<xref ref-type="fig" rid="F4">Figure 4</xref>). Of these, one is broad and spans both monocot and dicot species (community 39), one is large but restricted to dicots (community 41), and four are minor and restricted to specific taxonomic clades.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>The phylogenomic structure of the <italic>CslD</italic> family. The trees highlight the association between the phylogenetic and the conserved (syntenic) architectural structure of the family, and also the functional diversity of the <italic>CslD</italic> genes. <bold>(A)</bold> The phylogenetic tree of all the <italic>CslD</italic> genes from the 193 genomes with a BUSCO representation &#x2265;75% and at least five genes per scaffold on average. Ring A represents the species taxonomy of the different genes (legend at the bottom of the figure). Ring B represents the tree position of several <italic>CslD</italic> genes known to be involved in the determination of vegetative organ size (green), pollen development (orange), or root hair formation (brown). Circles around tree branches highlight the four major <italic>CslD</italic> clades detected in the tree. Bootstrap values supporting clades subdivisions are reported. <bold>(B)</bold> Syntenic structure of the four <italic>CslD</italic> phylogenetic clades. Lines connecting tree leaves indicate syntenic connections between genes, with different colours highlighting different syntenic communities. Numbers in black circles refer to the four clades of panel A, while numbers on syntenic connections indicate community numbers as referred in the article.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g004.tif"/>
</fig>
<p>While the subdivision of <italic>CslD</italic> genes described above is shared by all angiosperms, the same does not hold for earlier land plants. In fact, <italic>CslD</italic> members from bryophytes and lycophytes are found in only a single phylogenetic group placed between the <italic>CslD</italic> branches associated to organ size determination and pollen development. Moreover, fern <italic>CslDs</italic> are divided into two groups that are closest to angiosperm <italic>CslD</italic> members involved in organ size determination and root hair formation, respectively. Finally, <italic>CslD</italic> genes from bryophytes, lycophytes, ferns, and gymnosperms do not display any syntenic connection with the angiosperm ones.</p>
</sec>
<sec id="S3.SS2.SSS4">
<title>Phylogenomic Patterns in Other <italic>Csl</italic> Families</title>
<p>Commonalities between phylogenetic, syntenic, and functional patterns were also found in other <italic>Csl</italic> families. However, for the evolutionarily close <italic>CslB</italic>/<italic>H</italic>/<italic>G</italic>/<italic>J</italic>/<italic>M</italic>/<italic>E</italic> families, such commonalities are observed for whole families rather than gene subclades within families (<xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 4</xref>), with deviations observed only in a <italic>CslM</italic> syntenic community specific to Fabaceae (community 23 of <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 4</xref>) and a small <italic>CslE</italic> community with only <italic>Capsicum</italic> genes (community 17 of <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 4</xref>). Finally, <italic>CslA</italic> and <italic>CslC</italic> genes also display syntenic suborganization of their phylogenies (<xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 5</xref>). However, the functional information available for these families are not abundant and do not reveal any clear correspondence with the phylogenetic and/or syntenic structure observed.</p>
</sec>
</sec>
<sec id="S3.SS3">
<title>Evolutionary Dynamics of Specific Gene Families</title>
<sec id="S3.SS3.SSS1">
<title>Evolution of the <italic>CesA</italic> Genes</title>
<p>Phylogenetic and synteny data were used to also study the evolution of the main <italic>CesA</italic>/<italic>Csl</italic> families. For the <italic>CesA</italic> genes, the taxonomic profiling of the six main <italic>CesA</italic> clades (see section &#x201C;Distinct Phylogenomic Features for Distinct CesA Isoforms&#x201D;) revealed that <italic>CesA</italic> sequences of bryophytes and lycophytes are phylogenetically closest to the primary CW <italic>CesA</italic> branches, and specifically to the primary CW <italic>CesA</italic> genes homolog to the redundant Arabidopsis <italic>CesA2/5/6/9</italic> and the <italic>O. sativa CesA3/5/6</italic> (clade 1 of <xref ref-type="fig" rid="F5">Figure 5</xref>). This observation suggests that primary CW <italic>CesAs</italic> are the oldest <italic>CesA</italic> sequences of land plants, and that early land plants likely only had primary CW <italic>CesA</italic> (-like) genes. The positioning of fern <italic>CesAs</italic> suggests that the diversification of <italic>CesA</italic> genes toward the six main angiosperm clades described in section &#x201C;Distinct Phylogenomic Features for Distinct CesA Isoforms&#x201D; started sometime during the late lycophyte or early fern evolution. In fact, while a group of fern <italic>CesAs</italic> is also positioned close to the bryophyte and lycophyte sequences of clade 1 of <xref ref-type="fig" rid="F5">Figure 5</xref>, another group of fern genes precede the split of the other two clades of angiosperm primary CW <italic>CesAs</italic> (clades 2 and 3 of <xref ref-type="fig" rid="F5">Figure 5</xref>). Moreover, a third group of fern <italic>CesAs</italic> is closest to the three groups of angiosperms secondary CW <italic>CesAs</italic> (clades 4, 5, and 6 of <xref ref-type="fig" rid="F5">Figure 5</xref>), just preceding their diversification. Therefore, <italic>CesAs</italic> diversification toward the different phylogenomic groups observed in higher plants significantly progressed during fern evolution. However, gymnosperms are the first group of plants whose <italic>CesA</italic> sequences are found in all the six <italic>CesA</italic> clades genomically conserved across all the angiosperms. To conclude, data on gene copy-number within each of the six <italic>CesA</italic> clades also support the evolutionary model discussed. In fact, the early-diverging clade 1 contains by far the most <italic>CesA</italic> sequences in both <xref ref-type="fig" rid="F3">Figures 3A</xref>, <xref ref-type="fig" rid="F5">5</xref> (771 sequences in the full <italic>CesA</italic> tree of <xref ref-type="fig" rid="F3">Figure 3A</xref>), followed by clades 2 and 3 of primary CW <italic>CesAs</italic> (394 genes on average, CV = 4.3%), and finally by the three secondary CW <italic>CesA</italic> clades (273 genes each on average, CV = 6.3%). These data, together with the presence of redundant Arabidopsis <italic>CesA</italic> copies within two primary CW <italic>CesA</italic> clades (1 and 2 of <xref ref-type="fig" rid="F5">Figure 5</xref>), agree with the view that oldest <italic>CesA</italic> families evolved first into multiple isoforms, of which some underwent full sub-functionalization within the cellulose synthesis machinery [a model known as constructive neutral evolution, recently advanced by <xref ref-type="bibr" rid="B22">Haigler and Roberts (2019)</xref> for the <italic>CesA</italic> genes &#x2013; see discussion].</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Phylogenetic tree of a subset of 151 <italic>CesA</italic> genes including sequences from model plant species, from species where primary and secondary cell wall <italic>CesA</italic> genes have been experimentally validated (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 3</xref>), along with all gymnosperm, fern, lycophyte, bryophyte, and charophyte <italic>CesA</italic> sequences. The tree shows that early-diverging plant sequences are only occurring closest to primary cell wall <italic>CesA</italic> branches (especially clade 1). Moreover, it highlights the subdivision of the <italic>CesA</italic> sequences into the six main phylogenetic groups corresponding to known primary and secondary cell wall CesA isoforms from different species. The inner ring indicates taxonomy, while the outer ring highlights the position of experimentally validated primary and secondary cell wall <italic>CesA</italic> genes (see legend at the bottom of the figure). Coloured clades around branches indicate the six main <italic>CesA</italic> clades, numbered in the same way as in <xref ref-type="fig" rid="F3">Figure 3</xref>. Blue clades indicate primary cell wall <italic>CesA</italic> genes, while red clades indicate secondary cell wall <italic>CesA</italic> sequences. Bootstraps supporting clade subdivisions are reported. Yellow dots indicate the position of the 10 <italic>A. thaliana CesA</italic> genes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g005.tif"/>
</fig>
</sec>
<sec id="S3.SS3.SSS2">
<title>Synteny Reveals the Genomic Trace of <italic>CslF</italic> Evolution</title>
<p>In the <italic>CslF</italic> family, our phylogenomic analysis detected 580 syntenic connections between the <italic>CslF</italic> genes organized as singletons or tandems in the Poaceae genomes (see section &#x201C;Gene Distribution Along Genomes&#x201D;) and a subset of <italic>CslD</italic> members, all included within the same syntenic community (community 35 in <xref ref-type="fig" rid="F6">Figure 6</xref> and green dots in <xref ref-type="fig" rid="F2">Figure 2</xref>). Interestingly, 327 of these connections involve <italic>CslD</italic> genes from 53 different eudicot species (27 different plant families) that display simultaneous synteny with a grass <italic>CslD</italic> gene and a grass <italic>CslF</italic> member in 17 of the 21 Poaceae genomes of our study (<xref ref-type="fig" rid="F6">Figure 6B</xref>). The frequency of these connections is negatively correlated with the evolutionary distance of the species harbouring &#x201C;seed&#x201D; <italic>CslD</italic> sequences from Poaceae (<italic>r</italic> = &#x2013;0.57). Moreover, the &#x201C;seed&#x201D; <italic>CslD</italic> genes all belong to the <italic>CslD</italic> phylogenomic clade grouping <italic>CslD</italic> genes involved in root hair formation (see section &#x201C;Gene Distribution Along Genomes&#x201D;). All together, these observations reveal the genomic trace of the <italic>CslF</italic> origin in grasses. On the one hand, they confirm that the <italic>CslF</italic> genes originated through the duplication of some <italic>CslD</italic> members (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). On the other hand, they show that the <italic>CslF</italic> family is nested within the <italic>CslD</italic> one (and specifically within the <italic>CslD</italic> clade involved in root hair formation). This in turn proves that the <italic>CslF</italic> genes took origin when the <italic>CslD</italic> family was already formed, thanks to a relaxed evolutionary pressure on duplicated <italic>CslD</italic> members in grasses. To conclude, it is noteworthy that no synteny was detected between the <italic>CslF</italic> genes syntenic to <italic>CslDs</italic> (red arrows in <xref ref-type="fig" rid="F6">Figure 6</xref>) and the other <italic>CslF</italic> members included in community 43 (<xref ref-type="fig" rid="F6">Figure 6</xref>), which form the conserved <italic>CslF</italic> genomic array (see section 2.1.2). The fact that only some <italic>CslF</italic> genes &#x2013; corresponding to one of the two architectural configurations of this family &#x2013; display synteny with <italic>CslD</italic> members questions whether the <italic>CslF</italic> family originated through a single <italic>CslD</italic> duplication, or if the family underwent different rounds of expansion in Poaceae. In this respect, the phylogenomic positioning of the barley <italic>CslF</italic> members recently shown to synthesize (1,4)-&#x03B2;-linked glucoxylans was checked in the tree of <xref ref-type="fig" rid="F6">Figure 6A</xref> (blue arrow). Interestingly, both <italic>HvCslF3</italic> and <italic>HvCslF10</italic> are positioned within the phylogenetic clade spanned by community 43, which does not display synteny with <italic>CslD</italic> members.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>The 1:2 syntenic connections between a group of angiosperm <italic>CslD</italic> genes and a class of Poaceae <italic>CslD</italic> and <italic>CslF</italic> genes. <bold>(A)</bold> Phylogenetic tree of all the <italic>CslD</italic> and <italic>CslF</italic> genes from the 193 genomes with a BUSCO representation &#x2265;75% and at least five genes per scaffold on average. The outer ring highlights <italic>CslD</italic> and <italic>CslF</italic> tree branches. Red arrows indicate the <italic>CslF</italic> genes syntenic to <italic>CslD</italic> isoforms involved in root morphology (same syntenic community as in <xref ref-type="fig" rid="F4">Figure 4</xref>). Blue arrow indicates the positioning of <italic>HvCslF3</italic> and <italic>HvCslF10</italic>, the two <italic>CslF</italic> genes responsible for the synthesis of 1,4-&#x03B2;-linked glucoxylan in barley. <bold>(B)</bold> Table reporting examples of syntenic connections between three <italic>CslD</italic> genes from three different angiosperm species and both <italic>CslF</italic> and <italic>CslD</italic> genes across different Poaceae species.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpls-13-870818-g006.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<sec id="S4.SS1">
<title>The Genomic Architecture and the Evolution of the CesA Superfamily</title>
<p>In section &#x201C;Genomic Data Sources&#x201D;, we showed that the genomic architecture of the <italic>CesA</italic> superfamily varies considerably in terms of gene copy number, gene distribution along genomes, and gene synteny across different plant and gene families. These results provide insight into the timing and modes of <italic>CesA</italic> superfamily evolution. Specifically, a key finding is that the <italic>CesA</italic>/<italic>Csl</italic> copy number variation correlates with the evolutionary timing of the <italic>CesA</italic> superfamily. Accordingly, we found that <italic>CesA</italic> is the largest <italic>CesA</italic>/<italic>Csl</italic> family, followed by the <italic>CslD/F</italic>, <italic>CslA</italic>, and <italic>CslC</italic> families, and finally by the <italic>CslB</italic>/<italic>H</italic>, <italic>CslG</italic>/<italic>M</italic>, and <italic>CslE</italic> genes. These groups respectively represent the oldest, the intermediate, and the most recent <italic>CesA</italic>/<italic>Csl</italic> families in evolutionary terms (<xref ref-type="bibr" rid="B3">Banasiak, 2014</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). Moreover, we also found positive correlation between <italic>CesA</italic>/<italic>Csl</italic> copy number and whole genome duplications (WGDs) across species, suggesting that WGDs had a prominent role in driving <italic>CesA</italic>/<italic>Csl</italic> expansion and diversification. All together, these observations imply that novel duplicated <italic>CesA</italic>/<italic>Csl</italic> genes were typically retained by positive selection, contributing to expand, diversify, and sub-functionalize the <italic>CesA</italic> superfamily. Interestingly, <xref ref-type="bibr" rid="B51">Schwerdt et al. (2015)</xref> analysed nucleotide substitution rates of <italic>CesA</italic>/<italic>Csl</italic> genes from four grasses and found evidence of past positive selection for several of them followed by genomic stabilization, supporting our conclusion.</p>
<p>Since positive selection takes typically place when recently-duplicated genes provide adaptive advantages (<xref ref-type="bibr" rid="B15">Demuth and Hahn, 2009</xref>; <xref ref-type="bibr" rid="B25">Jensen and Bachtrog, 2010</xref>), it is relevant to investigate the advantages brought by novel <italic>CesA/Csl</italic> genes to plants. In this respect, we demonstrated that the <italic>CesA</italic> and <italic>CslD</italic> diversification accompanied sub-functionalization into primary and secondary <italic>CesA</italic> isoforms and different <italic>CslD</italic> functional groups (i.e., pollen development, determination of vegetative organ size, and root hair formation), respectively. Moreover, conserved phylogenomic patterns were found across all the main <italic>CesA/Csl</italic> families. We therefore hypothesize that the expansion and diversification of the <italic>CesA</italic> superfamily sustained the increased cell wall diversity during plant evolution, which in turn drove the evolution of land plants (<xref ref-type="bibr" rid="B50">Sarkar et al., 2009</xref>; <xref ref-type="bibr" rid="B53">S&#x00F8;rensen et al., 2010</xref>). In fact, cellulose and hemicelluloses have been fundamental for both the rise of land plants (by serving as sinks of photosynthesized carbohydrates and by providing mechanical support) and the rise of vascular and flowering plants (by modulating cell wall composition for vascularization, evolution of flowers and fruits, and pathogen resistance) (<xref ref-type="bibr" rid="B50">Sarkar et al., 2009</xref>). Remarkably, while all the major <italic>CesA</italic>/<italic>Csl</italic> families have members in basal land plants, the largest expansion of all the families was observed at the rise of flowering plants, fitting the evolutionary model just discussed.</p>
<p>The importance of <italic>CesA</italic>/<italic>Csl</italic> diversification in higher plants evolution was also highlighted by synteny data. Specifically, a consistent part of <italic>CesA</italic>/<italic>Csl</italic> syntenic communities is conserved across most angiosperms (Group C in <xref ref-type="fig" rid="F2">Figure 2</xref>), suggesting genomic fixation of the successful diversification of <italic>CesA</italic>/<italic>Csl</italic> families into a highly conserved genomic background. This is remarkable, since the synteny of most angiosperm genes is typically lineage-specific (<xref ref-type="bibr" rid="B68">Zhao and Schranz, 2019</xref>). On the other hand, the fact that lineage-specific syntenic communities were nonetheless found (Groups A and B in <xref ref-type="fig" rid="F2">Figure 2</xref>) demonstrates that <italic>CesA</italic>/<italic>Csl</italic> evolution is still dynamic and novel evolutionary trajectories take place on top, rather than in replacement of the common and stabilized genomic background. Lineage-specific gene arrangements may thus contribute to CW variability across plants. In this sense, the divergent genomic organization found in <italic>CslA</italic> genes between monocots and eudicots appears particularly marked, and we hypothesize that this could be at the basis of the profound differences in mannan and glucomannan content between these groups of plants (<xref ref-type="bibr" rid="B60">Vogel, 2008</xref>). This is further supported by the absence of clear differences between monocots and eudicots in terms of mere <italic>CslA</italic> copy number or gene sequence variability.</p>
</sec>
<sec id="S4.SS2">
<title>Conserved Genomic Contexts in the CesA Superfamily: Biological Implications</title>
<p>A major finding of this research is the association between phylogenetic patterns, conservation (synteny) of such patterns, and gene sub-functionalization found in the <italic>CesA</italic> and <italic>CslD</italic> families. Moreover, similar patterns were detected in other <italic>Csl</italic> families, suggesting that such associations might also hold for other genes, but the absence of sufficient functional data hampered their detection. These observations question whether synteny is merely a trace of past evolutionary dynamics of <italic>CesA</italic>/<italic>Csl</italic> genes or if it can proactively contribute to determine gene function. In this respect, several elements support the second alternative. First, the exceptional extensiveness of <italic>CesA</italic>/<italic>Csl</italic> synteny compared to the typical lineage-specific synteny of angiosperm genes (<xref ref-type="bibr" rid="B68">Zhao and Schranz, 2019</xref>) suggests that selective constraints act against the positional genomic reshuffling of the <italic>CesA</italic> superfamily. In turn, this may indicate that synteny contributes to determine and conserve gene functions and adaptations, as hypothesized for other highly conserved gene families (<xref ref-type="bibr" rid="B69">Zhao et al., 2017</xref>; <xref ref-type="bibr" rid="B29">Kerstens et al., 2020</xref>). Furthermore, if this hypothesis is correct, positional constraints would likely be represented by tight mechanisms of gene regulation based on (epi)genetic properties variable along genomes and potentially disrupted by genome reshuffling (<xref ref-type="bibr" rid="B21">Gould et al., 2018</xref>; <xref ref-type="bibr" rid="B14">Crow et al., 2020</xref>; <xref ref-type="bibr" rid="B32">Lai et al., 2020</xref>). In this respect, several studies showed that cell wall biosynthesis is indeed hierarchically tightly regulated by the concerted action of complex plant gene networks (<xref ref-type="bibr" rid="B56">Taylor-Teeples et al., 2015</xref>; <xref ref-type="bibr" rid="B48">Rao and Dixon, 2018</xref>; <xref ref-type="bibr" rid="B66">Zhang et al., 2018</xref>). In addition, some <italic>CesA</italic> genes were shown to be targets of defined loops within such networks (<xref ref-type="bibr" rid="B56">Taylor-Teeples et al., 2015</xref>). Therefore, positional reshuffling of the <italic>CesA</italic>/<italic>Csl</italic> genes may significantly affect the deposition of cellulose and hemicellulosic polymers, which is strikingly what we found for the cotton secondary <italic>CesA</italic> genes transposed to conserved genomic contexts typical of primary <italic>CesAs</italic> in other plant species. All together, these observations support a central role for the <italic>CesA</italic> superfamily genomic architecture in determining gene function. However, given the fragmented evidence across the different <italic>CesA</italic>/<italic>Csl</italic> families &#x2013; which is mostly due to the scarce functional characterization of several <italic>Csl</italic> genes &#x2013; such roles should be better investigated in the future.</p>
</sec>
<sec id="S4.SS3">
<title>The Evolution of the CesA Family</title>
<p>Based on the phylogenomic analyses, we proposed a novel evolutionary model for the <italic>CesA</italic> family, which hypothesizes that primary CW <italic>CesAs</italic> evolved before the secondary CW <italic>CesAs</italic> (<xref ref-type="fig" rid="F5">Figure 5</xref>). These findings are in contrast with previous studies that claimed secondary <italic>CesA</italic> genes as the oldest ones by observing the primary CW <italic>CesAs</italic> phylogenetically nested within them (<xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>). In our case, all the <italic>CesA</italic> phylogenetic trees (<xref ref-type="fig" rid="F3">Figures 3</xref>, <xref ref-type="fig" rid="F5">5</xref>) positioned bryophyte and lycophyte <italic>CesA</italic> sequences within primary CW <italic>CesA</italic> branches (specifically clade 1 of <xref ref-type="fig" rid="F3">Figures 3</xref>, <xref ref-type="fig" rid="F5">5</xref>), providing strong evidence in support of our model. Furthermore, the currently most-accepted framework of plant and cell wall evolution also agrees with our model. First, the detection of three groups of primary and three groups of secondary CesA isoforms only in gymnosperms and angiosperms agree with the pivotal role of secondary CW <italic>CesAs</italic> in the evolution of tall stems, wood, and vessels (<xref ref-type="bibr" rid="B54">Speck and Burgert, 2011</xref>; <xref ref-type="bibr" rid="B13">Cosgrove, 2012</xref>). Second, the fact that primary CWs alone sufficed the developmental needs of bryophytes and lycophytes (<xref ref-type="bibr" rid="B50">Sarkar et al., 2009</xref>) agrees with the finding of only primary <italic>CesA</italic> (-like) genes in these plants. Third, the intermediate evolutionary stage of ferns with a group of sequences clustered within clade 1 of <xref ref-type="fig" rid="F5">Figure 5</xref>, another group preceding the split of the other two primary <italic>CesA</italic> clades, and a final group basal to the division of the three secondary CW <italic>CesA</italic> clades agree with the finding of <italic>ancestral</italic> conductive vessels in these species, with marked differences from the conductive vessels of angiosperms (<xref ref-type="bibr" rid="B9">Carlquist and Schneider, 2001</xref>; <xref ref-type="bibr" rid="B50">Sarkar et al., 2009</xref>). To conclude, the overall positioning of bryophyte/lycophyte, fern, and gymnosperm/angiosperm genes in the trees of <xref ref-type="fig" rid="F3">Figures 3A</xref>, <xref ref-type="fig" rid="F5">5</xref> suggests that the <italic>CesA</italic> family evolved over a long time, with the likely recurrent stabilization of its members into different intermediate diversified forms. Interestingly, this agrees with the most recent hypotheses on the evolution of the Cellulose Synthase Complexes (CSCs), which state that the hetero-oligomeric CSCs of higher plants (i.e. formed by different CesA isoforms) arose through constructive neutral evolution of ancestral homo-oligomeric complexes (CSCs of chlorophytes are formed by linear combination of interchangeable CesA proteins) pushed to differentiation by the diversification of their CesA subunits (<xref ref-type="bibr" rid="B22">Haigler and Roberts, 2019</xref>). While the sequence diversity of our bryophyte and lycophyte CesA sequences does not allow us to conclude whether they represent a single isoform, their positioning basal to the primary CW <italic>CesA</italic> branch in <xref ref-type="fig" rid="F3">Figures 3A</xref>, <xref ref-type="fig" rid="F5">5</xref> does not exclude this possibility. Therefore, the real morphology of CSCs in early-diverging land plants should be better elucidated in the future to answer the latter question. However, we think that the model of <xref ref-type="bibr" rid="B22">Haigler and Roberts (2019)</xref> overall agrees with the phylogenomic patterns observed in this research and further supports our hypotheses.</p>
</sec>
<sec id="S4.SS4">
<title>The Evolutionary Relationship Between the CslD and CslF Families</title>
<p>The syntenic relationships between some <italic>CslD</italic> and <italic>CslF</italic> genes represent a last important finding of our research. Specifically, while the origin of the <italic>CslF</italic> family through the duplication of <italic>CslD</italic> members had been already reported based on phylogenetics (<xref ref-type="bibr" rid="B65">Yin et al., 2009</xref>; <xref ref-type="bibr" rid="B51">Schwerdt et al., 2015</xref>; <xref ref-type="bibr" rid="B42">Little et al., 2018</xref>), our results unveiled the genomic trace of this evolutionary link. This trace lays in the 1:2 syntenic connections found between a set of eudicot <italic>CslD</italic> members and both <italic>CslD</italic> and <italic>CslF</italic> genes from Poaceae. Since synteny involves both eudicot and grass <italic>CslD</italic> members, our results demonstrate that <italic>CslF</italic> genes originated when the <italic>CslD</italic> family was already formed, thanks to the relaxed selection pressure on duplicated grass <italic>CslD</italic> copies. Moreover, since synteny between <italic>CslD</italic> and <italic>CslF</italic> genes is found across all the Poaceae species in our analyses, the duplication at the origin of the <italic>CslF</italic> family may be shared by all the Poaceae. Therefore, assuming that a WGD event was responsible for this, such event could be the so-called &#x03C1; WGD, which is at the origin of the whole Poaceae lineage and took place about 56&#x2013;70 Mya (<xref ref-type="bibr" rid="B11">Clark and Donoghue, 2018</xref>; <xref ref-type="bibr" rid="B33">Lee et al., 2020</xref>). To conclude, it remains unclear whether the <italic>CslD</italic> duplication described in this research originated all or only a part of the <italic>CslF</italic> family. In fact, the <italic>CslF</italic> genes syntenic to <italic>CslD</italic> members are the only ones genomically organized as singletons or in small tandem arrays, while no synteny between <italic>CslF</italic> and <italic>CslD</italic> genes was detected for the <italic>CslF</italic> members arranged into the large conserved <italic>CslF</italic> array (see section 2.1.2). Interestingly, such large conserved <italic>CslF</italic> array contains the two barley <italic>CslF</italic> genes that were recently found to synthesize a modified form of 1,4-linked polysaccharides termed (1,4)-&#x03B2;-linked glucoxylans (<xref ref-type="bibr" rid="B41">Little et al., 2019</xref>). Altogether, these observations indicate that (different parts of) the <italic>CslF</italic> family may have formed or expanded repeatedly during grass evolution, but the data produced in this research fail to highlight the specific modes of such evolution. Moreover, as for the (1,4)-&#x03B2;-linked glucoxylans of barley, these dynamic evolutionary patterns involving differential genomic arrangement of homologous genes may be at the basis of the formation of novel cell wall polysaccharides. However, the presence of this polysaccharide and of <italic>CslF</italic> genes responsible for its synthesis in species other than barley should be confirmed in the future to fully support the statement above. In conclusion, since the <italic>CslF</italic> genes syntenic with the <italic>CslD</italic> members are the phylogenetically oldest <italic>CslFs</italic> (<xref ref-type="fig" rid="F6">Figure 6</xref>), we can confidently state that the <italic>CslD</italic> duplication was the first event in initiating the <italic>CslF</italic> family.</p>
</sec>
</sec>
<sec id="S5" sec-type="conclusion">
<title>Conclusion</title>
<p>In this research, we took advantage of novel bioinformatic tools for large-scale synteny, genomic, and phylogenetic analysis to improve our knowledge on the genetics, evolution, and functional characterization of the <italic>CesA</italic> superfamily. The tools we used &#x2013; especially the combined analysis of synteny, phylogenetic, and functional data &#x2013; provided the power to dissect the aspects just mentioned in the complex gene families at the basis of plant cell walls. However, the current level of functional characterization of many genes within the <italic>CesA</italic> superfamily is often hampering an effective use of the methodologies described here. Therefore, we foresee that progress in the functional characterization of cell wall genes through reverse genetics will be crucial to further advance our fundamental knowledge on these genes. In addition, future research can take advantage of our results to test the hypotheses we raise and close the open gaps in our understanding of the <italic>CesA</italic> superfamily.</p>
</sec>
<sec id="S6" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="DS1">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="S7">
<title>Author Contributions</title>
<p>FP designed and conducted this research and wrote the article, with inputs and supervision from LT, MS, and EL. LT, MS, and EL corrected the manuscript. LT, MS, and EL approved the final manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S8" sec-type="funding-information">
<title>Funding</title>
<p>This research was part of a project that received funding from the European Union&#x2019;s Horizon 2020 Research and Innovation Program under grant agreement No. 727698.</p>
</sec>
<ack><p>We thank Dr. Robin van Velzen for the help with the phylogenetic analyses.</p>
</ack>
<sec id="S10" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2022.870818/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpls.2022.870818/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="DS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_1.XLSX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname> <given-names>S. F.</given-names></name> <name><surname>Gish</surname> <given-names>W.</given-names></name> <name><surname>Miller</surname> <given-names>W.</given-names></name> <name><surname>Myers</surname> <given-names>E. W.</given-names></name> <name><surname>Lipman</surname> <given-names>D. J.</given-names></name></person-group> (<year>1990</year>). <article-title>Basic local alignment search tool.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>215</volume> <fpage>403</fpage>&#x2013;<lpage>410</lpage>.</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arioli</surname> <given-names>T.</given-names></name> <name><surname>Peng</surname> <given-names>L.</given-names></name> <name><surname>Betzner</surname> <given-names>A. S.</given-names></name> <name><surname>Burn</surname> <given-names>J.</given-names></name> <name><surname>Wittke</surname> <given-names>W.</given-names></name> <name><surname>Herth</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>1998</year>). <article-title>Molecular analysis of cellulose biosynthesis in <italic>Arabidopsis</italic>.</article-title> <source><italic>Science</italic></source> <volume>279</volume> <fpage>717</fpage>&#x2013;<lpage>720</lpage>. <pub-id pub-id-type="doi">10.1126/science.279.5351.717</pub-id> <pub-id pub-id-type="pmid">9445479</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banasiak</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Evolution of the cell wall components during terrestrialization.</article-title> <source><italic>Acta Soc. Botanicorum Pol.</italic></source> <volume>83</volume> <fpage>349</fpage>&#x2013;<lpage>362</lpage>. <pub-id pub-id-type="doi">10.5586/asbp.2014.051</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernal</surname> <given-names>A. J.</given-names></name> <name><surname>Yoo</surname> <given-names>C.-M.</given-names></name> <name><surname>Mutwil</surname> <given-names>M.</given-names></name> <name><surname>Jensen</surname> <given-names>J. K.</given-names></name> <name><surname>Hou</surname> <given-names>G.</given-names></name> <name><surname>Blaukopf</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Functional analysis of the cellulose synthase-like genes CSLD1, CSLD2, and CSLD4 in tip-growing <italic>Arabidopsis</italic> cells.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>148</volume> <fpage>1238</fpage>&#x2013;<lpage>1253</lpage>. <pub-id pub-id-type="doi">10.1104/pp.108.121939</pub-id> <pub-id pub-id-type="pmid">18768911</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buchfink</surname> <given-names>B.</given-names></name> <name><surname>Xie</surname> <given-names>C.</given-names></name> <name><surname>Huson</surname> <given-names>D. H.</given-names></name></person-group> (<year>2015</year>). <article-title>Fast and sensitive protein alignment using DIAMOND.</article-title> <source><italic>Nat. Methods</italic></source> <volume>12</volume> <fpage>59</fpage>&#x2013;<lpage>60</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.3176</pub-id> <pub-id pub-id-type="pmid">25402007</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burton</surname> <given-names>R. A.</given-names></name> <name><surname>Shirley</surname> <given-names>N. J.</given-names></name> <name><surname>King</surname> <given-names>B. J.</given-names></name> <name><surname>Harvey</surname> <given-names>A. J.</given-names></name> <name><surname>Fincher</surname> <given-names>G. B.</given-names></name></person-group> (<year>2004</year>). <article-title>The CesA gene family of barley. Quantitative analysis of transcripts reveals two groups of co-expressed genes.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>134</volume> <fpage>224</fpage>&#x2013;<lpage>236</lpage>. <pub-id pub-id-type="doi">10.1104/pp.103.032904</pub-id> <pub-id pub-id-type="pmid">14701917</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burton</surname> <given-names>R. A.</given-names></name> <name><surname>Wilson</surname> <given-names>S. M.</given-names></name> <name><surname>Hrmova</surname> <given-names>M.</given-names></name> <name><surname>Harvey</surname> <given-names>A. J.</given-names></name> <name><surname>Shirley</surname> <given-names>N. J.</given-names></name> <name><surname>Medhurst</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2006</year>). <article-title>Cellulose synthase-like CslF genes mediate the synthesis of cell wall (1, 3; 1, 4)-&#x00DF;-D-glucans.</article-title> <source><italic>Science</italic></source> <volume>311</volume> <fpage>1940</fpage>&#x2013;<lpage>1942</lpage>. <pub-id pub-id-type="doi">10.1126/science.1122975</pub-id> <pub-id pub-id-type="pmid">16574868</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capella-Guti&#x00E9;rrez</surname> <given-names>S.</given-names></name> <name><surname>Silla-Mart&#x00ED;nez</surname> <given-names>J. M.</given-names></name> <name><surname>Gabald&#x00F3;n</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1972</fpage>&#x2013;<lpage>1973</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp348</pub-id> <pub-id pub-id-type="pmid">19505945</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carlquist</surname> <given-names>S.</given-names></name> <name><surname>Schneider</surname> <given-names>E. L.</given-names></name></person-group> (<year>2001</year>). <article-title>Vessels in ferns: structural, ecological, and evolutionary significance.</article-title> <source><italic>Am. J. Bot.</italic></source> <volume>88</volume> <fpage>1</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.2307/2657121</pub-id> <pub-id pub-id-type="pmid">11159121</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carroll</surname> <given-names>A.</given-names></name> <name><surname>Mansoori</surname> <given-names>N.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Lei</surname> <given-names>L.</given-names></name> <name><surname>Vernhettes</surname> <given-names>S.</given-names></name> <name><surname>Visser</surname> <given-names>R. G.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Complexes with mixed primary and secondary cellulose synthases are functional in <italic>Arabidopsis</italic> plants.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>160</volume> <fpage>726</fpage>&#x2013;<lpage>737</lpage>. <pub-id pub-id-type="doi">10.1104/pp.112.199208</pub-id> <pub-id pub-id-type="pmid">22926318</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>J. W.</given-names></name> <name><surname>Donoghue</surname> <given-names>P. C.</given-names></name></person-group> (<year>2018</year>). <article-title>Whole-genome duplication and plant macroevolution.</article-title> <source><italic>Trends Plant Sci.</italic></source> <volume>23</volume> <fpage>933</fpage>&#x2013;<lpage>945</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2018.07.006</pub-id> <pub-id pub-id-type="pmid">30122372</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cocuron</surname> <given-names>J.-C.</given-names></name> <name><surname>Lerouxel</surname> <given-names>O.</given-names></name> <name><surname>Drakakaki</surname> <given-names>G.</given-names></name> <name><surname>Alonso</surname> <given-names>A. P.</given-names></name> <name><surname>Liepman</surname> <given-names>A. H.</given-names></name> <name><surname>Keegstra</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>A gene from the cellulose synthase-like C family encodes a &#x03B2;-1, 4 glucan synthase.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>104</volume> <fpage>8550</fpage>&#x2013;<lpage>8555</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0703133104</pub-id> <pub-id pub-id-type="pmid">17488821</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cosgrove</surname> <given-names>D. C.</given-names></name></person-group> (<year>2012</year>). <article-title>Comparative structure and biomechanics of plant primary and secondary cell walls.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>3</volume>:<issue>204</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2012.00204</pub-id> <pub-id pub-id-type="pmid">22936943</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crow</surname> <given-names>T.</given-names></name> <name><surname>Ta</surname> <given-names>J.</given-names></name> <name><surname>Nojoomi</surname> <given-names>S.</given-names></name> <name><surname>Aguilar-Rangel</surname> <given-names>M. R.</given-names></name> <name><surname>Rodr&#x00ED;guez</surname> <given-names>J. V. T.</given-names></name> <name><surname>Gates</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Gene regulatory effects of a large chromosomal inversion in highland maize.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>16</volume>:<issue>e1009213</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1009213</pub-id> <pub-id pub-id-type="pmid">33270639</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Demuth</surname> <given-names>J. P.</given-names></name> <name><surname>Hahn</surname> <given-names>M. W.</given-names></name></person-group> (<year>2009</year>). <article-title>The life and death of gene families.</article-title> <source><italic>Bioessays</italic></source> <volume>31</volume> <fpage>29</fpage>&#x2013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1002/bies.080085</pub-id> <pub-id pub-id-type="pmid">19153999</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dhugga</surname> <given-names>K. S.</given-names></name> <name><surname>Barreiro</surname> <given-names>R.</given-names></name> <name><surname>Whitten</surname> <given-names>B.</given-names></name> <name><surname>Stecca</surname> <given-names>K.</given-names></name> <name><surname>Hazebroek</surname> <given-names>J.</given-names></name> <name><surname>Randhawa</surname> <given-names>G. S.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Guar seed &#x00DF;-mannan synthase is a member of the cellulose synthase super gene family.</article-title> <source><italic>Science</italic></source> <volume>303</volume> <fpage>363</fpage>&#x2013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1126/science.1090908</pub-id> <pub-id pub-id-type="pmid">14726589</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doblin</surname> <given-names>M. S.</given-names></name> <name><surname>De Melis</surname> <given-names>L.</given-names></name> <name><surname>Newbigin</surname> <given-names>E.</given-names></name> <name><surname>Bacic</surname> <given-names>A.</given-names></name> <name><surname>Read</surname> <given-names>S. M.</given-names></name></person-group> (<year>2001</year>). <article-title>Pollen tubes of <italic>Nicotiana alata</italic> express two genes from different &#x03B2;-glucan synthase families.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>125</volume> <fpage>2040</fpage>&#x2013;<lpage>2052</lpage>. <pub-id pub-id-type="doi">10.1104/pp.125.4.2040</pub-id> <pub-id pub-id-type="pmid">11299383</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doblin</surname> <given-names>M. S.</given-names></name> <name><surname>Pettolino</surname> <given-names>F. A.</given-names></name> <name><surname>Wilson</surname> <given-names>S. M.</given-names></name> <name><surname>Campbell</surname> <given-names>R.</given-names></name> <name><surname>Burton</surname> <given-names>R. A.</given-names></name> <name><surname>Fincher</surname> <given-names>G. B.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>A barley cellulose synthase-like CSLH gene mediates (1, 3; 1, 4)-&#x03B2;-D-glucan synthesis in transgenic <italic>Arabidopsis</italic>.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>106</volume> <fpage>5996</fpage>&#x2013;<lpage>6001</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0902019106</pub-id> <pub-id pub-id-type="pmid">19321749</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eddy</surname> <given-names>S. R.</given-names></name></person-group> (<year>2009</year>). <article-title>A new generation of homology search tools based on probabilistic inference.</article-title> <source><italic>Genome Informatics</italic></source> <volume>23</volume> <fpage>205</fpage>&#x2013;<lpage>211</lpage>. <pub-id pub-id-type="pmid">20180275</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>El-Gebali</surname> <given-names>S.</given-names></name> <name><surname>Mistry</surname> <given-names>J.</given-names></name> <name><surname>Bateman</surname> <given-names>A.</given-names></name> <name><surname>Eddy</surname> <given-names>S. R.</given-names></name> <name><surname>Luciani</surname> <given-names>A.</given-names></name> <name><surname>Potter</surname> <given-names>S. C.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>The Pfam protein families database in 2019.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>D427</fpage>&#x2013;<lpage>D432</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky995</pub-id> <pub-id pub-id-type="pmid">30357350</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gould</surname> <given-names>B. A.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Lowry</surname> <given-names>D. B.</given-names></name></person-group> (<year>2018</year>). <article-title>Gene regulatory divergence between locally adapted ecotypes in their native habitats.</article-title> <source><italic>Mol. Ecol.</italic></source> <volume>27</volume> <fpage>4174</fpage>&#x2013;<lpage>4188</lpage>. <pub-id pub-id-type="doi">10.1111/mec.14852</pub-id> <pub-id pub-id-type="pmid">30168223</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haigler</surname> <given-names>C. H.</given-names></name> <name><surname>Roberts</surname> <given-names>A. W.</given-names></name></person-group> (<year>2019</year>). <article-title>Structure/function relationships in the rosette cellulose synthesis complex illuminated by an evolutionary perspective.</article-title> <source><italic>Cellulose</italic></source> <volume>26</volume> <fpage>227</fpage>&#x2013;<lpage>247</lpage>. <pub-id pub-id-type="doi">10.1007/s10570-018-2157-9</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>R.</given-names></name> <name><surname>Dong</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>AtCSLD3 and GhCSLD3 mediate root growth and cell elongation downstream of the ethylene response pathway in <italic>Arabidopsis</italic>.</article-title> <source><italic>J. Exp. Bot.</italic></source> <volume>69</volume> <fpage>1065</fpage>&#x2013;<lpage>1080</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/erx470</pub-id> <pub-id pub-id-type="pmid">29253184</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>C. T.</given-names></name> <name><surname>Kirienko</surname> <given-names>D. H.</given-names></name> <name><surname>Sylvester</surname> <given-names>A. W.</given-names></name> <name><surname>Peter</surname> <given-names>G. F.</given-names></name> <name><surname>Mccarty</surname> <given-names>D. R.</given-names></name> <name><surname>Koch</surname> <given-names>K. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Cellulose synthase-Like D1 is integral to normal cell division, expansion, and leaf development in maize.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>158</volume> <fpage>708</fpage>&#x2013;<lpage>724</lpage>. <pub-id pub-id-type="doi">10.1104/pp.111.188466</pub-id> <pub-id pub-id-type="pmid">22123901</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jensen</surname> <given-names>J. D.</given-names></name> <name><surname>Bachtrog</surname> <given-names>D.</given-names></name></person-group> (<year>2010</year>). <article-title>Characterizing recurrent positive selection at fast-evolving genes in <italic>Drosophila miranda</italic> and <italic>Drosophila pseudoobscura</italic>.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>2</volume> <fpage>371</fpage>&#x2013;<lpage>378</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evq028</pub-id> <pub-id pub-id-type="pmid">20624741</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalluri</surname> <given-names>U. C.</given-names></name> <name><surname>Joshi</surname> <given-names>C. P.</given-names></name></person-group> (<year>2004</year>). <article-title>Differential expression patterns of two cellulose synthase genes are associated with primary and secondary cell wall development in aspen trees.</article-title> <source><italic>Planta</italic></source> <volume>220</volume> <fpage>47</fpage>&#x2013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-004-1329-z</pub-id> <pub-id pub-id-type="pmid">15278458</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Standley</surname> <given-names>D. M.</given-names></name></person-group> (<year>2013</year>). <article-title>MAFFT multiple sequence alignment software version 7: improvements in performance and usability.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>30</volume> <fpage>772</fpage>&#x2013;<lpage>780</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst010</pub-id> <pub-id pub-id-type="pmid">23329690</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaur</surname> <given-names>S.</given-names></name> <name><surname>Dhugga</surname> <given-names>K. S.</given-names></name> <name><surname>Gill</surname> <given-names>K.</given-names></name> <name><surname>Singh</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>Novel structural and functional motifs in cellulose synthase (CesA) genes of bread wheat (<italic>Triticum aestivum</italic>, L.).</article-title> <source><italic>PLoS One</italic></source> <volume>11</volume>:<issue>e0147046</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0147046</pub-id> <pub-id pub-id-type="pmid">26771740</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kerstens</surname> <given-names>M. H.</given-names></name> <name><surname>Schranz</surname> <given-names>M. E.</given-names></name> <name><surname>Bouwmeester</surname> <given-names>K.</given-names></name></person-group> (<year>2020</year>). <article-title>Phylogenomic analysis of the APETALA2 transcription factor subfamily across angiosperms reveals both deep conservation and lineage-specific patterns.</article-title> <source><italic>Plant J.</italic></source> <volume>103</volume> <fpage>1516</fpage>&#x2013;<lpage>1524</lpage>. <pub-id pub-id-type="doi">10.1111/tpj.14843</pub-id> <pub-id pub-id-type="pmid">32436321</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>C. M.</given-names></name> <name><surname>Park</surname> <given-names>S. H.</given-names></name> <name><surname>Je</surname> <given-names>B. I.</given-names></name> <name><surname>Park</surname> <given-names>S. H.</given-names></name> <name><surname>Park</surname> <given-names>S. J.</given-names></name> <name><surname>Piao</surname> <given-names>H. L.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>OsCSLD1, a cellulose synthase-like D1 gene, is required for root hair morphogenesis in rice.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>143</volume> <fpage>1220</fpage>&#x2013;<lpage>1230</lpage>. <pub-id pub-id-type="doi">10.1104/pp.106.091546</pub-id> <pub-id pub-id-type="pmid">17259288</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>S.-J.</given-names></name> <name><surname>Chandrasekar</surname> <given-names>B.</given-names></name> <name><surname>Rea</surname> <given-names>A. C.</given-names></name> <name><surname>Danhof</surname> <given-names>L.</given-names></name> <name><surname>Zemelis-Durfee</surname> <given-names>S.</given-names></name> <name><surname>Thrower</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>The synthesis of xyloglucan, an abundant plant cell wall polysaccharide, requires CSLC function.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>117</volume> <fpage>20316</fpage>&#x2013;<lpage>20324</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.2007245117</pub-id> <pub-id pub-id-type="pmid">32737163</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lai</surname> <given-names>X.</given-names></name> <name><surname>Bendix</surname> <given-names>C.</given-names></name> <name><surname>Yan</surname> <given-names>L.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Schnable</surname> <given-names>J. C.</given-names></name> <name><surname>Harmon</surname> <given-names>F. G.</given-names></name></person-group> (<year>2020</year>). <article-title>Interspecific analysis of diurnal gene regulation in panicoid grasses identifies known and novel regulatory motifs.</article-title> <source><italic>BMC Genomics</italic></source> <volume>21</volume>:<issue>428</issue>. <pub-id pub-id-type="doi">10.1186/s12864-020-06824-3</pub-id> <pub-id pub-id-type="pmid">32586356</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>S.</given-names></name> <name><surname>Choi</surname> <given-names>S.</given-names></name> <name><surname>Jeon</surname> <given-names>D.</given-names></name> <name><surname>Kang</surname> <given-names>Y.</given-names></name> <name><surname>Kim</surname> <given-names>C.</given-names></name></person-group> (<year>2020</year>). <article-title>Evolutionary impact of whole genome duplication in <italic>Poaceae</italic> family.</article-title> <source><italic>J. Crop Sci. Biotechnol.</italic></source> <volume>23</volume> <fpage>413</fpage>&#x2013;<lpage>425</lpage>. <pub-id pub-id-type="doi">10.1007/s12892-020-00049-2</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Interactive tree of life (iTOL) v4: recent updates and new developments.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>W256</fpage>&#x2013;<lpage>W259</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz239</pub-id> <pub-id pub-id-type="pmid">30931475</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>A.</given-names></name> <name><surname>Wang</surname> <given-names>R.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Fan</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Proteomic profiling of cellulase-aid-extracted membrane proteins for functional identification of cellulose synthase complexes and their potential associated-components in cotton fibers.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>6</volume> <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1038/srep26356</pub-id> <pub-id pub-id-type="pmid">27192945</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>A.</given-names></name> <name><surname>Xia</surname> <given-names>T.</given-names></name> <name><surname>Xu</surname> <given-names>W.</given-names></name> <name><surname>Chen</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Fan</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>An integrative analysis of four CESA isoforms specific for fiber cellulose production between <italic>Gossypium hirsutum</italic> and <italic>Gossypium barbadense</italic>.</article-title> <source><italic>Planta</italic></source> <volume>237</volume> <fpage>1585</fpage>&#x2013;<lpage>1597</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-013-1868-2</pub-id> <pub-id pub-id-type="pmid">23508664</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Xiong</surname> <given-names>G.</given-names></name> <name><surname>Li</surname> <given-names>R.</given-names></name> <name><surname>Cui</surname> <given-names>J.</given-names></name> <name><surname>Tang</surname> <given-names>D.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Rice cellulose synthase-like D4 is essential for normal cell-wall biosynthesis and plant growth.</article-title> <source><italic>Plant J.</italic></source> <volume>60</volume> <fpage>1055</fpage>&#x2013;<lpage>1069</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2009.04022.x</pub-id> <pub-id pub-id-type="pmid">19765235</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>W.</given-names></name> <name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Yao</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Song</surname> <given-names>W.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name></person-group> (<year>2018</year>). <article-title>Cellulose synthase-like D1 controls organ size in maize.</article-title> <source><italic>BMC Plant Biol.</italic></source> <volume>18</volume>:<issue>239</issue>. <pub-id pub-id-type="doi">10.1186/s12870-018-1453-8</pub-id> <pub-id pub-id-type="pmid">30326832</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liepman</surname> <given-names>A. H.</given-names></name> <name><surname>Cavalier</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>The cellulose synthase-like A and cellulose synthase-like C families: recent advances and future perspectives.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>3</volume>:<issue>109</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2012.00109</pub-id> <pub-id pub-id-type="pmid">22654891</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liepman</surname> <given-names>A. H.</given-names></name> <name><surname>Wilkerson</surname> <given-names>C. G.</given-names></name> <name><surname>Keegstra</surname> <given-names>K.</given-names></name></person-group> (<year>2005</year>). <article-title>Expression of cellulose synthase-like (Csl) genes in insect cells reveals that CslA family members encode mannan synthases.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>102</volume> <fpage>2221</fpage>&#x2013;<lpage>2226</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0409179102</pub-id> <pub-id pub-id-type="pmid">15647349</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Little</surname> <given-names>A.</given-names></name> <name><surname>Lahnstein</surname> <given-names>J.</given-names></name> <name><surname>Jeffery</surname> <given-names>D. W.</given-names></name> <name><surname>Khor</surname> <given-names>S. F.</given-names></name> <name><surname>Schwerdt</surname> <given-names>J. G.</given-names></name> <name><surname>Shirley</surname> <given-names>N. J.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>A novel (1, 4)-&#x03B2;-linked glucoxylan is synthesized by members of the Cellulose synthase-like F gene family in land plants.</article-title> <source><italic>ACS Cent. Sci.</italic></source> <volume>5</volume> <fpage>73</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1021/acscentsci.8b00568</pub-id> <pub-id pub-id-type="pmid">30693327</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Little</surname> <given-names>A.</given-names></name> <name><surname>Schwerdt</surname> <given-names>J. G.</given-names></name> <name><surname>Shirley</surname> <given-names>N. J.</given-names></name> <name><surname>Khor</surname> <given-names>S. F.</given-names></name> <name><surname>Neumann</surname> <given-names>K.</given-names></name> <name><surname>O&#x2019;Donovan</surname> <given-names>L. A.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Revised phylogeny of the cellulose synthase gene superfamily: insights into cell wall evolution.</article-title> <source><italic>Plant physiology</italic></source> <volume>177</volume> <fpage>1124</fpage>&#x2013;<lpage>1141</lpage>. <pub-id pub-id-type="doi">10.1104/pp.17.01718</pub-id> <pub-id pub-id-type="pmid">29780036</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>P.</given-names></name> <name><surname>Song</surname> <given-names>F.</given-names></name> <name><surname>Guan</surname> <given-names>M.</given-names></name> <name><surname>Jin</surname> <given-names>L.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Four novel cellulose synthase (CESA) genes from birch (<italic>Betula platyphylla</italic> Suk.) involved in primary and secondary cell wall biosynthesis.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>13</volume> <fpage>12195</fpage>&#x2013;<lpage>12212</lpage>. <pub-id pub-id-type="doi">10.3390/ijms131012195</pub-id> <pub-id pub-id-type="pmid">23202892</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luan</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>F.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Peng</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>OsCD1 encodes a putative member of the cellulose synthase-like D sub-family and is essential for rice plant architecture and growth.</article-title> <source><italic>Plant Biotechnol. J.</italic></source> <volume>9</volume> <fpage>513</fpage>&#x2013;<lpage>524</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7652.2010.00570.x</pub-id> <pub-id pub-id-type="pmid">20955181</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pear</surname> <given-names>J. R.</given-names></name> <name><surname>Kawagoe</surname> <given-names>Y.</given-names></name> <name><surname>Schreckengost</surname> <given-names>W. E.</given-names></name> <name><surname>Delmer</surname> <given-names>D. P.</given-names></name> <name><surname>Stalker</surname> <given-names>D. M.</given-names></name></person-group> (<year>1996</year>). <article-title>Higher plants contain homologs of the bacterial celA genes encoding the catalytic subunit of cellulose synthase.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>93</volume> <fpage>12637</fpage>&#x2013;<lpage>12642</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.93.22.12637</pub-id> <pub-id pub-id-type="pmid">8901635</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pettolino</surname> <given-names>F. A.</given-names></name> <name><surname>Walsh</surname> <given-names>C.</given-names></name> <name><surname>Fincher</surname> <given-names>G. B.</given-names></name> <name><surname>Bacic</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Determining the polysaccharide composition of plant cell walls.</article-title> <source><italic>Nat. Protocols</italic></source> <volume>7</volume> <fpage>1590</fpage>&#x2013;<lpage>1607</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2012.081</pub-id> <pub-id pub-id-type="pmid">22864200</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ranik</surname> <given-names>M.</given-names></name> <name><surname>Myburg</surname> <given-names>A. A.</given-names></name></person-group> (<year>2006</year>). <article-title>Six new cellulose synthase genes from Eucalyptus are associated with primary and secondary cell wall biosynthesis.</article-title> <source><italic>Tree Physiol.</italic></source> <volume>26</volume> <fpage>545</fpage>&#x2013;<lpage>556</lpage>. <pub-id pub-id-type="doi">10.1093/treephys/26.5.545</pub-id> <pub-id pub-id-type="pmid">16452068</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rao</surname> <given-names>X.</given-names></name> <name><surname>Dixon</surname> <given-names>R. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Current models for transcriptional regulation of secondary cell wall biosynthesis in grasses.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>9</volume>:<issue>399</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2018.00399</pub-id> <pub-id pub-id-type="pmid">29670638</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richmond</surname> <given-names>T. A.</given-names></name> <name><surname>Somerville</surname> <given-names>C. R.</given-names></name></person-group> (<year>2000</year>). <article-title>The cellulose synthase superfamily.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>124</volume> <fpage>495</fpage>&#x2013;<lpage>498</lpage>. <pub-id pub-id-type="doi">10.1104/pp.124.2.495</pub-id> <pub-id pub-id-type="pmid">11027699</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sarkar</surname> <given-names>P.</given-names></name> <name><surname>Bosneaga</surname> <given-names>E.</given-names></name> <name><surname>Auer</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Plant cell walls throughout evolution: towards a molecular understanding of their design principles.</article-title> <source><italic>J. Exp. Bot.</italic></source> <volume>60</volume> <fpage>3615</fpage>&#x2013;<lpage>3635</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/erp245</pub-id> <pub-id pub-id-type="pmid">19687127</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwerdt</surname> <given-names>J. G.</given-names></name> <name><surname>Mackenzie</surname> <given-names>K.</given-names></name> <name><surname>Wright</surname> <given-names>F.</given-names></name> <name><surname>Oehme</surname> <given-names>D.</given-names></name> <name><surname>Wagner</surname> <given-names>J. M.</given-names></name> <name><surname>Harvey</surname> <given-names>A. J.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Evolutionary dynamics of the cellulose synthase gene superfamily in grasses.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>168</volume> <fpage>968</fpage>&#x2013;<lpage>983</lpage>. <pub-id pub-id-type="doi">10.1104/pp.15.00140</pub-id> <pub-id pub-id-type="pmid">25999407</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seppey</surname> <given-names>M.</given-names></name> <name><surname>Manni</surname> <given-names>M.</given-names></name> <name><surname>Zdobnov</surname> <given-names>E. M.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>BUSCO: assessing genome assembly and annotation completeness</article-title>,&#x201D; in <source><italic>Gene Prediction: Methods and Protocols</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Kollmar</surname> <given-names>M.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer New York</publisher-name>), <fpage>227</fpage>&#x2013;<lpage>245</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-9173-0_14</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>S&#x00F8;rensen</surname> <given-names>I.</given-names></name> <name><surname>Domozych</surname> <given-names>D.</given-names></name> <name><surname>Willats</surname> <given-names>W. G.</given-names></name></person-group> (<year>2010</year>). <article-title>How have plant cell walls evolved?</article-title> <source><italic>Plant Physiol.</italic></source> <volume>153</volume> <fpage>366</fpage>&#x2013;<lpage>372</lpage>. <pub-id pub-id-type="doi">10.1104/pp.110.154427</pub-id> <pub-id pub-id-type="pmid">20431088</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Speck</surname> <given-names>T.</given-names></name> <name><surname>Burgert</surname> <given-names>I.</given-names></name></person-group> (<year>2011</year>). <article-title>Plant stems: functional design and mechanics.</article-title> <source><italic>Annu. Rev. Materials Res.</italic></source> <volume>41</volume> <fpage>169</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-matsci-062910-100425</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>1312</fpage>&#x2013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id> <pub-id pub-id-type="pmid">24451623</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taylor-Teeples</surname> <given-names>M.</given-names></name> <name><surname>Lin</surname> <given-names>L.</given-names></name> <name><surname>De Lucas</surname> <given-names>M.</given-names></name> <name><surname>Turco</surname> <given-names>G.</given-names></name> <name><surname>Toal</surname> <given-names>T.</given-names></name> <name><surname>Gaudinier</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>An Arabidopsis gene regulatory network for secondary cell wall synthesis.</article-title> <source><italic>Nature</italic></source> <volume>517</volume> <fpage>571</fpage>&#x2013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.1038/nature14099</pub-id> <pub-id pub-id-type="pmid">25533953</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Turner</surname> <given-names>S. R.</given-names></name> <name><surname>Somerville</surname> <given-names>C. R.</given-names></name></person-group> (<year>1997</year>). <article-title>Collapsed xylem phenotype of Arabidopsis identifies mutants deficient in cellulose deposition in the secondary cell wall.</article-title> <source><italic>Plant Cell</italic></source> <volume>9</volume> <fpage>689</fpage>&#x2013;<lpage>701</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.9.5.689</pub-id> <pub-id pub-id-type="pmid">9165747</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van de Peer</surname> <given-names>Y.</given-names></name> <name><surname>Mizrachi</surname> <given-names>E.</given-names></name> <name><surname>Marchal</surname> <given-names>K.</given-names></name></person-group> (<year>2017</year>). <article-title>The evolutionary significance of polyploidy.</article-title> <source><italic>Nat. Rev. Genet.</italic></source> <volume>18</volume> <fpage>411</fpage>&#x2013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.1038/nrg.2017.26</pub-id> <pub-id pub-id-type="pmid">28502977</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Verhertbruggen</surname> <given-names>Y.</given-names></name> <name><surname>Yin</surname> <given-names>L.</given-names></name> <name><surname>Oikawa</surname> <given-names>A.</given-names></name> <name><surname>Scheller</surname> <given-names>H. V.</given-names></name></person-group> (<year>2011</year>). <article-title>Mannan synthase activity in the CSLD family.</article-title> <source><italic>Plant Signal. Behav.</italic></source> <volume>6</volume> <fpage>1620</fpage>&#x2013;<lpage>1623</lpage>. <pub-id pub-id-type="doi">10.4161/psb.6.10.17989</pub-id> <pub-id pub-id-type="pmid">21904114</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vogel</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Unique aspects of the grass cell wall.</article-title> <source><italic>Curr. Opin. Plant Biol.</italic></source> <volume>11</volume> <fpage>301</fpage>&#x2013;<lpage>307</lpage>. <pub-id pub-id-type="doi">10.1016/j.pbi.2008.03.002</pub-id> <pub-id pub-id-type="pmid">18434239</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Guo</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Tu</surname> <given-names>Y.</given-names></name> <name><surname>Hu</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Expression profiling and integrative analysis of the CESA/CSL superfamily in rice.</article-title> <source><italic>BMC Plant Biol.</italic></source> <volume>10</volume>:<issue>282</issue>. <pub-id pub-id-type="doi">10.1186/1471-2229-10-282</pub-id> <pub-id pub-id-type="pmid">21167079</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Cnops</surname> <given-names>G.</given-names></name> <name><surname>Vanderhaeghen</surname> <given-names>R.</given-names></name> <name><surname>De Block</surname> <given-names>S.</given-names></name> <name><surname>Van Montagu</surname> <given-names>M.</given-names></name> <name><surname>Van Lijsebettens</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>AtCSLD3, a cellulose synthase-like gene important for root hair growth in Arabidopsis.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>126</volume> <fpage>575</fpage>&#x2013;<lpage>586</lpage>. <pub-id pub-id-type="doi">10.1104/pp.126.2.575</pub-id> <pub-id pub-id-type="pmid">11402188</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>H.</given-names></name> <name><surname>Debarry</surname> <given-names>J. D.</given-names></name> <name><surname>Tan</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <issue>e49</issue>. <pub-id pub-id-type="doi">10.1093/nar/gkr1293</pub-id> <pub-id pub-id-type="pmid">22217600</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Algesheimer</surname> <given-names>R.</given-names></name> <name><surname>Tessone</surname> <given-names>C. J.</given-names></name></person-group> (<year>2016</year>). <article-title>A comparative analysis of community detection algorithms on artificial networks.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>6</volume>:<issue>30750</issue>.</citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>Y.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Xu</surname> <given-names>Y.</given-names></name></person-group> (<year>2009</year>). <article-title>The cellulose synthase superfamily in fully sequenced plants and algae.</article-title> <source><italic>BMC Plant Biol.</italic></source> <volume>9</volume>:<issue>99</issue>. <pub-id pub-id-type="doi">10.1186/1471-2229-9-99</pub-id> <pub-id pub-id-type="pmid">19646250</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Xie</surname> <given-names>M.</given-names></name> <name><surname>Tuskan</surname> <given-names>G. A.</given-names></name> <name><surname>Muchero</surname> <given-names>W.</given-names></name> <name><surname>Chen</surname> <given-names>J.-G.</given-names></name></person-group> (<year>2018</year>). <article-title>Recent advances in the transcriptional regulation of secondary cell wall biosynthesis in the woody plants.</article-title> <source><italic>Front. Plant Sci.</italic></source> <volume>9</volume>:<issue>1535</issue>. <pub-id pub-id-type="doi">10.3389/fpls.2018.01535</pub-id> <pub-id pub-id-type="pmid">30405670</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>T.</given-names></name> <name><surname>Schranz</surname> <given-names>M. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Network approaches for plant phylogenomic synteny analysis.</article-title> <source><italic>Curr. Opin. Plant Biol.</italic></source> <volume>36</volume> <fpage>129</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1016/j.pbi.2017.03.001</pub-id> <pub-id pub-id-type="pmid">28327435</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>T.</given-names></name> <name><surname>Schranz</surname> <given-names>M. E.</given-names></name></person-group> (<year>2019</year>). <article-title>Network-based microsynteny analysis identifies major differences and genomic outliers in mammalian and angiosperm genomes.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>116</volume> <fpage>2165</fpage>&#x2013;<lpage>2174</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1801757116</pub-id> <pub-id pub-id-type="pmid">30674676</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>T.</given-names></name> <name><surname>Holmer</surname> <given-names>R.</given-names></name> <name><surname>De Bruijn</surname> <given-names>S.</given-names></name> <name><surname>Angenent</surname> <given-names>G. C.</given-names></name> <name><surname>Van Den Burg</surname> <given-names>H. A.</given-names></name> <name><surname>Schranz</surname> <given-names>M. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Phylogenomic synteny network analysis of MADS-box transcription factor genes reveals lineage-specific transpositions, ancient tandem duplications, and deep positional conservation.</article-title> <source><italic>Plant Cell</italic></source> <volume>29</volume> <fpage>1278</fpage>&#x2013;<lpage>1292</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.17.00312</pub-id> <pub-id pub-id-type="pmid">28584165</pub-id></citation></ref>
</ref-list>
</back>
</article>
