<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2021.758290</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Complete Chloroplast Genome Sequence of <italic>Erigeron breviscapus</italic> and Characterization of Chloroplast Regulatory Elements</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Yu</surname>
<given-names>Yifan</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1432125/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ouyang</surname>
<given-names>Zhen</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/509634/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Guo</surname>
<given-names>Juan</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/490196/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zeng</surname>
<given-names>Wen</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Zhao</surname>
<given-names>Yujun</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Huang</surname>
<given-names>Luqi</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="c002" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/362483/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Food and Biological Engineering, Jiangsu University</institution>, <addr-line>Zhenjiang</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>State Key Laboratory Breeding Base of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<author-notes>
<fn id="fn1" fn-type="edited-by"><p>Edited by: Thomas D. Sharkey, Michigan State University, United States</p></fn>
<fn id="fn2" fn-type="edited-by"><p>Reviewed by: Jitesh Kumar, University of Minnesota Twin Cities, United States; Yongbo Duan, Huaibei Normal University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Yujun Zhao, <email>zhaoyj@nrc.ac.cn</email></corresp>
<corresp id="c002">Luqi Huang, <email>huangluqi01@126.com</email></corresp>
<fn id="fn3" fn-type="other"><p>This article was submitted to Plant Biotechnology, a section of the journal Frontiers in Plant Science</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>11</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>758290</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>08</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>10</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Yu, Ouyang, Guo, Zeng, Zhao and Huang.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Yu, Ouyang, Guo, Zeng, Zhao and Huang</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p><italic>Erigeron breviscapus</italic> is a famous medicinal plant. However, the limited chloroplast genome information of <italic>E. breviscapus</italic>, especially for the chloroplast DNA sequence resources, has hindered the study of <italic>E. breviscapus</italic> chloroplast genome transformation. Here, the complete chloroplast (cp) genome of <italic>E. breviscapus</italic> was reported. This genome was 152,164bp in length, included 37.2% GC content and was structurally arranged into two 24,699bp inverted repeats (IRs) and two single-copy areas. The sizes of the large single-copy region and the small single-copy region were 84,657 and 18,109bp, respectively. The <italic>E. breviscapus</italic> cp genome consisted of 127 coding genes, including 83 protein coding genes, 36 transfer RNA (tRNA) genes, and eight ribosomal RNA (rRNA) genes. For those genes, 95 genes were single copy genes and 16 genes were duplicated in two inverted regions with seven tRNAs, four rRNAs, and five protein coding genes. Then, genomic DNA of <italic>E. breviscapus</italic> was used as a template, and the endogenous 5' and 3' flanking sequences of the <italic>trnI</italic> gene and <italic>trnA</italic> gene were selected as homologous recombinant fragments in vector construction and cloned through PCR. The endogenous 5' flanking sequences of the <italic>psbA</italic> gene and <italic>rrn16S</italic> gene, the endogenous 3' flanking sequences of the <italic>psbA</italic> gene, <italic>rbcL</italic> gene, and <italic>rps16</italic> gene and one sequence element from the psbN-psbH chloroplast operon were cloned, and certain chloroplast regulatory elements were identified. Two homologous recombination fragments and all of these elements were constructed into the cloning vector pBluescript SK (+) to yield a series of chloroplast expression vectors, which harbored the reporter gene <italic>EGFP</italic> and the selectable marker <italic>aadA</italic> gene. After identification, the chloroplast expression vectors were transformed into <italic>Escherichia coli</italic> and the function of predicted regulatory elements was confirmed by a spectinomycin resistance test and fluorescence intensity measurement. The results indicated that <italic>aadA</italic> gene and <italic>EGFP</italic> gene were efficiently expressed under the regulation of predicted regulatory elements and the chloroplast expression vector had been successfully constructed, thereby providing a solid foundation for establishing subsequent <italic>E. breviscapus</italic> chloroplast transformation system and genetic improvement of <italic>E. breviscapus</italic>.</p>
</abstract>
<kwd-group>
<kwd><italic>Erigeron breviscapus</italic></kwd>
<kwd>chloroplast genome sequencing</kwd>
<kwd>chloroplast regulatory elements</kwd>
<kwd>gene assembly</kwd>
<kwd>prokaryotic expression</kwd>
</kwd-group>
<contract-num rid="cn1">2020YFA0908000</contract-num>
<contract-num rid="cn2">81891013</contract-num>
<contract-num rid="cn3">ZK2021003</contract-num>
<contract-sponsor id="cn1">National Key R&#x0026;D Program of China</contract-sponsor>
<contract-sponsor id="cn2">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content>
</contract-sponsor>
<contract-sponsor id="cn3">Fundamental Research Funds</contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="39"/>
<page-count count="12"/>
<word-count count="7830"/>
</counts>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<title>Introduction</title>
<p><italic>Erigeron breviscapus</italic> (Vant.) Hand-Mazz. is a well-known medicinal plant that belongs to the family Asteraceae, and it is distributed in the southwestern region of China, mainly in Yunnan, Sichuan, Guizhou, and Guangxi Provinces (<xref ref-type="bibr" rid="ref20">Li et al., 2013</xref>). This herb is commonly used for treating occlusive cerebrovascular disease and sequelae of cerebral hemorrhage and for improving microcirculation, reducing blood viscosity, and preventing platelet aggregation (<xref ref-type="bibr" rid="ref18">Li et al., 1998</xref>; <xref ref-type="bibr" rid="ref31">Sheng et al., 1999</xref>; <xref ref-type="bibr" rid="ref22">Liu et al., 2010</xref>). In addition, <italic>E. breviscapus</italic> has the characteristics of a short growth period, strong vitality, an efficient tissue culture system (<xref ref-type="bibr" rid="ref3">Chen et al., 2018</xref>), and available genomic information (<xref ref-type="bibr" rid="ref15">Jiang et al., 2014</xref>; <xref ref-type="bibr" rid="ref38">Yang et al., 2017</xref>).</p>
<p>However, with the in-depth study of <italic>E. breviscapus</italic> chloroplasts, the current research on the genetic background of this species mainly focuses on the use of the chloroplast genome of <italic>E. breviscapus</italic> for phylogenetic research and germplasm resource protection (<xref ref-type="bibr" rid="ref19">Li et al., 2019</xref>; <xref ref-type="bibr" rid="ref24">Meng et al., 2019</xref>), <xref ref-type="bibr" rid="ref24">Meng et al. (2019)</xref> used the Illumina sequencing platform to obtain a chloroplast genome with a size of 152,183bp, but the article only briefly mentioned the basic structural information of the species&#x2019;s chloroplast genome and the type and number of genes, and the phylogenetic position of <italic>E. breviscapus</italic> was determined. <xref ref-type="bibr" rid="ref19">Li et al. (2019)</xref> used the Illumina sequencing platform to obtain a chloroplast genome with a size of 152,367bp, and only briefly introduced the basic structural information of the species&#x2019;s chloroplast genome and the type and number of genes in the article, and the phylogenetic analysis was conducted to infer phylogenetic position of <italic>E. breviscapus</italic> and <italic>Erigeron multiradiatus</italic> within the family of Asteraceae, the above two reports did not mention the test data and its quality control. Both of the above mentioned studies lacked systematic prediction of the gene structure of the chloroplast genome of <italic>E. breviscapus</italic> and bioinformatics analysis such as codon usage bias analysis and simple sequence repeat (SSR) analysis. In addition, the relevant regulatory elements of the chloroplasts of this species have not been identified. Therefore, for follow-up research on the establishment of the chloroplast genetic transformation system of <italic>E. breviscapus</italic>, it is necessary to obtain a complete <italic>E. breviscapus</italic> cp genome and screen out highly efficient genetic regulatory elements.</p>
<p>Previous studies have reported some chloroplast regulatory elements and proved that these elements play an important role in the level of gene expression. Native genes <italic>trnA/trnI</italic> are located at transcriptionally active spacer regions. After an herbicide resistance gene was integrated into the transcriptionally active spacer region for the first time (<xref ref-type="bibr" rid="ref5">Daniell et al., 1998</xref>), most subsequent studies preferentially used this site of integration. Compared with the transcriptional silencing spacer (rbcl/accD), the integration of the transgene into the transcriptionally active spacer (trnl/trnA) resulted in 25-fold higher expression of transgenes (<xref ref-type="bibr" rid="ref17">Krichevsky et al., 2010</xref>) and the <italic>trnA</italic> gene intron included the chloroplast origin of replication and generated more copies of the template (chloroplast vector) to integrate the transgene cassette (<xref ref-type="bibr" rid="ref6">Daniell et al., 1990</xref>). In addition to the integration site, regulatory sequences located upstream (promoter, 5&#x2019;UTR) and downstream (3&#x2019;UTR) of the transgene play a major role in determining its expression level. The most commonly used promoters are photosystem II reaction center promoter PpsbA and <italic>rrn16S</italic> gene promoter Prrn; moreover, the psbA regulatory region, which was first used almost 30years ago (<xref ref-type="bibr" rid="ref6">Daniell et al., 1990</xref>), still appears to be the best option for use in an expression cassette because the <italic>psbA</italic> gene mediates light-induced activation of translation (<xref ref-type="bibr" rid="ref29">Ruhlman et al., 2010</xref>) and it also encodes the most highly translated protein in the chloroplast (<xref ref-type="bibr" rid="ref16">Klein and Mullet, 1987</xref>). In a pioneering work, <xref ref-type="bibr" rid="ref9">Fei et al. (2007)</xref> studied the <italic>psbB</italic> operon from tobacco chloroplasts and identified a small cistronic element called IEE, which was included in the synthetic structure to drive the expression of <italic>nptII</italic> and <italic>yfp</italic> and proved to be sufficient for mediating polycistronic expression. The polycistronic transcript has been processed into stable and translatable monocistronic mRNA, and this function has been used to express transgenic polycistronic mRNA from a single promoter in a single transformation event (<xref ref-type="bibr" rid="ref23">Lu et al., 2013</xref>; <xref ref-type="bibr" rid="ref1">Bock, 2014</xref>; <xref ref-type="bibr" rid="ref10">Fuentes et al., 2016</xref>). Therefore, for the in-depth study of chloroplast transformation in medicinal plants, it is necessary to further develop and explore the endogenous regulatory elements and establish a more efficient chloroplast transformation system. In this work, we combined second-generation and third-generation sequencing technologies to obtain more completely and accurate <italic>E. breviscapus</italic> cp genome sequence information and performed detailed bioinformatics analysis, including gene annotation, structural statistics, SSR analysis, and codon usage bias analysis. Then, a series of chloroplast regulatory elements were cloned from <italic>E. breviscapus</italic>, and the corresponding chloroplast expression vectors of <italic>E. breviscapus</italic> were constructed and the biological functions of these chloroplast regulatory elements were verified through <italic>in vitro</italic> experiments, thus providing a foundation for the establishment of the subsequent chloroplast genetic transformation system and research on the biology and evolution of <italic>E. breviscapus</italic>.</p>
</sec>
<sec id="sec2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="sec3">
<title>Materials and Reagents</title>
<p>Soaked the seeds in deionized water for 3&#x2013;4h, then placed the seeds at room temperature to fully dry them for 3&#x2013;4h, soaked the seeds in 1% NaClO solution for 10min in the Vertical Flow Clean Bench, and then washed them with sterile water three times. About 20&#x2013;30 sterilized seeds were inoculated per petri dish in MS solid medium (<xref ref-type="bibr" rid="ref39">Murashige and Skoog, 1962</xref>) for germination and then maintained in a culture room under white fluorescent lamps (1,900 lux), with a 16h light/8h dark cycle at 25&#x00B0;C for 4&#x2013;7days. Individual germinated seedlings were transferred to a sterile tissue culture bottle containing MS medium supplemented with 30g/L sucrose and kept in a culture room under white fluorescent lamps (1,900 lux), with a 16h light/8h dark cycle at 26&#x00B0;C for 4&#x2013;7weeks.</p>
</sec>
<sec id="sec4">
<title>Total DNA Extraction and Sequencing</title>
<p>Total genomic DNA was used to build sequence libraries (Illumina Inc., San Diego, CA, United States), and it was extracted from leaves using the cetyltrimethylammonium bromide (CTAB) method. The experimental procedures were performed in accordance with the standard protocol provided by Illumina, and sequence libraries were prepared with the NEB Next Ultra DNA Library Prep Kit. An Illumina NovaSeq sequencer was used to sequence paired-end (PE) sequencing libraries with an average 350bp insert length at the Wuhan Benagen Tech Solutions Company Limited. From these data, over 20 million clean reads with a 150bp read length were passed through quality control. Then, we performed nanopore sequencing to facilitate the connection of high-quality contigs into longer complete sequences in subsequent genome assembly.</p>
</sec>
<sec id="sec5">
<title>Chloroplast Genome Assembly</title>
<p>Since the original sequencing data may contain low-quality sequences, adaptor sequences, etc. (<xref ref-type="bibr" rid="ref8">Ewing and Green, 1998</xref>), to ensure the reliability of the information analysis results, the original sequencing data need to be filtered to obtain clean reads, which are stored in FASTQ format (<xref ref-type="bibr" rid="ref4">Cock et al., 2009</xref>). This project used the filtering software SOAPnuke (version: 1.3.0) to remove reads with an N base content of more than 5%, low-quality (quality value less than or equal to 5) reads with a base number of 50%, and reads with adapter contamination.</p>
<p>UniCycler (0.4.8) software was used to assemble the filtered reads (<xref ref-type="bibr" rid="ref36">Wick et al., 2017</xref>). First, high-accuracy Illumina data (Q30&#x003E;85%) were assembled to obtain a high-quality cp genome skeleton (contig), and then nanopore data were used to connect the high-quality contig into a longer complete sequence. The accuracy of the cp genome assembled by this strategy is determined by the accuracy of the Illumina data, and it can effectively avoid the sequence contamination introduced by the error of nanopore data splitting. Finally, Pilon software was used to further correct the Illumina data for errors in the assembled genome, resulting in a genome with higher final accuracy.</p>
<p>Unicycler software was aligned with the close reference genome (NC_043882.1) by blastn (version: BLAST 2.2.30+; parameter: -evalue 1e-5), and the best candidate sequence was selected based on the alignment. The chloroplast genome connection relationship was determined based on the sequence sequencing depth and read alignment and by comparison with closely related species. The connection relationship and whether it forms a loop need to be further experimentally verified.</p>
</sec>
<sec id="sec6">
<title>Gene Structure Prediction</title>
<p>Chloroplast genome function annotation includes coding gene prediction and noncoding RNA annotation [ribosomal RNA (rRNA) and transfer RNA (tRNA) annotation]. CPGAVAS2<xref rid="fn0001" ref-type="fn"><sup>1</sup></xref> was used for gene annotation and drawn based on the results of the cp genome annotation (<xref ref-type="bibr" rid="ref32">Shi et al., 2019</xref>).</p>
</sec>
<sec id="sec7">
<title>Codon Usage Bias Analysis</title>
<p>CodonW (Version: 1.4.4) was used to analyze codon usage bias and statistically estimate the frequency of relative synonymous codon usage (RSCU). Codons with RSCU values &#x003E;1 were defined as high-frequency codons. We conducted a codon usage bias analysis using the following gene sequences: &#x201C;ATG&#x201D;, &#x201C;TTG&#x201D;, &#x201C;CTG&#x201D;, &#x201C;ATT&#x201D;, &#x201C;ATC&#x201D;, &#x201C;GTG&#x201D;, and &#x201C;ATA&#x201D;, which were used as start codons; and &#x201C;TGA&#x201D;, &#x201C;TAG&#x201D;, and &#x201C;TAA&#x201D;, which were used as stop codons; in addition, protein-coding genes shorter than 300bp in length were excluded from the base composition analysis to avoid sampling bias (<xref ref-type="bibr" rid="ref37">Wright, 1990</xref>).</p>
</sec>
<sec id="sec8">
<title>Simple Sequence Repeat and Long Repetitive Sequences Analysis</title>
<p>MISA (<xref ref-type="bibr" rid="ref30">Sebastian et al., 2017</xref>) was used to perform a SSR analysis of the <italic>E. breviscapus</italic> chloroplast [version: 1.0; default parameters: the corresponding minimum number of repetitions of each repeating unit (unit size) are: 1&#x2013;8, 2&#x2013;4, 3&#x2013;4, 4&#x2013;3, 5&#x2013;3, and 6&#x2013;3, with 1&#x2013;8 meaning that when a single nucleotide is used as the repeat unit, the number of repeats must be at least 8 to be detected; see <ext-link xlink:href="http://pgrc.ipk-gatersleben.de/misa/misa.html" ext-link-type="uri">http://pgrc.ipk-gatersleben.de/misa/misa.html</ext-link> for details].</p>
<p>The software vmatch<xref rid="fn0002" ref-type="fn"><sup>2</sup></xref> was used to find scattered long repetitive sequences in the <italic>E. breviscapus</italic> cp genome (<xref ref-type="bibr" rid="ref14">Jean-Simon et al., 2010</xref>). The long repeat sequences included three types: forward (F), palindrome (P), and tandem (T) repeats.</p>
</sec>
<sec id="sec9">
<title>Chloroplast Genome Extraction and PCR Cloning</title>
<p>There is no report on the extraction method of chloroplast of <italic>E. breviscapus</italic> before, therefore, we have developed a method suitable for extracting chloroplast DNA of <italic>E. breviscapus</italic>, which is based on the modified high &#x2013; salt low &#x2013; pH method (<xref ref-type="bibr" rid="ref33">Shi et al., 2012</xref>). Borax was not used in this method, which greatly guaranteed the safety of the operators. In addition, we removed the step of &#x201C;add 1.5ml of 5mol/L KAc (PH 5.2) and continue freezing for 30min. Then centrifuge at 10,000<italic>g</italic> for 15min, and discard the precipitate&#x201D;, this saved extraction time and effectively simplified the extraction step. After extracting high-quality chloroplast genomic DNA of <italic>E. breviscapus</italic>, regular PCR was performed as follows: a 30s initial denaturation step at 98&#x00B0;C, followed by 35cycles of denaturation for 10s at 98&#x00B0;C, annealing for 30s at 55&#x2013;63&#x00B0;C, extension for 30s per kbp sequence length at 72&#x00B0;C, and a final 7min extension step at 72&#x00B0;C. Phusion High-Fidelity DNA Polymerase (New England Biolabs, China) and 2&#x00D7;EasyTaq PCR SuperMix (+dye; TransGen Biotech, China) were utilized for DNA cloning and PCR analysis, respectively. The sequence information of the primers for amplification was provided in <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S1</xref>. The pEASY-Blunt Zero Cloning Kit (TransGen Biotech, China) was utilized for seamless cloning of the selected amplified fragments.</p>
</sec>
<sec id="sec10">
<title>Construction of Transforming Vectors</title>
<p>The synthetic operon constructs for chloroplast transformation (pBtEa1t, pBtEa2t, pBtEat3, and pBtEIat) were based on the empty vector pBluescript SK (+). They were all constructed using the pEASY-Uni Seamless Cloning and Assembly Kit (TransGene Biotech, China), which can be used to recombine the vector linearized by any method and the PCR fragment with 15&#x2013;25bp overlapping region with both ends of the linearized vector. All the fragments needed for chloroplast expression vector construction were then assembled into synthetic operons as follows:</p>
<p>For the first round (Round I) of gene assembly, The empty vector pBluescript SK (+) was digested with Not I and Hind III, and then Eb-PpsbA, <italic>aadA</italic> gene and Eb-TpsbA were cloned into linearized vector and generating construct pBa1, Eb-Prrn, <italic>aadA</italic> gene, and Eb-TrbcL were cloned into linearized vector and generating construct pBa2, Eb-Prrn, <italic>aadA</italic> gene, and Eb-Trps16 were cloned into linearized vector and generating construct pBa3, all the recombinant vectors were verified by Sanger sequencing for the next round of gene assembly.</p>
<p>For the second round (Round II) of gene assembly, the vectors pBa1, pBa2, pBa3, and pBluescript SK (+) were digested with Sal I, and then Eb-PpsbA, <italic>EGFP</italic> gene, and Eb-TpsbA were cloned into linearized vector pBa1 and generating construct pBEa1, Eb-Prrn, <italic>EGFP</italic> gene, and Eb-TrbcL were cloned into linearized vector pBa2 and generating construct pBEa2, Eb-Prrn, <italic>EGFP</italic> gene, and Eb-Trps16 were cloned into linearized vector pBa3 and generating construct pBEa3, Eb-Prrn, <italic>EGFP</italic> gene, Eb-IEE, <italic>aadA</italic> gene, and Eb-Trps16 were cloned into linearized vector pBluescript SK (+) and generating construct pBEIa, all the recombinant vectors were verified by Sanger sequencing for the next round of gene assembly.</p>
<p>For the third round (Round III) of gene assembly, the vectors pBEa1, pBEa2, pBEa3, and pBEIa were digested with Kpn I, and then the homologous recombination fragment Eb-trnI was cloned into linearized vectors and generating construct pBtEa1, pBtEa2, pBtEa3, and pBtEIa, respectively and verified by Sanger sequencing. Finally, the vectors pBtEa1, pBtEa2, pBtEa3, and pBtEIa were digested with Sac I, and then the homologous recombination fragment Eb-trnA was cloned into linearized vectors and generating construct pBtEat1, pBtEat2, pBtEat3, and pBtEIat (<xref ref-type="supplementary-material" rid="SM4">Supplementary Figure S1</xref>) and verified by Sanger sequencing.</p>
</sec>
<sec id="sec11">
<title>Analysis of <italic>aadA</italic> Gene Expression in <italic>E. coli</italic></title>
<p>After PCR identification and Sanger sequencing, four verified vectors (pBtEat1, pBtEat2, pBtEat3, and pBtEIat) and the empty vector pBluescript SK (+) were transformed into the <italic>E. coli</italic> expression strain Transetta (DE3). Then, the transformed <italic>E. coli</italic> strains were grown in Luria-Bertani (LB) medium at 37&#x00B0;C unless otherwise indicated. Spectinomycin and ampicillin were used for selection at appropriate concentration.</p>
</sec>
<sec id="sec12">
<title>Gene Expression Analysis of <italic>EGFP</italic></title>
<p>After culturing in 20ml of LB medium for 14&#x2013;16h at 16&#x00B0;C (150&#x2013;180rpm), <italic>EGFP</italic> gene expression in transformed <italic>E. coli</italic> strains was examined using a Thermo Scientific Varioskan LUX Multimode Microplate Reader (Thermo Fisher Scientific, MA, United States). The loading volume was 200&#x03BC;l, and the &#x201C;Fluorescence&#x201D; protocol was executed in Skanlt Software 6.1 RE, which included an excitation wavelength at 485nm and an emission wavelength at 520nm. Subsequently, the fluorescence intensity of <italic>E. coli</italic> strains containing different vectors was counted. Each sample was repeated for three times to ensure the credibility of the data, in order to compare the effects of different combinations of chloroplast regulatory elements on the expression level of green fluorescent protein in <italic>E. coli</italic>, the expression level of fluorescent protein in each sample was compared by unpaired <italic>t</italic>-test (<italic>p</italic>&#x003C;0.05), the results are represented as a bar graph.</p>
</sec>
<sec id="sec13">
<title>Statistical Analysis</title>
<p>The statistical evaluation was performed using SPSS 19.0 software (IBM Corp., United States). All data were presented as the means&#x00B1;SE of at least three replicates. ANOVA was followed by a <italic>t</italic>-test. Mean values were considered significantly different at <italic>p</italic>&#x003C;0.05.</p>
</sec>
</sec>
<sec id="sec14" sec-type="results">
<title>Results</title>
<sec id="sec15">
<title>Chloroplast Assembly and Genome Features</title>
<p>This project adopted the whole genome shotgun strategy to construct a 350bp library from the DNA samples of <italic>E. breviscapus</italic> and then used the Illumina NovaSeq sequencer to perform PE sequencing on this library, and the reads were 150bp in length. In this sequencing, the raw data sequencing volume of the <italic>E. breviscapus</italic> sample was 4.2Gbp. After data filtering, the clean data volume of the sample was 4.1Gbp (<xref rid="tab1" ref-type="table">Table 1</xref>). The original sequencing statistics of the third-generation nanopore sequencing were shown in <xref ref-type="supplementary-material" rid="SM2">Supplementary Table S2</xref>.</p>
<p>The <italic>E. breviscapus</italic> cp genome was completely assembled into a single molecule of 152,164bp, the assembly result of the cp genome of <italic>E. breviscapus</italic> contained 0 Gap, the GC content was 37.20, the assembly coverage was 332.55X, and the sequencing quality value (Phred) was above 30 (<xref rid="fig1" ref-type="fig">Figure 1</xref>), indicating that the sequencing quality was high.</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption><p>Sequencing data volume information statistics.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top">Sample name</th>
<th align="center" valign="top">Length (bp)</th>
<th align="center" valign="top">Raw reads</th>
<th align="center" valign="top">Raw bases (bp)</th>
<th align="center" valign="top">Clean reads</th>
<th align="center" valign="top">Clean bases (bp)</th>
<th align="center" valign="top">GC (%)</th>
<th align="center" valign="top">Q20 (%)</th>
<th align="center" valign="top">Q30 (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top"><italic>Erigeron breviscapus</italic></td>
<td align="center" valign="top">150</td>
<td align="center" valign="top">27,679,808</td>
<td align="center" valign="top">4,151,971,200</td>
<td align="center" valign="top">27,467,550</td>
<td align="center" valign="top">4,120,132,500</td>
<td align="center" valign="top">37.73</td>
<td align="center" valign="top">98.55</td>
<td align="center" valign="top">95.37</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption><p>Base quality value distribution graph of <italic>E. breviscapus</italic> sample. The graph reflects the average sequencing quality (Y-axis) of read 1 and read 2 along the Read Cycle (X-axis).</p></caption>
<graphic xlink:href="fpls-12-758290-g001.tif"/>
</fig>
<p>The conserved size and quadripartite structure of the complete <italic>E. breviscapus</italic> cp genome was similar to the cp genomes of previously published data. These genomes were usually 152kb in size and included two inverted repeat regions (IRs) and two single-copy regions, namely, a large single copy and small single copy (LSC and SSC, respectively) region.</p>
<p>The cp genome of <italic>E. breviscapus</italic> consisted of two single-copy regions isolated by two identical IR regions of 24,699bp each, one LSC region of 84,657bp and one SSC region of 18,109bp. The proportions of LSC, SSC, and IR sizes in the entire cp genome were 55.6, 11.9, and 32.5%, respectively (<xref rid="fig2" ref-type="fig">Figure 2</xref>; <xref rid="tab2" ref-type="table">Table 2</xref>). The GC contents of the LSC, SSC, and IR and whole cp genome were 35.06, 31.00, 43.13, and 37.20%, respectively, which were consistent with the published <italic>E. breviscapus</italic> cp genomes (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table S3</xref>). The cp genome of <italic>E. breviscapus</italic> was compared to previously published data, no structural differences were found among all compared data, it demonstrated that gene order, gene content, and entire genome structure were conserved in <italic>E. breviscapus</italic> cp genome.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption><p>Circular map of the <italic>E. breviscapus</italic> cp genome. There are four rings on the cp genome map. From the center to the outside, the first circle shows the forward and reverse repetitions connected by the red arc and the green arc respectively; The second circle shows the tandem repeat marked with a short bar; The third circle is the microsatellite sequence identified by MISA; the fourth circle was drawn with genemap to show the gene structure on the plastids, Genes shown outside and inside of the fourth circle are transcribed counterclockwise and clockwise, respectively; the colors of genes are distinguished according to their functional classification.</p></caption>
<graphic xlink:href="fpls-12-758290-g002.tif"/>
</fig>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption><p>Chloroplast structure statistics of <italic>Erigeron breviscapus</italic> samples.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top">Region</th>
<th align="center" valign="top">Start</th>
<th align="center" valign="top">End</th>
<th align="center" valign="top">Length</th>
<th align="center" valign="top">A (%)</th>
<th align="center" valign="top">T (%)</th>
<th align="center" valign="top">C (%)</th>
<th align="center" valign="top">G (%)</th>
<th align="center" valign="top">AT (%)</th>
<th align="center" valign="top">GC (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">Total genome</td>
<td align="center" valign="middle">1</td>
<td align="center" valign="middle">152,164</td>
<td align="center" valign="middle">152,164</td>
<td align="center" valign="middle">31.36</td>
<td align="center" valign="middle">31.45</td>
<td align="center" valign="middle">18.54</td>
<td align="center" valign="middle">18.65</td>
<td align="center" valign="middle">62.80</td>
<td align="center" valign="middle">37.20</td>
</tr>
<tr>
<td align="left" valign="middle">LSC</td>
<td align="center" valign="middle">1</td>
<td align="center" valign="middle">84,657</td>
<td align="center" valign="middle">84,657</td>
<td align="center" valign="middle">32.29</td>
<td align="center" valign="middle">32.65</td>
<td align="center" valign="middle">17.25</td>
<td align="center" valign="middle">17.80</td>
<td align="center" valign="middle">64.94</td>
<td align="center" valign="middle">35.06</td>
</tr>
<tr>
<td align="left" valign="middle">IRA</td>
<td align="center" valign="middle">84,658</td>
<td align="center" valign="middle">109,356</td>
<td align="center" valign="middle">24,699</td>
<td align="center" valign="middle">28.37</td>
<td align="center" valign="middle">28.50</td>
<td align="center" valign="middle">20.73</td>
<td align="center" valign="middle">22.40</td>
<td align="center" valign="middle">56.87</td>
<td align="center" valign="middle">43.13</td>
</tr>
<tr>
<td align="left" valign="middle">SSC</td>
<td align="center" valign="middle">109,357</td>
<td align="center" valign="middle">127,465</td>
<td align="center" valign="middle">18,109</td>
<td align="center" valign="middle">34.97</td>
<td align="center" valign="middle">34.03</td>
<td align="center" valign="middle">16.33</td>
<td align="center" valign="middle">14.67</td>
<td align="center" valign="middle">69.00</td>
<td align="center" valign="middle">31.00</td>
</tr>
<tr>
<td align="left" valign="middle">IRB</td>
<td align="center" valign="middle">127,466</td>
<td align="center" valign="middle">152,164</td>
<td align="center" valign="middle">24,699</td>
<td align="center" valign="middle">28.50</td>
<td align="center" valign="middle">28.37</td>
<td align="center" valign="middle">22.40</td>
<td align="center" valign="middle">20.73</td>
<td align="center" valign="middle">56.87</td>
<td align="center" valign="middle">43.13</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="sec16">
<title>Gene Content and Structure</title>
<p>The cp genome of <italic>E. breviscapus</italic> consisted of 127 coding regions made up of 83 protein-coding genes, 36 tRNAs, and eight rRNAs, of which 95 genes were unique and 16 genes were repeated in two inverted regions consisting of five protein coding genes, seven tRNAs, and four rRNAs (<xref rid="fig2" ref-type="fig">Figure 2</xref>; <xref rid="tab3" ref-type="table">Table 3</xref>). Among these 95 unique genes, one gene crossed different cp genome boundaries: <italic>rps12</italic> crossed two IR regions, the LSC region and the SSC region (two 3&#x2032; end exons repeated in IRs and 5&#x2032; end exon situated in LSC; <xref rid="fig2" ref-type="fig">Figure 2</xref>). Of the remaining 94 genes, 82 were situated in the LSC, including 61 protein-coding genes and 21 tRNAs, and 12 were situated in the SSC, including 11 protein-coding genes and one tRNA.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption><p>List of genes in the <italic>E. breviscapus</italic> cp genome.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top">Gene group</th>
<th align="center" valign="top">Gene name</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">NADH dehydrogenase</td>
<td align="left" valign="middle"><italic>ndh</italic>, <italic>ndhI</italic>, <italic>ndhJ</italic>, <italic>ndhK</italic>, <italic>ndhA</italic>, <italic>ndhH</italic>, <italic>ndhD</italic>, <italic>ndhC</italic>, <italic>ndhB</italic>, <italic>ndhG</italic>, and <italic>ndhE</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Other genes</td>
<td align="left" valign="middle"><italic>ccsA</italic>, <italic>cemA</italic>, and <italic>accD</italic></td>
</tr>
<tr>
<td align="left" valign="middle">RNA polymerase</td>
<td align="left" valign="middle"><italic>rpoC2</italic>, <italic>rpoA</italic>, <italic>rpoB</italic>, and <italic>rpoC1</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Transfer RNAs</td>
<td align="left" valign="middle"><italic>trnS-GCU</italic>, <italic>trnD-GUC</italic>, <italic>trnH-GUG</italic>, <italic>trnM-CAU</italic>, <italic>trnR-ACG</italic>, <italic>trnF-GAA</italic>, <italic>trnY-GUA</italic>, <italic>trnP-UGG</italic>, <italic>trnT-GGU</italic>, <italic>trnS-GGA</italic>, <italic>trnL-UAA</italic>, <italic>trnC-GCA</italic>, <italic>trnV-GAC</italic>, <italic>trnG-GCC</italic>, <italic>trnL-CAA</italic>, <italic>trnS-UGA</italic>, <italic>trnQ-UUG</italic>, <italic>trnR-UCU</italic>, <italic>trnS-CGA</italic>, <italic>trnA-UGC</italic>, <italic>trnL-UAG</italic>, <italic>trnT-UGU</italic>, <italic>trnE-UUC</italic>, <italic>trnN-GUU</italic>, <italic>trnK-UUU</italic>, and <italic>trnW-CCA</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Ribosomal proteins (SSU)</td>
<td align="left" valign="middle"><italic>rps18</italic>, <italic>rps19</italic>, <italic>rps2</italic>, <italic>rps7</italic>, <italic>rps16</italic>, <italic>rps15</italic>, <italic>rps8</italic>, <italic>rps11</italic>, <italic>rps12</italic>, <italic>rps4</italic>, <italic>rps3</italic>, and <italic>rps14</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Maturase</td>
<td align="left" valign="middle"><italic>matK</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Cytochrome b/f complex</td>
<td align="left" valign="middle"><italic>petA</italic>, <italic>petL</italic>, <italic>petG</italic>, <italic>petB</italic>, <italic>petD</italic>, and <italic>petN</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Photosystem II</td>
<td align="left" valign="middle"><italic>psbK</italic>, <italic>psbT</italic>, <italic>psbB</italic>, <italic>psbH</italic>, <italic>psbD</italic>, <italic>psbF</italic>, <italic>psbZ</italic>, <italic>psbC</italic>, <italic>psbM</italic>, <italic>psbA</italic>, <italic>psbN</italic>, <italic>psbJ</italic>, <italic>psbI</italic>, and <italic>psbE</italic></td>
</tr>
<tr>
<td align="left" valign="middle">ATP synthase</td>
<td align="left" valign="middle"><italic>atpI</italic>, <italic>atpA</italic>, <italic>atpE</italic>, <italic>atpF</italic>, <italic>atpB</italic>, and <italic>atpH</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Photosystem I</td>
<td align="left" valign="middle"><italic>psaJ</italic>, <italic>psaC</italic>, <italic>psaB</italic>, <italic>psaA</italic>, and <italic>psaI</italic></td>
</tr>
<tr>
<td align="left" valign="middle">RubisCO large subunit</td>
<td align="left" valign="middle"><italic>rbcL</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Ribosomal proteins (LSU)</td>
<td align="left" valign="middle"><italic>rpl20</italic>, <italic>rpl32</italic>, <italic>rpl23</italic>, <italic>rpl33</italic>, <italic>rpl16</italic>, <italic>rpl2</italic>, <italic>rpl14</italic>, <italic>rpl36</italic>, and <italic>rpl22</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Translational initiation factor</td>
<td align="left" valign="middle"><italic>infA</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Ribosomal RNAs</td>
<td align="left" valign="middle"><italic>rrn23S</italic>, <italic>rrn16S</italic>, <italic>rrn5S</italic>, and <italic>rrn4.5S</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Protease</td>
<td align="left" valign="middle"><italic>clpP</italic></td>
</tr>
<tr>
<td align="left" valign="middle">Hypothetical chloroplast reading frames (ycf)</td>
<td align="left" valign="middle"><italic>ycf2</italic>, <italic>ycf3</italic>, and <italic>ycf4</italic></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The <italic>E. breviscapus</italic> cp genome was composed of protein coding genes, tRNAs, rRNAs, and intronic and intergenic regions. Noncoding DNA accounted for 68,539bp (45.0%) of the whole <italic>E. breviscapus</italic> cp genome, protein-coding genes accounted for 71,850bp (47.2%), tRNA accounted for 2,725bp (1.8%), and rRNA accounted for 9,050bp (6.0%).</p>
<p>Most of the protein-coding genes contained only one exon, while 13 genes contained at least one intron, of which eight genes were distributed in the LSC, one was distributed in the SSC and four were distributed in both IRs (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table S4</xref>). Among them, three genes (<italic>rps12</italic>, <italic>clpP</italic>, and <italic>ycf3</italic>) contained two introns, while 10 genes (<italic>trnA-UGC</italic>, <italic>trnE-UUC</italic>, <italic>trnL-UAA</italic>, <italic>rpl2</italic>, <italic>ndhA</italic>, <italic>ndhB</italic>, <italic>petB</italic>, <italic>atpF</italic>, <italic>rpoC1</italic>, and <italic>trnK-UUU</italic>) contained one intron. The longest intron of <italic>trnK-UUU</italic> was 2,543bp, and it included the 1,515bp encoding the <italic>matK</italic> gene (<xref ref-type="bibr" rid="ref13">Gu et al., 2016</xref>). The <italic>rps12</italic> gene was predicted to be trans-spliced with a repeated 3&#x2032; end duplicated in the two IRs and a single 5&#x2032; end exon in the LSC (<xref ref-type="bibr" rid="ref28">Redwan et al., 2015</xref>).</p>
</sec>
<sec id="sec17">
<title>Statistics of Codon Usage Bias</title>
<p>Relative synonymous codon usage is an assessment of synonymous codon usage bias. This value is equal to the ratio of the actual observed value of synonymous codons to the expected value of the average use of synonymous codons. If there is no bias for codon usage, then the RSCU value will be 1; if the codon is used more frequently than other synonymous codons, then its RSCU value will be greater than 1; otherwise, the RSCU value will be less than 1 (<xref ref-type="bibr" rid="ref26">Nie et al., 2012</xref>; <xref ref-type="bibr" rid="ref21">Liu et al., 2020</xref>).</p>
<p>The results of the RSCU value analysis (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table S5</xref>) of various amino acids showed that there were 31 codons with RSCU&#x2265;1, of which 12 codons, such as UUA, AGA, CAA, and GGA, end in A; 16 codons, such as UCU, GCU, ACU, and UAU, end in U; and the only ones ending with G are UUG, UGG, and AUG. These findings indicated that the codons ending with A and U in the chloroplast genome of <italic>E. breviscapus</italic> appeared more frequently and are preferred codons, while the codons ending with C and G were non-preferred codons.</p>
</sec>
<sec id="sec18">
<title>Chloroplast Genome Simple Sequence Repeats</title>
<p>Simple sequence repeats are sequences with motifs from 1 to 6bp in length repeated multiple times (see section &#x201C;Materials and Methods&#x201D; for cutoff criteria), and they are distributed throughout the cp genome and often used as markers for breeding studies, population genetics, and genetic linkage mapping (<xref ref-type="bibr" rid="ref12">Grassi et al., 2002</xref>; <xref ref-type="bibr" rid="ref34">Timme et al., 2007</xref>).</p>
<p>A total of 223 SSRs were found in the <italic>E. breviscapus</italic> cp genome (<xref rid="fig3" ref-type="fig">Figure 3</xref>; <xref ref-type="supplementary-material" rid="SM3">Supplementary Table S6</xref>). These SSRs included 142 complex repeat-type SSRs (63%), 50 dinucleotide SSRs (22%), 12 trinucleotide SSRs (5%), 14 tetranucleotide (6%), four pentanucleotide SSRs (1%), and one hexanucleotide SSR (0.003%; <xref rid="fig3" ref-type="fig">Figure 3A</xref>; <xref ref-type="supplementary-material" rid="SM3">Supplementary Table S6</xref>). Among the 223 SSRs, 87% of SSRs (195) were the AT type, with copy numbers from 8 to 23 (<xref ref-type="supplementary-material" rid="SM3">Supplementary Table S6</xref>). Forty-eight SSRs were detected in protein-coding genes, 19 SSRs were detected in introns, and 156 were detected in intergenic regions (<xref rid="fig3" ref-type="fig">Figure 3C</xref>). In relation to the quadripartite, 151 SSRs were situated in the LSC, while 32 and 40 were identified in the IR and SSC, respectively (<xref rid="fig3" ref-type="fig">Figure 3B</xref>).</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption><p>The type, distribution, and presence of simple sequence repeats (SSRs) in <italic>E. breviscapus</italic> cp genome. <bold>(A)</bold> Number of different SSR types detected in <italic>E. breviscapus</italic> cp genome. <bold>(B)</bold> Frequency of SSRs in the large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions. <bold>(C)</bold> Frequency of SSRs in the protein-coding regions, intergenic spacers, and intron.</p></caption>
<graphic xlink:href="fpls-12-758290-g003.tif"/>
</fig>
</sec>
<sec id="sec19">
<title>Identification of Long Repetitive Sequences</title>
<p>Long repetitive sequences may promote the rearrangement of the cp genome and increase population genetic diversity (<xref ref-type="bibr" rid="ref2">Cavalier-Smith, 2002</xref>). A total of three unique repeats were found in the <italic>E. breviscapus</italic> cp genome, the size of two palindromic repeats were 48bp and 24,699bp, and the forward repeat was 39bp in size. The palindromic repeat which was 48bp in size had two same starting sites in the chloroplast genome sequence, these two starting sites were located at 74,576bp, another palindromic repeat&#x2019;s two starting site were located at 84,657 and 127,465bp respectively, the first starting site of the forward repeat located at 43,603bp and the second starting site located at 119,435bp. In addition, forward repeats are usually caused by transposon activity (<xref ref-type="bibr" rid="ref11">Gemayel et al., 2012</xref>), which will increase under cell stress (<xref ref-type="bibr" rid="ref35">Voronova et al., 2014</xref>). However, the origin and production mechanism of long repetitive sequences are not yet fully understood. Previous studies have shown that slipped-strand mispairing and inapposite recombination of repetitive sequences may lead to genome rearrangement (<xref ref-type="bibr" rid="ref34">Timme et al., 2007</xref>). Moreover, forward repeats can cause changes in genome structure; therefore, they can be used as markers in phylogenetic studies.</p>
</sec>
<sec id="sec20">
<title>Cloning of Chloroplast Regulatory Elements of <italic>E. breviscapus</italic></title>
<p>Before cloning the target sequence, we first obtained the specific sequence information of the target sequence through previous research, and this information included the sequence length, putative transcriptional elements, and secondary structure. Then, through a sequence alignment, secondary structure analysis, etc., the specific information of these target sequences was initially obtained from the cp genome of <italic>E. breviscapus</italic> (<xref rid="tab4" ref-type="table">Table 4</xref>).</p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption><p>Regulatory elements of <italic>E. breviscapus</italic> cp genome.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top">Feature</th>
<th align="center" valign="top">Type</th>
<th align="center" valign="top">Size (bp)</th>
<th align="center" valign="top" colspan="2">Transcriptional elements</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">Eb-trnI</td>
<td align="left" valign="middle">Site of integration</td>
<td align="center" valign="middle">1,322</td>
<td align="left" valign="middle" colspan="2">-</td>
</tr>
<tr>
<td align="left" valign="middle">Eb-trnA</td>
<td align="left" valign="middle">Site of integration</td>
<td align="center" valign="middle">936</td>
<td align="left" valign="middle" colspan="2">-</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="4">Eb-Prrn</td>
<td align="left" valign="middle" rowspan="4">Promoter+rbcL5'UTR</td>
<td align="center" valign="middle" rowspan="4">135</td>
<td align="left" valign="middle">&#x2212;10 consensus sequence</td>
<td align="left" valign="middle">TATATT</td>
</tr>
<tr>
<td align="left" valign="middle">&#x2212;35 consensus sequence</td>
<td align="left" valign="middle">TTGACG</td>
</tr>
<tr>
<td align="left" valign="middle">GAA-box</td>
<td align="left" valign="middle">GAA</td>
</tr>
<tr>
<td align="left" valign="middle">SD-like sequence</td>
<td align="left" valign="middle">GGAGG</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="5">Eb-PpsbA</td>
<td align="left" valign="middle" rowspan="5">Promoter+psbA5'UTR</td>
<td align="center" valign="middle" rowspan="5">244</td>
<td align="left" valign="middle">&#x2212;10 consensus sequence</td>
<td align="left" valign="middle">TATACT</td>
</tr>
<tr>
<td align="left" valign="middle">&#x2212;35 consensus sequence</td>
<td align="left" valign="middle">TTGACA</td>
</tr>
<tr>
<td align="left" valign="middle">Ribosome binding site 1</td>
<td align="left" valign="middle">AAG</td>
</tr>
<tr>
<td align="left" valign="middle">Ribosome binding site 2</td>
<td align="left" valign="middle">TGATAAT</td>
</tr>
<tr>
<td align="left" valign="middle">AU-box</td>
<td align="left" valign="middle">TAAATAAA</td>
</tr>
<tr>
<td align="left" valign="middle">Eb-TpsbA</td>
<td align="left" valign="middle">3'UTR</td>
<td align="center" valign="middle">138</td>
<td align="left" valign="middle" colspan="2">Stem-loop</td>
</tr>
<tr>
<td align="left" valign="middle">Eb-TrbcL</td>
<td align="left" valign="middle">3'UTR</td>
<td align="center" valign="middle">309</td>
<td align="left" valign="middle" colspan="2">Stem-loop</td>
</tr>
<tr>
<td align="left" valign="middle">Eb-Trps16</td>
<td align="left" valign="middle">3'UTR</td>
<td align="center" valign="middle">303</td>
<td align="left" valign="middle" colspan="2">Stem-loop</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="3">Eb-IEE</td>
<td align="left" valign="middle" rowspan="3">IEE+rbcL5'UTR</td>
<td align="center" valign="middle" rowspan="3">67</td>
<td align="left" valign="middle">Stem-loop</td>
<td align="left" valign="middle">-</td>
</tr>
<tr>
<td align="left" valign="middle">S-box</td>
<td align="left" valign="middle">AAGTCAA</td>
</tr>
<tr>
<td align="left" valign="middle">SD-like sequence</td>
<td align="left" valign="middle">GGAGG</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Using <italic>E. breviscapus</italic> genomic DNA as a template, the target gene fragment was amplified with the primers shown in <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S1</xref>. As shown in <xref rid="fig4" ref-type="fig">Figure 4</xref>, all amplified bands were the same size as the target gene fragment. Eb-Prrn was a fragment containing the rRNA operon promoter from <italic>E. breviscapus</italic> which was PCR amplified with a Shine-Dalgarno (SD) sequence derived from the chloroplast <italic>rbcL</italic> gene, Eb-psbA was a fragment containing the <italic>E. breviscapus</italic> psbA promoter and the <italic>E. breviscapus</italic> psbA leader. Eb-TpsbA, Eb-TrbcL, and Eb-Trps16 were the terminator of <italic>psbA</italic>, <italic>rbcL</italic> and <italic>rps16</italic> gene, respectively. Eb-IEE was an intercistronic expression element conferring intercistronic RNA processing and, in this way, enhancing expression of downstream cistrons of the operon. Eb-trnI was a fragment containing the <italic>trnI</italic> gene and part of the flanking sequence of this gene and Eb-trnA was a fragment containing the <italic>trnA</italic> gene and part of the flanking sequence of this gene. The cloned chloroplast regulatory element sequence can be used for subsequent vector construction.</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption><p>PCR amplification of chloroplast regulatory elements of <italic>E. breviscapus</italic>.</p></caption>
<graphic xlink:href="fpls-12-758290-g004.tif"/>
</fig>
</sec>
<sec id="sec21">
<title>Analysis of <italic>aadA</italic> Gene Expression in <italic>E. coli</italic></title>
<p>The verified vectors pBtEat1, pBtEat2, pBtEat3, and pBtEIat were transformed into the <italic>E. coli</italic> expression strain Transetta (DE3). As shown in <xref rid="fig5" ref-type="fig">Figure 5</xref>, after incubation, <italic>E. coli</italic> containing the empty vector could not grow on LB medium containing ampicillin and spectinomycin (<xref rid="fig5" ref-type="fig">Figure 5A</xref>). However, <italic>E. coli</italic> containing the recombinant vector could be grown on LB medium containing ampicillin and spectinomycin (<xref rid="fig5" ref-type="fig">Figures 5B</xref>&#x2013;<xref rid="fig5" ref-type="fig">E</xref>), indicating that the selected regulatory elements had prokaryotic characteristics and could mediate the correct expression of foreign genes.</p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption><p>The expression of <italic>aadA</italic> gene in <italic>Escherichia coli</italic>. <bold>(A)</bold>: <italic>Escherichia coli</italic> containing empty vector, <bold>(B&#x2013;E)</bold>: <italic>Escherichia coli</italic> containing recombinant vector.</p></caption>
<graphic xlink:href="fpls-12-758290-g005.tif"/>
</fig>
</sec>
<sec id="sec22">
<title>Detection of <italic>EGFP</italic> Expression</title>
<p>As shown in <xref rid="fig6" ref-type="fig">Figure 6</xref>, the fluorescence intensity of the <italic>E. coli</italic> expression strain containing the recombinant vector was higher than that of the control test group. Among them, the expression strain containing the recombinant vector pBtEat3 (containing the Prrn-EGFP-Trps16 expression cassette) had the highest fluorescence intensity, which showed that the promoters Prrn and 3&#x2032; UTR Trps16 could mediate the high-efficiency expression of foreign genes in <italic>E. coli</italic>. In this study, the <italic>EGFP</italic> gene was efficiently expressed in <italic>E. coli</italic> transformed with the <italic>E. breviscapus</italic> chloroplast expression vector, suggesting that the <italic>EGFP</italic> gene may also be highly expressed in <italic>E. breviscapus</italic> chloroplasts.</p>
<fig position="float" id="fig6">
<label>Figure 6</label>
<caption><p>The expression level of <italic>EGFP</italic> gene in <italic>E. coli</italic>. Data are presented as mean&#x00B1;SE (<italic>n</italic>=3). The asterisk represents a significant difference (<sup>&#x002A;</sup><italic>p</italic>&#x003C;0.05).</p></caption>
<graphic xlink:href="fpls-12-758290-g006.tif"/>
</fig>
</sec>
</sec>
<sec id="sec23" sec-type="discussions">
<title>Discussion</title>
<p>The present work was to use high-throughput sequencing technology to systematically and comprehensively analyze the chloroplast genome of <italic>E. breviscapus</italic>. On this basis, we used the prokaryotic expression system to screen out high-efficiency chloroplast regulatory elements, which laid the foundation for the subsequent establishment of the chloroplast genetic transformation system of <italic>E. breviscapus</italic>.</p>
<p>Although, there have been reported on the chloroplast genome of <italic>E. breviscapus</italic>, since the <italic>E. breviscapus</italic> used in this study is cultivated species, It is difficult to rule out differences in genetic background between cultivated species and other varieties. In order to have a clearer understanding of the genetic background of the chloroplast of <italic>E. breviscapus</italic> used in this article, as well as to provide an accurate reference for the selection of chloroplast regulatory elements and the efficient expression of foreign genes for chloroplast transformation, we combined second-generation and third-generation sequencing technologies, the advantage of this method is the long read lengths (<xref ref-type="bibr" rid="ref7">Eid et al., 2009</xref>), which facilitate <italic>de novo</italic> genome assembly, particularly in the four chloroplast junctions between the single-copy regions and IR regions. At the same time, using the Illumina short read length can also improve the low accuracy of the long reads generated by the nanopore platform (~85% of the raw data; <xref ref-type="bibr" rid="ref28">Redwan et al., 2015</xref>). The chloroplast genome assembly statistics and bioinformatics analysis data in this article were compared with the previously published data. It was obviously that the chloroplast genome of <italic>E. breviscapus</italic> reported in the three articles were conserved in gene order, gene content, and entire genome structure, but previous reports only provided a little bit of genome information and have not provided a more systematic and comprehensive bioinformatics analysis of the chloroplast genome of <italic>E. breviscapus</italic>. Therefore, we conducted an in-depth analysis of chloroplast genome data of <italic>E. breviscapus</italic>, including a systematic classification of chloroplast genes, SSR analysis, codon usage bias analysis, long repetitive sequences analysis, and this process allowed us to further understand the genomic structure and genetic background of the chloroplast of <italic>E. breviscapus</italic>. On this basis, we successfully screened the target chloroplast regulatory elements from the cp genome of <italic>E. breviscapus</italic> through an extensive literature research and accurate bioinformatics analysis. Active promoters with suitable 5&#x2032; UTR and 3&#x2032; UTR are important constitutions of efficient transformation and expression vectors. It is generally thought that the use of these endogenous genes can greatly promote the integration and expression of foreign genes in the host genome (<xref ref-type="bibr" rid="ref27">Radakovits et al., 2010</xref>). In this work, two homologous recombinant fragments, two high-efficiency promoters, three 3&#x2032; UTR sequences, one SD sequence and one intercistronic expression element were identified from the chloroplasts of <italic>E. breviscapus</italic>.</p>
<p>Sequence analysis was performed on the chloroplast regulatory elements for prediction of its potential cis-acting elements involved with transcription initiation, regulation, or termination. In the Eb-PpsbA and Eb-Prrn sequence, &#x2212;10 consensus sequence, and -35 consensus sequence were detected. The &#x2212;10 consensus sequence is located at the core promoter region and involved with accurate transcription initiation. The &#x2212;35 consensus sequence commonly exists in many prokaryotic promoters, which acts as an enhancer region. Translation initiation occurs on a specific sequence called the ribosome binding site, this is a short base sequence located in front of the coding region. This site can be complementary to the 3&#x2032; end of the ribosomal 16S rRNA, which promotes the binding of the ribosome to the mRNA and facilitates the initiation of translation. Two ribosome binding sites were identified from Eb-PpsbA, since Eb-Prrn lacks a ribosome binding site, we added a ribosome site from the <italic>rbcL</italic> gene of <italic>E. breviscapus</italic> chloroplast at the 3&#x2032; end of Eb-Prrn by PCR amplification to ensure that the sequence can have transcription and translation initiation function. The 3&#x2032; UTR sequence of the prokaryotic expression system can ensure the stability of mRNA and facilitate the accumulation of protein level, these 3&#x2032; UTR sequences usually contain a stable stem-loop structure (<xref ref-type="bibr" rid="ref25">Monde et al., 2000</xref>). Through, the RNA secondary structure prediction analysis of Eb-TpsbA, Eb-TrbcL, and Eb-Trps16, we found that they all have a stable stem-loop structure. Before this report, IEE for the stacking and expression of foreign genes in the chloroplast of <italic>E. breviscapus</italic> had not been identified. Therefore, we preliminarily screened out the endogenous IEE sequence of <italic>E. breviscapus</italic> through sequence comparison and applied it to the construction of the polycistronic expression cassette of the vector.</p>
<p>To ensure that the selected regulatory elements have the correct biological functions, we have further identified these chloroplast regulatory elements through the prokaryotic expression system. The most effective selectable marker used in chloroplast transformation is the <italic>aadA</italic> gene, which confers resistance to streptomycin and spectinomycin. In this study, the transformed <italic>E. coli</italic> can grow on LB medium containing spectinomycin. The result indicated that these regulatory elements could regulate gene expression in <italic>E. coli</italic> and had the correct prokaryotic expression characteristics. Finally, we used a microplate reader to determine the accumulation level of green fluorescent protein <italic>in vitro</italic>, and compared the expression efficiency of different combinations of regulatory elements by fluorescence intensity. Compared with the empty vector, all the transformed <italic>E. coli</italic> exhibited extremely high fluorescence intensity, suggesting that Eb-Prrn and Eb-PpsbA had priming activity, which ensured the efficient transcription of the <italic>EGFP</italic> gene. In addition, the 5&#x2032; UTR regions and ribosome binding sites contained in these two sequences also promote the translation of green fluorescent protein in <italic>E. coli</italic>; Eb-TpsbA, Eb-TrbcL, and Eb-Trps16 can also promote the accumulation of green fluorescent protein. Comparing the fluorescence intensity of pBtEat2 and pBtEat3 strains, we found that different 3&#x2032; UTR sequences (Eb-TrbcL and Eb-Trps16) could cause different accumulation level of green fluorescent protein when <italic>EGFP</italic> gene was driven by Eb-Prrn. The pBtEIat strain could be grown on LB medium with spectinomycin and could be detected with green fluorescence. These results indicated that Eb-IEE could active the expression of <italic>EGFP</italic> gene and <italic>aadA</italic> gene as well as polycistronic expression. Notably, the pBtEat3 strain had the highest fluorescence intensity, which indicated that the combination of Eb-Prrn and Eb-Trps16 had higher transcription and translation efficiency. This combination could be used for subsequent chloroplast transformation system, which is the preferred choice for vector construction.</p>
<p>In conclusion, this study provided complete information on the cp genome of <italic>E. breviscapus</italic> and identified a series of chloroplast expression elements with prokaryotic expression characteristics, and these findings can provide a foundation for the establishment of a chloroplast genetic transformation system of <italic>E. breviscapus</italic>.</p>
</sec>
<sec id="sec24" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The data presented in the study are deposited in the GenBank repository, accession number: OK524211 (Eb-trnI), OK524212 (Eb-trnA), OK524213 (Eb-Prrn), OK524214 (Eb-PpsbA), OK524215 (Eb-Trps16), OK524216 (Eb-TpsbA), OK524217 (Eb-TrbcL), and OK524218 (Eb-IEE).</p>
</sec>
<sec id="sec25">
<title>Author Contributions</title>
<p>YY and YZ wrote the manuscript. YY, YZ, ZO, JG, and WZ performed the experiments. LH and YZ supervised the research. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="sec41" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by the National Key R&#x0026;D Program of China (2020YFA0908000), the National Natural Science Foundation of China (81891013), and the Key Project at Central Government Level: The ability establishment of sustainable use for valuable Chinese Medicine Resources (2060302) to LH, and the Fundamental Research Funds for the Central public welfare research institutes (ZZXT202103) to YZ.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec40" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="sec27" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2021.758290/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpls.2021.758290/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.xlsx" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.xlsx" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Presentation_1.pptx" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.presentationml.presentation" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bock</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>Genetic engineering of the chloroplast: novel tools and new applications</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>26</volume>, <fpage>7</fpage>&#x2013;<lpage>13</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.copbio.2013.06.004</pub-id>, PMID: <pub-id pub-id-type="pmid">24679252</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cavalier-Smith</surname> <given-names>T.</given-names></name></person-group> (<year>2002</year>). <article-title>Chloroplast evolution: secondary symbiogenesis and multiple losses</article-title>. <source>Curr. Biol.</source> <volume>12</volume>, <fpage>R62</fpage>&#x2013;<lpage>R64</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0960-9822(01)00675-3</pub-id>, PMID: <pub-id pub-id-type="pmid">11818081</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>R.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>T.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Xiang</surname> <given-names>X.</given-names></name> <name><surname>Jian</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Integrated transcript and metabolite profiles reveal that EbCHI plays an important role in scutellarin accumulation in <italic>Erigeron breviscapus</italic> hairy roots</article-title>. <source>Front. Plant Sci.</source> <volume>9</volume>:<fpage>789</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpls.2018.00789</pub-id>, PMID: <pub-id pub-id-type="pmid">30013578</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cock</surname> <given-names>P.</given-names></name> <name><surname>Fields</surname> <given-names>C. J.</given-names></name> <name><surname>Naohisa</surname> <given-names>G.</given-names></name> <name><surname>Heuer</surname> <given-names>M. L.</given-names></name> <name><surname>Rice</surname> <given-names>P. M.</given-names></name></person-group> (<year>2009</year>). <article-title>The sanger FASTQ file format for sequences with quality scores, and the solexa/Illumina FASTQ variants</article-title>. <source>Nucleic Acids Res.</source> <volume>38</volume>, <fpage>1767</fpage>&#x2013;<lpage>1771</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkp1137</pub-id>, PMID: <pub-id pub-id-type="pmid">20015970</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daniell</surname> <given-names>H.</given-names></name> <name><surname>Datta</surname> <given-names>R.</given-names></name> <name><surname>Varma</surname> <given-names>S.</given-names></name> <name><surname>Gray</surname> <given-names>S.</given-names></name> <name><surname>Lee</surname> <given-names>S. B.</given-names></name></person-group> (<year>1998</year>). <article-title>Containment of herbicide resistance through genetic engineering of the chloroplast genome</article-title>. <source>Nat. Biotechnol.</source> <volume>16</volume>, <fpage>345</fpage>&#x2013;<lpage>348</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nbt0498-345</pub-id>, PMID: <pub-id pub-id-type="pmid">9555724</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daniell</surname> <given-names>H.</given-names></name> <name><surname>Vivekananda</surname> <given-names>J.</given-names></name> <name><surname>Nielsen</surname> <given-names>B. L.</given-names></name> <name><surname>Ye</surname> <given-names>G. N.</given-names></name> <name><surname>Tewari</surname> <given-names>K. K.</given-names></name> <name><surname>Sanford</surname> <given-names>J. C.</given-names></name></person-group> (<year>1990</year>). <article-title>Transient foreign gene expression in chloroplasts of cultured tobacco cells after biolistic delivery of chloroplast vectors</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>87</volume>, <fpage>88</fpage>&#x2013;<lpage>92</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.87.1.88</pub-id>, PMID: <pub-id pub-id-type="pmid">2404285</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eid</surname> <given-names>J.</given-names></name> <name><surname>Fehr</surname> <given-names>A.</given-names></name> <name><surname>Gray</surname> <given-names>J.</given-names></name> <name><surname>Luong</surname> <given-names>K.</given-names></name> <name><surname>Lyle</surname> <given-names>J.</given-names></name> <name><surname>Otto</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Real-time DNA sequencing from single polymerase molecules</article-title>. <source>Science</source> <volume>323</volume>, <fpage>133</fpage>&#x2013;<lpage>138</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1162986</pub-id>, PMID: <pub-id pub-id-type="pmid">19023044</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ewing</surname> <given-names>B.</given-names></name> <name><surname>Green</surname> <given-names>P.</given-names></name></person-group> (<year>1998</year>). <article-title>Base-calling of automated sequencer traces using Phred. II. Error probabilities</article-title>. <source>Genome Res.</source> <volume>8</volume>, <fpage>186</fpage>&#x2013;<lpage>194</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.8.3.186</pub-id>, PMID: <pub-id pub-id-type="pmid">9521922</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fei</surname> <given-names>Z.</given-names></name> <name><surname>Karcher</surname> <given-names>D.</given-names></name> <name><surname>Bock</surname> <given-names>R.</given-names></name></person-group> (<year>2007</year>). <article-title>Identification of a plastid intercistronic expression element (IEE) facilitating the expression of stable translatable monocistronic mRNAs from operons</article-title>. <source>Plant J.</source> <volume>52</volume>, <fpage>961</fpage>&#x2013;<lpage>972</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-313X.2007.03261.x</pub-id>, PMID: <pub-id pub-id-type="pmid">17825052</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fuentes</surname> <given-names>P.</given-names></name> <name><surname>Zhou</surname> <given-names>F.</given-names></name> <name><surname>Erban</surname> <given-names>A.</given-names></name> <name><surname>Karcher</surname> <given-names>D.</given-names></name> <name><surname>Kopka</surname> <given-names>J.</given-names></name> <name><surname>Bock</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>A new synthetic biology approach allows transfer of an entire metabolic pathway from a medicinal plant to a biomass crop</article-title>. <source>Elife</source> <volume>5</volume>:<fpage>e13664</fpage>. doi: <pub-id pub-id-type="doi">10.7554/eLife.13664</pub-id>, PMID: <pub-id pub-id-type="pmid">27296645</pub-id></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gemayel</surname> <given-names>R.</given-names></name> <name><surname>Cho</surname> <given-names>J.</given-names></name> <name><surname>Boeynaems</surname> <given-names>S.</given-names></name> <name><surname>Verstrepen</surname> <given-names>K. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Beyond junk-variable tandem repeats as facilitators of rapid evolution of regulatory and coding sequences</article-title>. <source>Genes</source> <volume>3</volume>, <fpage>461</fpage>&#x2013;<lpage>480</lpage>. doi: <pub-id pub-id-type="doi">10.3390/genes3030461</pub-id>, PMID: <pub-id pub-id-type="pmid">24704980</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grassi</surname> <given-names>F.</given-names></name> <name><surname>Labra</surname> <given-names>M.</given-names></name> <name><surname>Scienza</surname> <given-names>A.</given-names></name> <name><surname>Imazio</surname> <given-names>S.</given-names></name></person-group> (<year>2002</year>). <article-title>Chloroplast SSR markers to assess DNA diversity in wild and cultivated grapevines</article-title>. <source>Vitis-Geilweilerhof</source> <volume>41</volume>, <fpage>157</fpage>&#x2013;<lpage>158</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1017573817633</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>C.</given-names></name> <name><surname>Tembrock</surname> <given-names>L. R.</given-names></name> <name><surname>Johnson</surname> <given-names>N. G.</given-names></name> <name><surname>Simmons</surname> <given-names>M. P.</given-names></name> <name><surname>Wu</surname> <given-names>Z.</given-names></name></person-group> (<year>2016</year>). <article-title>The complete plastid genome of lagerstroemia fauriei and loss of rpl2 intron from lagerstroemia (Lythraceae)</article-title>. <source>PLoS One</source> <volume>11</volume>:<fpage>e0150752</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0150752</pub-id>, PMID: <pub-id pub-id-type="pmid">26950701</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jean-Simon</surname> <given-names>B.</given-names></name> <name><surname>Christian</surname> <given-names>O.</given-names></name> <name><surname>Claude</surname> <given-names>L.</given-names></name> <name><surname>Monique</surname> <given-names>T.</given-names></name></person-group> (<year>2010</year>). <article-title>The exceptionally large chloroplast genome of the green alga floydiella terrestris illuminates the evolutionary history of the chlorophyceae</article-title>. <source>Genome Biol. Evol.</source> <volume>2</volume>, <fpage>240</fpage>&#x2013;<lpage>256</lpage>. doi: <pub-id pub-id-type="doi">10.1093/gbe/evq014</pub-id>, PMID: <pub-id pub-id-type="pmid">20624729</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>N. H.</given-names></name> <name><surname>Zhang</surname> <given-names>G. H.</given-names></name> <name><surname>Zhang</surname> <given-names>J. J.</given-names></name> <name><surname>Shu</surname> <given-names>L. P.</given-names></name> <name><surname>Wei</surname> <given-names>Z.</given-names></name> <name><surname>Long</surname> <given-names>G. Q.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Analysis of the transcriptome of <italic>Erigeron breviscapus</italic> uncovers putative scutellarin and chlorogenic acids biosynthetic genes and genetic markers</article-title>. <source>PLoS One</source> <volume>9</volume>:<fpage>e100357</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0100357</pub-id>, PMID: <pub-id pub-id-type="pmid">24956277</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>R. R.</given-names></name> <name><surname>Mullet</surname> <given-names>J. E.</given-names></name></person-group> (<year>1987</year>). <article-title>Control of gene expression during higher plant chloroplast biogenesis. Protein synthesis and transcript levels of psbA, psaA-psaB, and rbcL in dark-grown and illuminated barley seedlings</article-title>. <source>J. Biol. Chem.</source> <volume>262</volume>, <fpage>4341</fpage>&#x2013;<lpage>4348</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0021-9258(18)61353-5</pub-id>, PMID: <pub-id pub-id-type="pmid">3558409</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krichevsky</surname> <given-names>A.</given-names></name> <name><surname>Meyers</surname> <given-names>B.</given-names></name> <name><surname>Vainstein</surname> <given-names>A.</given-names></name> <name><surname>Maliga</surname> <given-names>P.</given-names></name> <name><surname>Citovsky</surname> <given-names>V.</given-names></name> <name><surname>Uversky</surname> <given-names>V. N.</given-names></name></person-group> (<year>2010</year>). <article-title>Autoluminescent plants</article-title>. <source>PLoS One</source> <volume>5</volume>:<fpage>e15461</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0015461</pub-id>, PMID: <pub-id pub-id-type="pmid">21103397</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>N. G.</given-names></name> <name><surname>Liang</surname> <given-names>F. Z.</given-names></name> <name><surname>Zhang</surname> <given-names>J. R.</given-names></name> <name><surname>Xu</surname> <given-names>S. X.</given-names></name> <name><surname>Li</surname> <given-names>X. H.</given-names></name></person-group> (<year>1998</year>). <article-title>Effect of <italic>Erigeron breviscapus</italic> hand on hemorheology and microcirculation of acute cerebral infarction</article-title>. <source>Chinese J. Hemorheol.</source> <fpage>42</fpage>&#x2013;<lpage>44</lpage>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z. J.</given-names></name> <name><surname>Liu</surname> <given-names>Y. Y.</given-names></name> <name><surname>Yang</surname> <given-names>C. W.</given-names></name> <name><surname>Qian</surname> <given-names>Z. G.</given-names></name> <name><surname>Li</surname> <given-names>G. D.</given-names></name></person-group> (<year>2019</year>). <article-title>The complete chloroplast genome sequence of <italic>Erigeron breviscapus</italic> and <italic>Erigeron multiradiatus</italic> (Asteraceae)</article-title>. <source>Mitochondrial DNA B Resour.</source> <volume>4</volume>, <fpage>3826</fpage>&#x2013;<lpage>3827</lpage>. doi: <pub-id pub-id-type="doi">10.1080/23802359.2019.1683478</pub-id>, PMID: <pub-id pub-id-type="pmid">33366206</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Song</surname> <given-names>K.</given-names></name> <name><surname>Yi</surname> <given-names>T.</given-names></name></person-group> (<year>2013</year>). <article-title>Conservation genetics and population diversity of <italic>Erigeron breviscapus</italic> (Asteraceae), an important Chinese herb</article-title>. <source>Biochem. Syst. Ecol.</source> <volume>49</volume>, <fpage>156</fpage>&#x2013;<lpage>166</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bse.2013.03.009</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Lu</surname> <given-names>Y.</given-names></name> <name><surname>Lan</surname> <given-names>B.</given-names></name> <name><surname>Xu</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>Codon usage by chloroplast gene is bias in <italic>Hemiptelea davidii</italic></article-title>. <source>J. Genet.</source> <volume>99</volume>:<fpage>8</fpage>. doi: <pub-id pub-id-type="doi">10.1007/s12041-019-1167-1</pub-id>, PMID: <pub-id pub-id-type="pmid">32366734</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Xiao-Qiao</surname> <given-names>T.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>R.</given-names></name> <name><surname>Xiang-Lian</surname> <given-names>Y.</given-names></name> <name><surname>Xu-Dong</surname> <given-names>F. U.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Effects of scutellarin on rat cerebral blood flow determined by laser speckle imagine system</article-title>. <source>Chinese J. Hosp. Pharm.</source> <volume>30</volume>, <fpage>719</fpage>&#x2013;<lpage>722</lpage>.</citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>Y.</given-names></name> <name><surname>Rijzaani</surname> <given-names>H.</given-names></name> <name><surname>Karcher</surname> <given-names>D.</given-names></name> <name><surname>Ruf</surname> <given-names>S.</given-names></name> <name><surname>Bock</surname> <given-names>R.</given-names></name></person-group> (<year>2013</year>). <article-title>Efficient metabolic pathway engineering in transgenic tobacco and tomato plastids with synthetic multigene operons</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>110</volume>, <fpage>E623</fpage>&#x2013;<lpage>E632</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1216898110</pub-id>, PMID: <pub-id pub-id-type="pmid">23382222</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Dong</surname> <given-names>Z.</given-names></name> <name><surname>He</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Complete plastid genome sequence of <italic>Erigeron breviscapus</italic> (Asteraceae), an endemic traditional Chinese herbal medicine</article-title>. <source>Mitochondrial DNA B Resour.</source> <volume>4</volume>, <fpage>4077</fpage>&#x2013;<lpage>4078</lpage>. doi: <pub-id pub-id-type="doi">10.1080/23802359.2019.1605854</pub-id>, PMID: <pub-id pub-id-type="pmid">33366326</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monde</surname> <given-names>R. A.</given-names></name> <name><surname>Greene</surname> <given-names>J. C.</given-names></name> <name><surname>Stern</surname> <given-names>D. B.</given-names></name></person-group> (<year>2000</year>). <article-title>The sequence and secondary structure of the 3'-UTR affect 3'-end maturation, RNA accumulation, and translation in tobacco chloroplasts</article-title>. <source>Plant Mol. Biol.</source> <volume>44</volume>, <fpage>529</fpage>&#x2013;<lpage>542</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1026540310934</pub-id>, PMID: <pub-id pub-id-type="pmid">11197327</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murashige</surname> <given-names>T.</given-names></name> <name><surname>Skoog</surname> <given-names>F.</given-names></name></person-group> (<year>1962</year>). <article-title>A revised medium for rapid growth and bio assays with tobacco tissue cultures</article-title>. <source>Physiol. Plant.</source> <volume>15</volume>, <fpage>473</fpage>&#x2013;<lpage>497</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1399-3054.1962.tb08052.x</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nie</surname> <given-names>X.</given-names></name> <name><surname>Lv</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Du</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Biradar</surname> <given-names>S. S.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Complete chloroplast genome sequence of a major invasive species, Crofton weed (Ageratina adenophora)</article-title>. <source>PLoS One</source> <volume>7</volume>:<fpage>e36869</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0036869</pub-id>, PMID: <pub-id pub-id-type="pmid">22606302</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Radakovits</surname> <given-names>R.</given-names></name> <name><surname>Jinkerson</surname> <given-names>R. E.</given-names></name> <name><surname>Darzins</surname> <given-names>A.</given-names></name> <name><surname>Posewitz</surname> <given-names>M. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Genetic engineering of algae for enhanced biofuel production</article-title>. <source>Eukaryot. Cell</source> <volume>9</volume>, <fpage>486</fpage>&#x2013;<lpage>501</lpage>. doi: <pub-id pub-id-type="doi">10.1128/EC.00364-09</pub-id>, PMID: <pub-id pub-id-type="pmid">20139239</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Redwan</surname> <given-names>R. M.</given-names></name> <name><surname>Saidin</surname> <given-names>A.</given-names></name> <name><surname>Kumar</surname> <given-names>S. V.</given-names></name></person-group> (<year>2015</year>). <article-title>Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass <italic>Commelinidae</italic></article-title>. <source>BMC Plant Biol.</source> <volume>15</volume>:<fpage>196</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12870-015-0587-1</pub-id>, PMID: <pub-id pub-id-type="pmid">26264372</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruhlman</surname> <given-names>T.</given-names></name> <name><surname>Verma</surname> <given-names>D.</given-names></name> <name><surname>Samson</surname> <given-names>N.</given-names></name> <name><surname>Daniell</surname> <given-names>H.</given-names></name></person-group> (<year>2010</year>). <article-title>The role of heterologous chloroplast sequence elements in transgene integration and expression</article-title>. <source>Plant Physiol.</source> <volume>152</volume>, <fpage>2088</fpage>&#x2013;<lpage>2104</lpage>. doi: <pub-id pub-id-type="doi">10.1104/pp.109.152017</pub-id>, PMID: <pub-id pub-id-type="pmid">20130101</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sebastian</surname> <given-names>B.</given-names></name> <name><surname>Thomas</surname> <given-names>T.</given-names></name> <name><surname>Thomas</surname> <given-names>M.</given-names></name> <name><surname>Uwe</surname> <given-names>S.</given-names></name> <name><surname>Martin</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>MISA-web: a web server for microsatellite prediction</article-title>. <source>Bioinformatics</source> <volume>33</volume>, <fpage>2583</fpage>&#x2013;<lpage>2585</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btx198</pub-id>, PMID: <pub-id pub-id-type="pmid">28398459</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sheng</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>P.</given-names></name> <name><surname>Huang</surname> <given-names>Z.</given-names></name></person-group> (<year>1999</year>). <article-title>Influence of Deng Zhan xi Xin (<italic>Erigeron breviscapus</italic>) on thrombolytic treatment during acute coronary thrombosis by affecting function of blood platelet and coagulation</article-title>. <source>Chinese J. Cardiol.</source> <volume>27</volume>, <fpage>115</fpage>&#x2013;<lpage>117</lpage>.</citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>H.</given-names></name> <name><surname>Jiang</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>CPGAVAS2, an integrated plastome sequence annotator and analyzer</article-title>. <source>Nuclc Acids Res.</source> <volume>47</volume>, <fpage>W65</fpage>&#x2013;<lpage>W73</lpage>. doi: <pub-id pub-id-type="doi">10.1093/nar/gkz345</pub-id>, PMID: <pub-id pub-id-type="pmid">31066451</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>C.</given-names></name> <name><surname>Hu</surname> <given-names>N.</given-names></name> <name><surname>Huang</surname> <given-names>H.</given-names></name> <name><surname>Gao</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>Y. J.</given-names></name> <name><surname>Gao</surname> <given-names>L. Z.</given-names></name></person-group> (<year>2012</year>). <article-title>An improved chloroplast DNA extraction procedure for whole plastid genome sequencing</article-title>. <source>PLoS One</source> <volume>7</volume>:<fpage>e31468</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0031468</pub-id>, PMID: <pub-id pub-id-type="pmid">22384027</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Timme</surname> <given-names>R. E.</given-names></name> <name><surname>Kuehl</surname> <given-names>J. V.</given-names></name> <name><surname>Boore</surname> <given-names>J. L.</given-names></name> <name><surname>Jansen</surname> <given-names>R. K.</given-names></name></person-group> (<year>2007</year>). <article-title>A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats</article-title>. <source>Am. J. Bot.</source> <volume>94</volume>, <fpage>302</fpage>&#x2013;<lpage>312</lpage>. doi: <pub-id pub-id-type="doi">10.3732/ajb.94.3.302</pub-id>, PMID: <pub-id pub-id-type="pmid">21636403</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Voronova</surname> <given-names>A.</given-names></name> <name><surname>Belevich</surname> <given-names>V.</given-names></name> <name><surname>Jansons</surname> <given-names>A.</given-names></name> <name><surname>Rungis</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Stress-induced transcriptional activation of retrotransposon-like sequences in the scots pine (Pinus sylvestris L.) genome</article-title>. <source>Tree Genet. Genomes</source> <volume>10</volume>, <fpage>937</fpage>&#x2013;<lpage>951</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11295-014-0733-1</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wick</surname> <given-names>R. R.</given-names></name> <name><surname>Judd</surname> <given-names>L. M.</given-names></name> <name><surname>Gorrie</surname> <given-names>C. L.</given-names></name> <name><surname>Holt</surname> <given-names>K. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Unicycler: resolving bacterial genome assemblies from short and long sequencing reads</article-title>. <source>PLoS Comput. Biol.</source> <volume>13</volume>:<fpage>e1005595</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005595</pub-id>, PMID: <pub-id pub-id-type="pmid">28594827</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wright</surname> <given-names>F.</given-names></name></person-group> (<year>1990</year>). <article-title>The 'effective number of codons' used in a gene</article-title>. <source>Gene</source> <volume>87</volume>, <fpage>23</fpage>&#x2013;<lpage>29</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0378-1119(90)90491-9</pub-id>, PMID: <pub-id pub-id-type="pmid">2110097</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Hybrid de novo genome assembly of the Chinese herbal fleabane <italic>Erigeron breviscapus</italic></article-title>. <source>GigaScience</source> <volume>6</volume>, <fpage>1</fpage>&#x2013;<lpage>7</lpage>. doi: <pub-id pub-id-type="doi">10.1093/gigascience/gix028</pub-id>, PMID: <pub-id pub-id-type="pmid">28431028</pub-id></citation></ref></ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup><ext-link xlink:href="http://47.96.249.172:16019/analyzer/annotate" ext-link-type="uri">http://47.96.249.172:16019/analyzer/annotate</ext-link></p></fn>
<fn id="fn0002"><p><sup>2</sup><ext-link xlink:href="http://www.vmatch.de/" ext-link-type="uri">http://www.vmatch.de/</ext-link></p></fn></fn-group>
</back>
</article>