<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioeng. Biotechnol.</journal-id>
<journal-title>Frontiers in Bioengineering and Biotechnology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioeng. Biotechnol.</abbrev-journal-title>
<issn pub-type="epub">2296-4185</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fbioe.2020.00513</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioengineering and Biotechnology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Metabolic Engineering Strategies in Diatoms Reveal Unique Phenotypes and Genetic Configurations With Implications for Algal Genetics and Synthetic Biology</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>George</surname> <given-names>Jestin</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/927374/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kahlke</surname> <given-names>Tim</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/422262/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Abbriano</surname> <given-names>Raffaela M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/889529/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kuzhiumparambil</surname> <given-names>Unnikrishnan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/922406/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ralph</surname> <given-names>Peter J.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/65640/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Fabris</surname> <given-names>Michele</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/756407/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>University of Technology Sydney, Climate Change Cluster, Faculty of Science</institution>, <addr-line>Ultimo, NSW</addr-line>, <country>Australia</country></aff>
<aff id="aff2"><sup>2</sup><institution>CSIRO Synthetic Biology Future Science Platform</institution>, <addr-line>Brisbane, QLD</addr-line>, <country>Australia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Maurycy Daroch, Peking University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Konstantinos Vavitsas, National and Kapodistrian University of Athens, Greece; Manuel Serif, Norwegian University of Science and Technology, Norway</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Michele Fabris <email>michele.fabris&#x00040;uts.edu.au</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Synthetic Biology, a section of the journal Frontiers in Bioengineering and Biotechnology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>06</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="collection">
<year>2020</year>
</pub-date>
<volume>8</volume>
<elocation-id>513</elocation-id>
<history>
<date date-type="received">
<day>16</day>
<month>12</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>30</day>
<month>04</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2020 George, Kahlke, Abbriano, Kuzhiumparambil, Ralph and Fabris.</copyright-statement>
<copyright-year>2020</copyright-year>
<copyright-holder>George, Kahlke, Abbriano, Kuzhiumparambil, Ralph and Fabris</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Diatoms are photosynthetic microeukaryotes that dominate phytoplankton populations and have increasing applicability in biotechnology. Uncovering their complex biology and elevating strains to commercial standards depends heavily on robust genetic engineering tools. However, engineering microalgal genomes predominantly relies on random integration of transgenes into nuclear DNA, often resulting in detrimental &#x0201C;position-effects&#x0201D; such as transgene silencing, integration into transcriptionally-inactive regions, and endogenous sequence disruption. With the recent development of extrachromosomal transgene expression via independent episomes, it is timely to investigate both strategies at the phenotypic and genomic level. Here, we engineered the model diatom <italic>Phaeodactylum tricornutum</italic> to produce the high-value heterologous monoterpenoid geraniol, which, besides applications as fragrance and insect repellent, is a key intermediate of high-value pharmaceuticals. Using high-throughput phenotyping we confirmed the suitability of episomes for synthetic biology applications and identified superior geraniol-yielding strains following random integration. We used third generation long-read sequencing technology to generate a complete analysis of all transgene integration events including their genomic locations and arrangements associated with high-performing strains at a genome-wide scale with subchromosomal detail, never before reported in any microalga. This revealed very large, highly concatenated insertion islands, offering profound implications on diatom functional genetics and next generation genome editing technologies, and is key for developing more precise genome engineering approaches in diatoms, including possible genomic safe harbour locations to support high transgene expression for targeted integration approaches. Furthermore, we have demonstrated that exogenous DNA is not integrated inadvertently into the nuclear genome of extrachromosomal-expression clones, an important characterisation of this novel engineering approach that paves the road to synthetic biology applications.</p></abstract>
<kwd-group>
<kwd>microalgae</kwd>
<kwd><italic>Phaeodactylum tricornutum</italic></kwd>
<kwd>extrachromosomal expression</kwd>
<kwd>random integration</kwd>
<kwd>long-read sequencing</kwd>
<kwd>integration islands</kwd>
<kwd>heterologous monoterpenoids</kwd>
<kwd>synthetic biology</kwd>
</kwd-group>
<contract-sponsor id="cn001">University of Technology Sydney<named-content content-type="fundref-id">10.13039/501100001775</named-content></contract-sponsor>
<contract-sponsor id="cn002">Commonwealth Scientific and Industrial Research Organisation<named-content content-type="fundref-id">10.13039/501100000943</named-content></contract-sponsor>
<counts>
<fig-count count="4"/>
<table-count count="2"/>
<equation-count count="1"/>
<ref-count count="109"/>
<page-count count="19"/>
<word-count count="14036"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Diatoms are a diverse group of unicellular Stramenopile microalgae that have received substantial attention for their ecological importance (Armbrust, <xref ref-type="bibr" rid="B5">2009</xref>) and biotechnological potential (Huang and Daboussi, <xref ref-type="bibr" rid="B46">2017</xref>). Newly developed genetic resources hold much promise for diatom functional genetics studies and have propelled the model pennate diatom <italic>Phaeodactylum tricornutum</italic> into the field of synthetic biology. Firstly, next-generation genetic engineering tools have now been established in <italic>P. tricornutum</italic>. This includes targeted genetic engineering via transcription activator-like effector nucleases (TALENs) (Daboussi et al., <xref ref-type="bibr" rid="B18">2014</xref>; Weyman et al., <xref ref-type="bibr" rid="B107">2015</xref>; Serif et al., <xref ref-type="bibr" rid="B96">2017</xref>) and CRISPR-Cas9 (Nymark et al., <xref ref-type="bibr" rid="B81">2016</xref>; Serif et al., <xref ref-type="bibr" rid="B95">2018</xref>; Sharma et al., <xref ref-type="bibr" rid="B97">2018</xref>) techniques with wide applications for biotechnology and basic research. Next-generation engineering strategies also include the recently demonstrated extrachromosomal approach. Here, potentially large episomes that contain various DNA parts can be maintained and expressed without requiring genomic integration (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>). Consequently, extrachromosomal transformation is anticipated to become increasingly widely-used in diatom genetic engineering (Huang and Daboussi, <xref ref-type="bibr" rid="B46">2017</xref>). These next-generation engineering strategies are central to diatom genetics and synthetic biology primarily because they allow multi-gene stacking approaches (Goyal et al., <xref ref-type="bibr" rid="B35">2009</xref>; Ainley et al., <xref ref-type="bibr" rid="B1">2013</xref>). Secondly, the recently developed Universal Loop (uLoop) assembly kit provides a collection of useful parts for modular DNA assembly and high-throughput testing as well as more complex gene-stacking designs for <italic>P. tricornutum</italic> and other diatoms (Pollak et al., <xref ref-type="bibr" rid="B87">2019</xref>). Altogether, these resources offer unparalleled potential of diatoms such as <italic>P. tricornutum</italic> compared to other model algal chassis.</p>
<p>Further to these developments, intrinsic desirable biological traits have elevated this microbe as a promising alternative to well-established chassis, <italic>Escherichia coli</italic> and <italic>Saccharomyces cerevisiae</italic>. Such traits include its robustness and scalability for industrial-scale growth (Hamilton et al., <xref ref-type="bibr" rid="B40">2015</xref>); and&#x02014;unlike bacteria and yeast species&#x02014;its ability to fix carbon via photosynthesis for cheaper culture conditions. There is also an increased availability of transcriptomic, metabolomic, and proteomic datasets for uncovering previously unknown traits in diatoms (Ashworth et al., <xref ref-type="bibr" rid="B6">2016</xref>; Longworth et al., <xref ref-type="bibr" rid="B74">2016</xref>; Remmers et al., <xref ref-type="bibr" rid="B89">2018</xref>; Smith et al., <xref ref-type="bibr" rid="B101">2019</xref>) including unique aspects with potential biotechnological relevance (Kroth et al., <xref ref-type="bibr" rid="B62">2008</xref>; Allen et al., <xref ref-type="bibr" rid="B3">2011</xref>; Fabris et al., <xref ref-type="bibr" rid="B27">2012</xref>, <xref ref-type="bibr" rid="B26">2014</xref>).</p>
<p><italic>Phaeodactylum tricornutum</italic> is poised to become a widely-used, reliable chassis organism and has been validated in various high interest biotechnological applications, including in the production of bioplastic precursor compounds (Hempel et al., <xref ref-type="bibr" rid="B41">2011a</xref>), therapeutic antibodies (Hempel et al., <xref ref-type="bibr" rid="B42">2011b</xref>; Hempel and Maier, <xref ref-type="bibr" rid="B43">2012</xref>), biofuels (Yao et al., <xref ref-type="bibr" rid="B108">2014</xref>) and nutritional supplements (Hamilton et al., <xref ref-type="bibr" rid="B39">2014</xref>). However, these approaches have all relied on the first generation genetic engineering strategy of randomly integrated chromosomal expression (RICE) of exogenous DNA. While RICE has been crucial for generating a myriad of transgenic <italic>P. tricornutum</italic> strains, both for basic (Lavaud et al., <xref ref-type="bibr" rid="B63">2012</xref>; Liu et al., <xref ref-type="bibr" rid="B71">2016</xref>) and applied research (Hempel and Maier, <xref ref-type="bibr" rid="B43">2012</xref>); it is beset with silencing issues resulting in low transgene expression and stability (Cerutti et al., <xref ref-type="bibr" rid="B14">2011</xref>). This is largely attributed to the &#x0201C;position effect&#x0201D; phenomenon, whereby transgenes integrate into regions in the genome that are unfavourable for transgene expression (Gangl et al., <xref ref-type="bibr" rid="B31">2015</xref>; Doron et al., <xref ref-type="bibr" rid="B23">2016</xref>; Huang and Daboussi, <xref ref-type="bibr" rid="B46">2017</xref>), such as transcriptionally repressed regions (Elgin, <xref ref-type="bibr" rid="B24">1996</xref>). In microalgal research, there is virtually no information regarding the mechanisms driving and regulating RICE. Uncovering this knowledge is important for better understanding of a strategy that is still widely used today, including for CRISPR-Cas9 targeted genome editing (Hopes et al., <xref ref-type="bibr" rid="B44">2016</xref>; Nymark et al., <xref ref-type="bibr" rid="B81">2016</xref>; Greiner et al., <xref ref-type="bibr" rid="B36">2017</xref>).</p>
<p>In order for next-generation engineering tools to deliver a variety of synthetic biology applications in <italic>P. tricornutum</italic> and to replace RICE, they need to be better characterised. For example, even though targeted integration has been demonstrated in this species (Weyman et al., <xref ref-type="bibr" rid="B107">2015</xref>), there are no known safe harbour loci&#x02013;regions in the nuclear genome that facilitate stable, high transgene expression&#x02013;identified in <italic>P. tricornutum</italic> or any other eukaryotic photosynthetic microbe. Similarly, non-integrative episomes are extremely appealing for synthetic biology applications as backbones of self-maintaining mini-chromosomes. However, there is little knowledge available regarding mechanisms of episomal maintenance (Diner et al., <xref ref-type="bibr" rid="B22">2016</xref>), including the level of transgene expression that can be achieved by extrachromosomal expression (EE), and whether fragments of episomal DNA are inadvertently integrated into the nuclear genome. This is because EE technology has only recently been described in diatoms with limited examples of its use to express transgenes. Resolving these knowledge gaps will enable understanding of how different genetic engineering strategies may alter the diatom&#x00027;s biology and DNA integration patterns for developing more reliable next-generation engineering approaches required for complex synthetic biology.</p>
<p>Given the developments for diatom synthetic biology, <italic>P. tricornutum</italic> is now being explored for its potential for heterologous terpenoid production (Vavitsas et al., <xref ref-type="bibr" rid="B105">2018</xref>). <italic>Phaeodactylum tricornutum</italic> is a promising alternative chassis compared to the more widely used bacteria and yeast species, which require extensive engineering to increase relatively low flux to isoprenoid biosynthesis (Zurbriggen et al., <xref ref-type="bibr" rid="B109">2012</xref>; Paddon et al., <xref ref-type="bibr" rid="B82">2013</xref>; Bian et al., <xref ref-type="bibr" rid="B9">2017</xref>; Wang et al., <xref ref-type="bibr" rid="B106">2017</xref>). Until recently, <italic>P. tricornutum</italic> had only been metabolically engineered to produce heterologous terpenoids betulin and lupeol (D&#x00027;Adamo et al., <xref ref-type="bibr" rid="B19">2018</xref>) by RICE. However, we have since demonstrated the first use of EE for metabolic engineering in <italic>P. tricornutum</italic> (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>) by expressing a geraniol synthase from the medicinal plant <italic>Catharanthus roseus</italic> (EC 3.1.7.11, <italic>CrGES</italic>) for the heterologous production of geraniol. Geraniol is a commercially relevant monoterpenoid with a variety of applications as flavourant, fragrances, and insect repellent (Chen and Viljoen, <xref ref-type="bibr" rid="B15">2010</xref>). Geraniol is also the first intermediate in the monoterpenoid indole alkaloids (MIAs) biosynthesis pathway, which in <italic>C. roseus</italic> leads to the synthesis of the very high-value products with pharmaceutical applications (Van Moerkercke et al., <xref ref-type="bibr" rid="B104">2013</xref>; Caputi et al., <xref ref-type="bibr" rid="B12">2018</xref>).</p>
<p>Herein, with the aim of laying the basis for more sophisticated synthetic biology and metabolic engineering strategies, we generated and thoroughly profiled libraries of transgenic <italic>P. tricornutum</italic> cell lines engineered to produce geraniol via either EE or RICE. By adopting high-throughput phenotyping, we unveiled important intrinsic differences both between and within EE and RICE cell lines. This revealed a small selection of high-performing RICE strains, as well as a highly consistent transgene expression phenotype across EE strains. We used long-read DNA sequencing to interrogate the genomes of selected EE and RICE lines. Our results provide a complete analysis of all integration events, genomic locations, and transgene arrangements associated with high-performing RICE strains at both genome-wide and subchromosomal scale, never reported before in any microalga, and confirmed the non-integrative nature of EE.</p>
<p>Our findings are key for understanding the underexplored dynamics of EE in diatoms, to be used as the basis for gene-stacking based synthetic biology applications such as metabolic pathways and complex genetic circuit assembly and expression. They also highlight the importance of moving to next-generation genetic engineering strategies in <italic>P. tricornutum</italic> and lay the groundwork for identifying putative genomic safe harbour locations that support high transgene expression for targeted integration strategies.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec>
<title>Microbial Strains and Growth Conditions</title>
<p><italic>Phaeodactylum tricornutum</italic> CCAP1055/1 was grown in liquid ESAW (Berges et al., <xref ref-type="bibr" rid="B8">2001</xref>) supplemented with 50 &#x003BC;g/mL zeocin (Invivogen, San Diego, CA, USA) where appropriate, under 100 &#x003BC;E m<sup>&#x02212;2</sup> s<sup>&#x02212;1</sup> light in 21 &#x000B0;C shaking at 95 rpm. <italic>Phaeodactylum tricornutum</italic> induction media was prepared following ESAW protocol but without any addition of phosphate. <italic>Escherichia coli</italic> was grown in Luria broth supplemented with 100 &#x003BC;g/mL ampicillin.</p>
</sec>
<sec>
<title>Cloning and Genetic Construct Assembly</title>
<p>Plasmids and episomes were constructed using Gibson assembly cloning kit (New England Biolabs, Hitchin, UK). Plasmids were propagated in <italic>E. coli</italic> strain Top10 and purified by Monarch Plasmid Miniprep Kit (New England Biolabs, Hitchin, UK). PCR amplification was performed using Q5 high fidelity polymerase (New England Biolabs, Hitchin, UK) and PCR screening was performed using GoTaq Flexi DNA polymerase (Promega, Wisconsin, United States) according to the manufacturer&#x00027;s instructions. Plasmid coding sequences were validated by Sanger sequencing (Macrogen Korea, Seoul, Korea). Episomes <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> (mVenus NCBI Accesssion: AAZ65844.1) and a control <italic>pPtPBR11_AP1p_mVenus</italic> are described in Fabris et al. (<xref ref-type="bibr" rid="B25">2020</xref>). Expression plasmid for chromosomal integration, <italic>pUC19_AP1p_CrGES-mVenus</italic>, was ligated by Gibson assembly into the pUC19 cloning vector linearised with BamHI. Both the <italic>AP1p_CrGES-mVenus_FCBPt</italic> expression cassette and the <italic>FCBPp_ShBle_FCBPt</italic> zeocin resistant cassette were amplified from <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> using 5&#x02032;-tcgagctcggtacccgggCTAACAGGATTAGTGCAATTC-3&#x02032; forward primer and 5&#x02032;-aggtcgactAGACGAGCTAGTGTTATTC-3&#x02032; reverse primer; and 5&#x02032;-agctcgtctAGTCGACCTGCACATATG-3&#x02032; forward primer and 5&#x02032;-tgcaggtcgactctagagAGACGAGCTAGTGTTATTC-3&#x02032; reverse primer, respectively. Similarly, <italic>pUC19_AP1p_mVenus</italic> expression plasmid for genomic integration was ligated by Gibson assembly into the pUC19 cloning vector linearised with BamHI. The <italic>AP1p_mVenus_FCBPt</italic> expression cassette was amplified using the same primers for amplifying <italic>AP1p_CrGES-mVenus_FCBPt</italic> expression cassette from <italic>pPtPBR11_AP1p_mVenus</italic> episome. Sequences of the vectors used in this work are provided in <xref ref-type="supplementary-material" rid="SM2">Supplementary File 2</xref>.</p>
</sec>
<sec>
<title>Diatom Transformation and Conjugation</title>
<p><italic>Phaeodactylum tricornutum</italic> was transformed by biolistic bombardment using PDS-1000/He System with Hepta adapter (Bio-Rad) for nuclear genomic integration of plasmid DNA (Kroth, <xref ref-type="bibr" rid="B61">2007</xref>). Afterward, the cell mixture was left to recover 12 h before being scraped and plated onto fresh &#x000BD; ESAW 50 &#x003BC;g/mL zeocin agar plates and left for 4&#x02013;5 weeks for single colonies to appear. <italic>E. coli</italic> containing <italic>pTA-Mob</italic> (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>) and <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> or <italic>pPtPBR11_AP1p_mVenus</italic> plasmids (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>) were used for conjugation with <italic>P. tricornutum</italic> according to protocol described by Diner et al. (<xref ref-type="bibr" rid="B22">2016</xref>). The cell mixture was scraped and plated onto 3&#x02013;5 fresh &#x000BD; ESAW zeocin agar plates and left for 10&#x02013;15 days when single colonies appeared. Single colonies generated by biolistic bombardment and conjugation were picked and inoculated into individual wells of 96-well round bottom plates containing 200 &#x003BC;l of ESAW supplemented with 50 &#x003BC;g/mL zeocin. The EE and RICE generated cell lines were incubated at 21&#x000B0;C with 100 &#x003BC;E m<sup>&#x02212;2</sup> s<sup>&#x02212;1</sup> light for 1 week to adjust to liquid growth, after which they were subcultured every 4 days. For high-throughput screening, cell lines were subcultured over 3 weeks in ESAW supplemented with 50 &#x003BC;g/mL zeocin (or without supplementation for stability analysis), and induced with phosphate-free ESAW for 24 h before flow cytometry.</p>
</sec>
<sec>
<title>Flow Cytometry and Fluorescence-Activated Cell Sorting (FACS)</title>
<p>Once antibiotic resistant single colonies were established in liquid culture, they were used to inoculate fresh 200 &#x003BC;L of ESAW with and without zeocin. Cell lines were subcultured 1:7 (v:v) every 4 days for 3 weeks under these conditions. On day 4 of culture, plates were centrifuged at 2,300 g for 3 min to pellet cells. The supernatant was removed and cells were washed with 150 &#x003BC;L induction media twice before being resuspended in 200 &#x003BC;L of induction media and induced for 24 hrs. Induced cells were screened by flow cytometry using CytoFLEX S (Beckman Coulter). Fluorescence was excited using a 488 nm laser. mVenus fluorescence was detected using a 525/40 nm filter and chlorophyll fluorescence was detected using 690/50 nm filter. Compensation of chlorophyll channel was set to 0.3. The distribution of eight single colonies of wild type <italic>P. tricornutum</italic> cultured and induced in the same way as transgenic cell lines were used as controls to determine the mVenus auto fluorescence for screening. Only chorophyll positive cells were included in the analysis to account for cell debris and other background events.</p>
<p>The per cell mVenus fluorescence of 20,000 events was log normalised and violin plots were created in Python using the seaborn data visualisation library (v. 0.9.0) (Allen et al., <xref ref-type="bibr" rid="B4">2018</xref>). Cell lines selected for geraniol production analysis were sorted using BD Influx FACS (BD Biosciences). All cell lines were cultured and induced as described above prior to FACS. Yellow mVenus fluorescence was detected using a 488 nm laser for excitation and a 530/40 nm filter. Wild type <italic>P. tricornutum</italic> cultured and induced in the same way as the transformants, was used as a control to determine the mVenus auto fluorescence distribution. A preliminary screen of a pooled sample of the top eight RICE_mV transformants was used to define the mVenus positive gate which did not overlap with the wild type control. One thousand cells from each transformant cell line that fell into this gate were collected into 96-well round bottom plate wells containing 200 &#x003BC;L ESAW media. After sorting, cells were incubated for 1&#x02013;2 weeks in 200 &#x003BC;L ESAW supplemented with zeocin 50 &#x003BC;g/mL in 96-well round bottom plate, after which they were subcultured as described previously.</p>
</sec>
<sec>
<title>Geraniol Capture and Analysis</title>
<p>Cell lines analysed for geraniol production were scaled up to 50 mL preculture in ESAW supplemented with zeocin 50 &#x003BC;g/mL. Precultures were used to inoculate 50 mL fresh ESAW media without zeocin in 250 mL shake flasks at 10,000 cells/mL density on Day 0. Cultures in late-exponential phase were induced in the presence of isopropyl myristate (C<sub>17</sub>H<sub>34</sub>O<sub>2</sub>) to capture volatile monoterpenoids (Jiang et al., <xref ref-type="bibr" rid="B49">2017</xref>). On Day 4 the total culture was collected and centrifuged at 3,000 g for 4 min to pellet cells. Cells were resuspended in 4 mL induction media and washed twice in induction media before being resuspended in 30 mL induction media in fresh 250 mL shake flasks with 1.6 mL isopropyl myristate, which was harvested 72 h after induction and stored &#x02212;80&#x000B0;C until being analysed by GCMS. Geraniol was captured, sampled and analysed as described in Fabris et al. (<xref ref-type="bibr" rid="B25">2020</xref>).</p>
</sec>
<sec>
<title>High Molecular Weight Genomic DNA (gDNA) Extraction</title>
<p>Genomic DNA from <italic>P. tricornutum</italic> transformants was extracted using 7 &#x000D7; 10<sup>7</sup> cells in 4&#x02013;6 extractions to obtain &#x0007E;7.5 &#x003BC;g high molecular weight, purified gDNA (dx.doi.org/10.17504/protocols.io.qzudx6w). The DNA was resuspended in 25&#x02013;45 &#x003BC;L ultrapure water overnight at room temperature.</p>
</sec>
<sec>
<title>MinION Sequencing</title>
<p>MinION sequencing libraries were prepared according to the 1D Genomic DNA by ligation (SQK-LSK108) protocol supplied by the MinION manufacturer (Oxford Nanopore Technologies) with modifications. Briefly, the DNA fragmentation step was replaced with two bead-cleaning steps. An initial 1:0.1 bead clean of gDNA to GC Biotech CleanNGS (CNGS-0005) beads was performed and the sample was gently mixed by flicking, and was slowly repeatedly inverted for 5 min, and pelleted on a magnetic rack to collect the supernatant. A second 1:1 bead clean was performed using the supernatant and beads. The sample was gently mixed by flicking, incubated by slowly rotating by hand for 5 min, and pelleted on a magnetic rack. The supernatant was removed and DNA bound to the beads was washed with 200 &#x003BC;L freshly prepared 70% ethanol twice without removing the tube off the magnet or disturbing the pellet. The beads and DNA were resuspended in 46 &#x003BC;L ultrapure water, incubated at room temperature for 5 min, the beads were pelleted on the magnet and the supernatant containing the DNA was collected and used according to the manufacturer protocol. Samples were sequenced until a coverage of seven to ten times was achieved. Raw reads were base called and quality filtered using albacore v2.2.6 with default settings. All the sequencing data has been deposited in NCBI BioProject under the ID PRJNA593624.</p>
</sec>
<sec>
<title>Identifying Transgene Integration Locations in Nuclear Genome</title>
<p>To identify reads which contain RICE plasmid DNA and remove reads made up of genomic DNA only, all reads were aligned against <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid using BLAST. Reads that did align to RICE plasmid were defined as initial hits. To account for regions in <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid that contain native <italic>P. tricornutum</italic> genomic regions, such as promoters or terminators, the <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid sequence was aligned against <italic>P. tricornutum</italic> genome using BLAST (EnsemblProtists, ASM15095v2). Initial hits that only aligned to those regions of <italic>pUC19_AP1p_CrGES-mVenus</italic> that are native to <italic>P. tricornutum</italic> were filtered out as false-positive hits using custom awk commands. The resulting true-positive hits were manually checked to identify the chromosomes of the integration sites. For detailed analyses all reads were mapped to each of the matching chromosomes using bwa v0.7.15 (Li and Durbin, <xref ref-type="bibr" rid="B66">2009</xref>). The resulting sam files were sorted, converted to bam-format and indexed using samtools 1.3.1 (Li et al., <xref ref-type="bibr" rid="B67">2009</xref>) and potential integration sites were manually checked using the Integrative Genomics Viewer v2.4.16 (Robinson et al., <xref ref-type="bibr" rid="B91">2011</xref>). Further analysis for ambiguous hits was performed using the Artemis Comparison Tool (ACT) v13.0.0 (Carver et al., <xref ref-type="bibr" rid="B13">2005</xref>).</p>
</sec>
</sec>
<sec id="s3">
<title>Results and Discussion</title>
<p>Terpenoid engineering in <italic>P. tricornutum</italic> has only recently been reported (D&#x00027;Adamo et al., <xref ref-type="bibr" rid="B19">2018</xref>; Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>) and consequently, there is limited prior knowledge to inform metabolic engineering strategies in diatoms to obtain elevated terpenoid production. Recently, we demonstrated that extrachromosomal expression (EE) can be used to efficiently express the fusion protein CrGES-mVenus in <italic>P. tricornutum</italic> cytosol to produce up to 0.21 &#x003BC;g/10<sup>7</sup> cells (0.309 mg/L) geraniol following bacterial conjugation (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>). EE of transgenes is not subject to position effect (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>) and could therefore provide highly reproducible, consistent, and controllable expression, which is a basic requisite for synthetic biology. In contrast, randomly integrated chromosomal expression (RICE) can result in genetically dissimilar transformants and consequently varied transgene expression among them. However, diatom phenotypes derived from EE and RICE have not been systematically parameterised. Because little is known regarding the mechanisms and effects following EE and RICE of transgenes, it is unclear how these different engineering strategies will compare regarding the expression of <italic>CrGES-mVenus</italic>, and consequently, heterologous geraniol production. Therefore, we analysed the phenotypes of EE and RICE of <italic>Ap1_CrGES-mVenus</italic> in <italic>P. tricornutum</italic> cell lines both at the expression level and in terms of geraniol yield.</p>
<p>Two identical DNA expression cassettes of <italic>AP1p_CrGES-mVenus</italic> and <italic>AP1p_mVenus</italic> as control (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>) were cloned either into an pPtPBR11 episome (Diner et al., <xref ref-type="bibr" rid="B22">2016</xref>) or a pUC19 plasmid (Norrander et al., <xref ref-type="bibr" rid="B80">1983</xref>), and delivered either by bacterial conjugation or DNA-coated particle bombardment, respectively, in order to create EE and RICE <italic>P. tricornutum</italic> transformant libraries. Both <italic>AP1p_CrGES-mVenus</italic> constructs contained the <italic>CrGES</italic> gene fused at the carboxyl-terminus to a mVenus yellow fluorescent protein (YFP) (Kremers et al., <xref ref-type="bibr" rid="B60">2006</xref>) for rapidly screening the cell lines by flow cytometry. The <italic>CrGES-mVenus</italic> fusion gene was driven by the <italic>P. tricornutum</italic> native alkaline phosphatase (<italic>AP1, Phatr3_J49678</italic>) promoter (hereafter <italic>AP1p</italic>), which is induced in low phosphate conditions for controllable expression (Lin et al., <xref ref-type="bibr" rid="B69">2017</xref>).</p>
<p>Upon transformation of <italic>P. tricornutum</italic> with the episomes and plasmids described above, the resulting antibiotic resistant cell lines were used to create four transgenic diatom libraries. Cell lines transformed with <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> and <italic>pPtPBR11_AP1p_mVenus</italic> for EE were denoted as EE_GmV and EE_mV, respectively. Likewise, cell lines transformed with <italic>pUC19_AP1p_CrGES-mVenus</italic> and <italic>pUC19_AP1p_mVenus</italic> for RICE were denoted as RICE_GmV and RICE_mV, respectively.</p>
<sec>
<title>EE Transformants Demonstrate Consistent mVenus Fluorescence, While RICE Transformants Demonstrate Higher, but More Variable Signals</title>
<p>In order to compare transgene expression across all four transgenic libraries (EE_GmV, EE_mV, RICE_GmV, and RICE_mV), we required a high-throughput screening strategy to quantify the relative heterologous protein production. We used flow cytometry to rapidly evaluate the CrGES-mVenus expression in an unprecedented number of transformants to identify unique features among and within cell lines generated by EE and RICE. This strategy enabled us to confirm the correct expression of the fusion protein, as well as quantify its relative abundance, offering a proxy for identifying high-expressing transformant variants from the low expressing or silenced variants (Sheff and Thorn, <xref ref-type="bibr" rid="B98">2004</xref>; Delvigne et al., <xref ref-type="bibr" rid="B21">2014</xref>). It is generally accepted that RICE transformants will exhibit different levels of heterologous protein production from each other (Hallmann, <xref ref-type="bibr" rid="B38">2007</xref>; Jeon et al., <xref ref-type="bibr" rid="B48">2017</xref>; Tanwar et al., <xref ref-type="bibr" rid="B103">2018</xref>). Conversely, EE exconjugants theoretically offer more consistent levels of expression (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>). However, this has never been shown over a large scale of transformants or through direct comparison. Therefore, it is not known to what extent the expression of heterologous protein can vary across EE exconjugants or RICE transformants.</p>
<p>Single colonies from each of the four libraries were screened according to mVenus fluorescence to evaluate differences in transgene expression. Our results demonstrate that, as predicted and previously reported on a smaller sample (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>), EE results in consistent transgene expression among exconjugants. The mean mVenus fluorescence fold change of all EE_GmV lines and all EE_mV lines was 58.50- and 57.73- fold, respectively, when compared to wild type (WT) auto-fluorescence and were not significantly different from each other (<italic>p</italic> &#x0003E; 0.9999) (<xref ref-type="fig" rid="F1">Figure 1A</xref>). This suggested that construct size and complexity&#x02014;at least of this degree&#x02014;does not affect expression, and confirms that EE might not be affected by variable transgene silencing typically associated with position effect. For EE_GmV strains, the DNA construct contained the fusion gene geraniol synthase and mVenus (total construct size of 10,849 bp); whereas EE_mV strains were transformed with the mVenus containing DNA (total construct size 9,082 bp). Within each EE library, there was also little variation among EE_GmV lines (SEM = 3.54) and EE_mV lines (SEM = 2.95) (<xref ref-type="fig" rid="F1">Figure 1A</xref>). These results indicate that EE transformants are highly similar. Interestingly, every EE transformant analysed, across both EE_GmV and EE_mV libraries, showed a mean mVenus fluorescence greater than WT auto-fluorescence (<xref ref-type="fig" rid="F1">Figure 1B</xref>). This shows that EE is highly efficient and reliable in generating cell lines that express transgenes and does not require extensive screening, as is required for RICE engineering strategies.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>mVenus fluorescence intensities of transgenic <italic>P. tricornutum</italic> extrachromosomal expression (EE) and randomly integrated chromosomal expression (RICE) libraries. <bold>(A)</bold> Fold change of mean mVenus fluorescence of RICE_GmV and RICE_mV transformant libraries and EE_GmV and EE_mV libraries compared to wild type auto-fluorecence. Peach indicates CrGES-mVenus transgenic cell lines and teal indicates mVenus transgenic cell lines. Statistical comparisons were made using Kruskal-Wallis non-parametric ANOVA and Dunn&#x00027;s <italic>post-hoc</italic> test. For EE_GmV library <italic>n</italic> = 96 cell lines total, EE_mV library <italic>n</italic> = 96 cell lines total, RICE_GmV library <italic>n</italic> = 74 cell lines total and RICE_mV library <italic>n</italic> = 95 cell lines total. <bold>(B)</bold> Percentage of pooled RICE libraries (green) compared to percentage of pooled EE libraries (yellow) binned according to mean mVenus fluorescence fold change. <bold>(C&#x02013;F)</bold> Violin plots indicate the per cell mVenus fluorescence intensity of ten representative cell lines for each library <bold>(C)</bold> EE_GmV <bold>(D)</bold> EE_mV <bold>(E)</bold> RICE_GmV <bold>(F)</bold> RICE_mV, ranked from lowest to mean mVenus expression (<italic>n</italic> = 20,000 cells for each cell line). <bold>(G,H)</bold> Representative cell lines from transgene stability analysis for <bold>(G)</bold> RICE_GmV and <bold>(H)</bold> RICE_mV libraries. Pink indicates selection free growth conditions, green indicates zeocin selection growth conditions, cell lines are ranked by mean mVenus intensity (<italic>n</italic> = 20,000 cells for each cell line).</p></caption>
<graphic xlink:href="fbioe-08-00513-g0001.tif"/>
</fig>
<p>Unlike EE, RICE is subject to position effects and is therefore expected to result in transformants with variable transgene expression. We confirmed this in both RICE_GmV and RICE_mV libraries, in which transformants showed highly variable mVenus fluorescence intensities (<xref ref-type="fig" rid="F1">Figure 1A</xref>). For example, the mean mVenus fluorescence in RICE_GmV lines ranged between 0.04 and 719.20- fold change (SEM = 21.37) and RICE_mV lines ranged between 0.40 and 1695.00- fold change (SEM = 43.22). Furthermore, the RICE_GmV and RICE_mV libraries were significantly different from each other (<italic>p</italic> &#x0003C; 0.0001), demonstrating mean mVenus fold changes of 137.80- and 368.60-fold, respectively. Together these results suggested that when transgenes are integrated randomly in to the genome, features of the transgene, such as size and complexity, may affect its expression. It is plausible that gene silencing plays a role, as mVenus is present as a large fusion protein in the RICE_GmV library, whereas it is a smaller, free fluorescent protein in the RICE_mV library.</p>
<p>To further evaluate the heterogeneity of mVenus fluorescence across RICE libraries, we arbitrarily binned transformants based on the vast spread of mean mVenus fluorescence profiles recorded. We generated five groups comprising of &#x0003C;10-, 10- to 250-, 250- to 800-, 800- to 1,500-, and &#x0003E;1,500-fold change in mVenus fluorescence compared to wild type auto-fluorescence. In the RICE_GmV and RICE_mV libraries (<xref ref-type="fig" rid="F1">Figure 1B</xref>), &#x0007E;25% of transformants showed &#x0003C;10-fold mean mVenus than wild type auto-fluorescence (<xref ref-type="fig" rid="F1">Figure 1B</xref>). Biolistic bombardment is expected to result in random fragmentation of plasmid DNA (Hopes et al., <xref ref-type="bibr" rid="B44">2016</xref>). This could theoretically result in antibiotic resistant transformants that contain the selection cassette without the intact <italic>AP1p_CrGES-mVenus</italic> transcriptional unit, possibly resulting in antibiotic resistant transformants with fluorescence profiles indistinguishable from WT auto-fluorescence. Additionally, it is plausible that transformants associated with such low mean mVenus signals might have integrated the <italic>CrGES-mVenus</italic> expression cassette at transcriptionally repressed genomic loci, or in arrangements that may have triggered gene silencing (Kim et al., <xref ref-type="bibr" rid="B55">2015</xref>). About 37% of RICE transformants and 100% of EE exconjugants demonstrated mean mVenus fluorescence 10&#x02013;250-fold greater than WT auto-fluorescence. A further 28% of RICE transformants showed a 250&#x02013;800-fold increase. Only 7% showed an 800- to 1,500-fold increase, and 1% (corresponding to 2 cell lines both from RICE_mV library) reached a remarkable 1,500-fold increase in fluorescence compared to WT auto-fluorescence.</p>
<p>Together, these results demonstrate that transformants generated by RICE require extensive screening at the protein expression level, as up to a quarter can show no to low expression. Interestingly, RICE transformants were able to demonstrate exceptionally higher maximum transgene expression compared to EE using the same transgene cassette design. This warrants further investigation into these transgenic genomes, as there may be aspects of chromosome-integration that could be useful, particularly with regard to multi-generation transgene stability (Kohli and Christou, <xref ref-type="bibr" rid="B56">2008</xref>). Although this high RICE-related expression could be advantageous for simple transgenic constructs, our results show that it would not be suitable for testing larger, more complex assemblies especially without reporter genes. Instead, the high, virtually size-independent consistency of phenotypes associated with EE offer a more suitable platform for applications involving multi-gene constructs, with the advantage of not requiring large scale screening.</p>
</sec>
<sec>
<title>Clonal Variegation Is Broadly Distributed in EE but Discretely Defined in RICE</title>
<p>After having determined marked differences in the expression profile between EE and RICE libraries, we exploited the resolution of high-throughput flow cytometry to investigate the population composition within each cell line of EE and RICE libraries. In doing so, we identified relevant variations in the distributions of mVenus fluorescence within individual cell lines, known as cell mosaicism or variegation (Kaufman et al., <xref ref-type="bibr" rid="B54">2008</xref>). Most EE transformants showed a relatively homogenous distribution of mVenus abundance within each cell line, such as that of EE_GmV-28,&#x02212;47,&#x02212;7 and&#x02212;19, and EE_mV-72,&#x02212;14 and&#x02212;36 (<xref ref-type="fig" rid="F1">Figures 1C,D</xref>). However, some showed increasingly diverse mVenus distribution profiles within individual cell lines, such as EE_GmV-92,&#x02212;80,&#x02212;33,&#x02212;97, and&#x02212;67 and EE_mV-22,&#x02212;40,&#x02212;5,&#x02212;1,&#x02212;65, and&#x02212;97. Intriguingly, these cell lines tended to show higher mean mVenus abundance (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 1</xref>). This suggested that EE transformants which exhibited higher mean mVenus signals were composed of cells that were highly dissimilar from each other in a more continuous, non-discrete manner. This observation raises important questions about the dynamics of EE in diatoms; namely, how are episomal copies maintained within each cell, and how dynamic is episomal copy number and segregation across individual cells at different stages of the life cycle? Such mechanisms have been uncovered in other eukaryotic, non-microalgal species. For example, maintenance of viral episomes in mammalian cells and plasmids in <italic>S. cerevisiae</italic> have been attributed to chromosome tethering and hitchhiking mechanisms (Ghosh et al., <xref ref-type="bibr" rid="B33">2007</xref>; Liu et al., <xref ref-type="bibr" rid="B73">2008</xref>; McBride, <xref ref-type="bibr" rid="B75">2008</xref>; Sau et al., <xref ref-type="bibr" rid="B93">2019</xref>). In yeast synthetic biology research, episomal DNA sequences have been characterised based on traits including transformation efficiency, copy number, transgene expression and plasmid stability of clonal populations (Bouton and Smith, <xref ref-type="bibr" rid="B10">1986</xref>; Nakamura et al., <xref ref-type="bibr" rid="B78">2018</xref>; Gu et al., <xref ref-type="bibr" rid="B37">2019</xref>). For example, Nakamura et al. (<xref ref-type="bibr" rid="B78">2018</xref>) tested various episome-regulation sequences in <italic>Pichia pastoris</italic> transgenic strains expressing EGFP extrachromosomally. They reported that strains containing the autonomously replicating sequence (ARS) without a centromeric region (CEN) showed broad fluorescence profiles similar to those that we report here, whereby cells within a single clonal population show a wide spread of EGFP fluorescence. However, when combined with centromeric region (CEN2), they reported a more discrete distribution of high EGFP fluorescence profiles that were more consistent with our RICE transformants. This was likely due to a strong bias for the mother cell over the daughter cells during cell division (Gehlen et al., <xref ref-type="bibr" rid="B32">2011</xref>). While the pPtPBR11 plasmid used in this study contains CEN region (Diner et al., <xref ref-type="bibr" rid="B22">2016</xref>), it is still not yet known how such features contribute to episome expression in diatoms (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>). Such factors could influence this cell-to-cell phenotypic heterogeneity, but for unknown reasons, this seemed to become more prominent at higher mean mVenus fluorescence.</p>
<p>Overall, RICE transformants demonstrated homogeneous fluorescence distribution profiles within individual cell lines, such as RICE_GmV-44,&#x02212;41, and&#x02212;64 and RICE_mV-93,&#x02212;54,&#x02212;50, and&#x02212;4 (<xref ref-type="fig" rid="F1">Figures 1E,F</xref>). Other RICE transformants also demonstrated heterogeneous mean mVenus profiles, but as numerous discrete populations within single cell lines (<xref ref-type="fig" rid="F1">Figures 1E,F</xref>). For example, RICE_GmV-125,&#x02212;120 and&#x02212;3 and RICE_mV-24,&#x02212;92,&#x02212;2, and&#x02212;85 all were composed of two unique populations of mVenus fluorescence distribution (<xref ref-type="fig" rid="F1">Figures 1E,F</xref>). Transformant RICE_GmV-89 even showed three populations (<xref ref-type="fig" rid="F1">Figure 1E</xref>). These results demonstrate that individual cells within a clonal transformant RICE cell line, generally assumed to have identical phenotypes, can be highly heterogeneous with regard to transgene expression, but that this heterogeneity can be distributed into unique, discrete populations. This is a major difference with the highly heterogeneous EE cell lines, which were instead characterized by a wide distribution of heterogeneity within the population, although RICE_GmV-68 transformant also followed this distribution.</p>
</sec>
<sec>
<title>RICE Cell Lines Exhibit Dramatically Varied Stability That Does Not Correlate to <italic>CrGES-mVenus</italic> Expression</title>
<p>Given the extremely high outliers, we investigated the stability of the RICE libraries and how this related to expression level. Random chromosomal integration can result in stable maintenance and expression of transgenes, even in absence of selective pressure. Transformants that looked indistinguishable from wild type auto-fluorescence in the selective treatment did not change when selective pressure was removed (RICE_GmV-48 and RICE_mV-65, <xref ref-type="fig" rid="F1">Figures 1G,H</xref>). We also identified RICE transformants that did not retain their mVenus fluorescence in absence of selective pressure (<xref ref-type="fig" rid="F1">Figures 1G,H</xref>). For example, RICE_GmV-3 and&#x02212;127 and RICE_mV-45,&#x02212;59,&#x02212;85 (<xref ref-type="fig" rid="F1">Figures 1G,H</xref>, respectively) demonstrated a complete reduction in mVenus fluorescence when cultured in the absence of zeocin that was indistinguishable from wild type auto-fluorescence. Without selective pressure, cells that have silenced their resistance transgene (and by proxy, the transgene of interest) can outcompete and take over the culture due to the disadvantages of reduced energy and resource investment associated with transgene expression.</p>
<p>Interestingly, transgene expression level did not correlate to transgene stability, as seen in RICE_GmV-127 and&#x02212;99, which showed similar expression levels with selection, but only&#x02212;127 lost mVenus fluorescence when selection was removed. Likewise, RICE_GmV-3 and&#x02212;127 cell lines show dissimilar mVenus signals in presence of selection but lost signal completely in selection-free conditions (<xref ref-type="fig" rid="F1">Figures 1G,H</xref>). This suggested that there may be transgene integration events or arrangements that facilitate stable transgene expression and highlight the importance of designing screening procedures based on stability not only on transgene expression. Potential mechanisms of action include progressive transcriptional silencing via DNA methylation or histone modification, including <italic>de novo</italic> DNA methylation triggered by transgene recognition (Kohli et al., <xref ref-type="bibr" rid="B59">2010</xref>); and posttranscriptional silencing known as RNA interference (Meyer, <xref ref-type="bibr" rid="B76">1995</xref>; Le&#x000F3;n-Ba&#x000F1;ares et al., <xref ref-type="bibr" rid="B65">2004</xref>; Cerutti et al., <xref ref-type="bibr" rid="B14">2011</xref>; Doron et al., <xref ref-type="bibr" rid="B23">2016</xref>). Epigenetic silencing of nuclear-integrated exogenous DNA have been attributed to defense mechanisms against viruses and transposable elements in plants (Rajeevkumar et al., <xref ref-type="bibr" rid="B88">2015</xref>) and mammalian cells (Alhaji et al., <xref ref-type="bibr" rid="B2">2019</xref>) alike. Silencing mechanisms, and indeed transgene regulation mechanisms yet to be identified, can influence daughter cells from the same original clonal population differently (Kaufman et al., <xref ref-type="bibr" rid="B54">2008</xref>). In fact, it is not known how stable randomly integrated exogenous DNA fragments are once they have been integrated, or how these insertions are genetically maintained over time.</p>
<p>In other RICE transformants, such as RICE_GmV-68,&#x02212;33 and&#x02212;10 and RICE_mV-17 and&#x02212;70, we detected a reduction in mVenus abundance in absence of selection. Here, a distinct population of cells within each transformant showed signals similar to those in presence of selection, as well as a secondary population of noticeably lower mVenus fluorescence (<xref ref-type="fig" rid="F1">Figures 1G,H</xref>). Other lines showed only a very small reduction in expression, such as RICE_GmV-76,&#x02212;117, and&#x02212;64 and RICE_mV-44,&#x02212;94, and&#x02212;25. Finally, we were able to identify some RICE transformants that maintained mVenus signals both in presence and absence of selective pressure, namely transformants RICE_GmV-99,&#x02212;84,&#x02212;24, and&#x02212;47 and RICE_mV-66,&#x02212;93, and&#x02212;74 (<xref ref-type="fig" rid="F1">Figures 1G,H</xref>). Some of these transformants also demonstrated comparatively high mVenus fluorescence abundance, particularly RICE_GmV-41,&#x02212;47 and RICE_mV-74 (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 1</xref>). This suggested that they might contain integration events or arrangements that bypass silencing mechanisms. These transformants could provide empirical evidence for putative safe-harbour loci, which have been previously verified in various other organisms including mammalian cell lines (Lee et al., <xref ref-type="bibr" rid="B64">2015</xref>; Cheng et al., <xref ref-type="bibr" rid="B16">2016</xref>; Papapetrou and Schambach, <xref ref-type="bibr" rid="B83">2016</xref>; Salsman and Dellaire, <xref ref-type="bibr" rid="B92">2016</xref>), rice (Cantos et al., <xref ref-type="bibr" rid="B11">2014</xref>) and cyanobacteria (Bentley et al., <xref ref-type="bibr" rid="B7">2014</xref>; Pinto et al., <xref ref-type="bibr" rid="B86">2015</xref>).</p>
<p>Together, these results once again highlight that RICE is not suitable for more complex synthetic biology and that efforts to move toward next-generation genetic engineering strategies is crucial. High expressing RICE transformants can be unstable, as well as contain unknown genomic disturbances and mutations due to damage to the genome itself, as demonstrated in rice and maize (Liu et al., <xref ref-type="bibr" rid="B70">2018</xref>). However, a better understanding of high transgene expression in RICE transformants may reveal aspects of exogenous DNA integration that would be useful for targeted insertion strategies.</p>
</sec>
<sec>
<title>Long-Read Whole-Genome Sequencing Reveals No Chromosomal Integration of Episomal DNA, Whereas Biolistic Bombardment Caused Exogenous DNA to Integrate at Unique Chromosomal Loci</title>
<p>To date, transgenic genomic research has been restricted by prohibitive costs of whole-genome sequencing and limited techniques that only reveal certain aspects of random integration events (Scaife and Smith, <xref ref-type="bibr" rid="B94">2016</xref>; Jeon et al., <xref ref-type="bibr" rid="B48">2017</xref>). In diatoms, such strategies include Southern blotting, which showed 1&#x02013;10 transgene copies of foreign DNA integrated into the genome (Falciatore et al., <xref ref-type="bibr" rid="B28">1999</xref>); quantitative polymerase chain reaction (qPCR), which revealed that copy number was relatively consistent between transformants variants (average of three) (D&#x00027;Adamo et al., <xref ref-type="bibr" rid="B19">2018</xref>); and thermal asymmetric interlaced PCR (TAIL-PCR), which revealed integration loci of exogenous DNA (Johansson et al., <xref ref-type="bibr" rid="B50">2019</xref>). Similarly, the recently developed method of EE has been shown to require no transgene integration via episome recovery experiments (Karas et al., <xref ref-type="bibr" rid="B52">2015a</xref>; Diner et al., <xref ref-type="bibr" rid="B22">2016</xref>). However, it is not yet known if integration does occur alongside extrachromosomal maintenance of episomes. Consequently, there is no knowledge of RICE or EE transgenic microalgal genomes with regard to exogenous DNA integration arrangements, the frequency of integration events throughout the genome, or any precise genomic integration loci. Answering some of these knowledge gaps is required for advancing synthetic biology design, progressing next-generation engineering tools &#x02013;such as possible safe harbour loci to target&#x02013;, and providing better understanding of the transgenic genome architecture, particularly regions associated with high transgene expression.</p>
<p>Developments in long-read sequencing technologies, namely Oxford Nanopore and PacBio, have allowed more continuous genome assemblies which can be done in real-time in the lab (Jain et al., <xref ref-type="bibr" rid="B47">2018</xref>). To date, these technologies are mostly used for metagenomics analyses (Robertsen et al., <xref ref-type="bibr" rid="B90">2016</xref>; Pinder et al., <xref ref-type="bibr" rid="B85">2019</xref>) or sequencing new, non-model species (Davis et al., <xref ref-type="bibr" rid="B20">2016</xref>; Fournier et al., <xref ref-type="bibr" rid="B30">2017</xref>). Herein, we applied Oxford Nanopore sequencing to interrogate bacteria-conjugated and biolistic-bombarded transgenic <italic>P. tricornutum</italic> cell lines to explore integrated transgene arrangements, integration locations, and associated genetic architecture, as has been recently done in <italic>Arabidopsis thaliana</italic> and mouse models (Jupe et al., <xref ref-type="bibr" rid="B51">2019</xref>; Nicholls et al., <xref ref-type="bibr" rid="B79">2019</xref>). Given the phenotypic consistency between EE lines, we analysed a single EE line, EE_GmV-97, and assessed all the reads for alignment to the <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> episome DNA. For RICE lines, we analysed two lines that showed high stability and mVenus fluorescence, RICE_GmV-41 and&#x02212;47. These cell lines showed very similar transgene expression profiles and stabilities. Therefore, we aimed to identify differences or similarities regarding their transgenes at the genome-wide scale. These RICE reads were assessed for alignment to the RICE plasmid <italic>pUC19_AP1p_CrGES-mVenus</italic>. All EE and RICE reads with hits to their respective exogenous DNA constructs were then aligned to the wild type <italic>P. tricornutum</italic> genome in order to identify genomic integration events. The results of the Oxford Nanopore sequencing analysis are summarised in <xref ref-type="table" rid="T1">Table 1</xref>. For each cell line, we sequenced over 250 million nucleotides, resulting in genome coverage of five to nine times, with a probability &#x0003E;99% that the complete genome of each cell line was covered (Clarke and Carbon, <xref ref-type="bibr" rid="B17">1976</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Summarised details of MinION sequencing of EE and RICE transformants.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>EE_ GmV-97</bold></th>
<th valign="top" align="center"><bold>RICE_GmV-41_A</bold></th>
<th valign="top" align="center"><bold>RICE_GmV-41_B</bold></th>
<th valign="top" align="center"><bold>RICE_GmV-47_A</bold></th>
<th valign="top" align="center"><bold>RICE_GmV-47_B</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Nucleotides (total)</td>
<td valign="top" align="center">203,640,859</td>
<td valign="top" align="center">254,953,905</td>
<td valign="top" align="center">279,294,842</td>
<td valign="top" align="center">257,611,430</td>
<td valign="top" align="center">325,110,058</td>
</tr>
<tr>
<td valign="top" align="left">Reads (total)</td>
<td valign="top" align="center">26,455</td>
<td valign="top" align="center">37,373</td>
<td valign="top" align="center">31,581</td>
<td valign="top" align="center">18,525</td>
<td valign="top" align="center">20,153</td>
</tr>
<tr>
<td valign="top" align="left">Average read length</td>
<td valign="top" align="center">7,697.63</td>
<td valign="top" align="center">6,822</td>
<td valign="top" align="center">8,843.60</td>
<td valign="top" align="center">13,906</td>
<td valign="top" align="center">16,132.09</td>
</tr>
<tr>
<td valign="top" align="left">Coverage estimated</td>
<td valign="top" align="center">&#x0007E;5.8x</td>
<td valign="top" align="center">&#x0007E; 7.2x</td>
<td valign="top" align="center">&#x0007E;7.9x</td>
<td valign="top" align="center">&#x0007E; 7.3x</td>
<td valign="top" align="center">&#x0007E;9.2x</td>
</tr>
<tr>
<td valign="top" align="left">Total reads aligning to RICE plasmid</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">248</td>
<td valign="top" align="center">210</td>
<td valign="top" align="center">157</td>
<td valign="top" align="center">151</td>
</tr>
<tr>
<td valign="top" align="left">Reads aligning to RICE plasmid and genome on both borders</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Reads aligning to RICE plasmid and genome on either border</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">27</td>
</tr>
<tr>
<td valign="top" align="left">Reads aligning to RICE plasmid only</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">230</td>
<td valign="top" align="center">196</td>
<td valign="top" align="center">138</td>
<td valign="top" align="center">124</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>For RICE_GmV cell lines, the analysis was performed on two biological replicates (A and B)</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>Our results strongly suggest that no traces of episome were integrated into EE_GmV-97 (<xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T2">2</xref>). Although we identified 26 reads that aligned to the episome, these reads contained no regions which aligned to the <italic>P. tricornutum</italic> genome (<xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="fig" rid="F2">Figure 2A</xref>), strongly suggesting that these DNA fragments did not get integrated into the nuclear chromosomes. This is the first demonstration that no episomal exogenous DNA is inadvertently integrated into the nuclear genome following bacterial conjugation. Such knowledge is important for identifying any genetic disturbances that may go undetected in exconjugants, progressing knowledge for a better understanding of episomal regulation mechanisms, and for synthetic biology applications with <italic>P. tricornutum</italic> more broadly.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Summarised details of integration events of EE and RICE transformants.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Clone</bold></th>
<th valign="top" align="left"><bold>Insertion site ID</bold></th>
<th valign="top" align="left"><bold>Estimated chromosomal location (bp)</bold></th>
<th valign="top" align="left"><bold><italic>In silico</italic> assembly of island complete?</bold></th>
<th valign="top" align="center"><bold>Size island (Kbp)</bold></th>
<th valign="top" align="left"><bold>Genetic feature at integration site</bold></th>
<th valign="top" align="left"><bold>Putative annotation of feature at integration site</bold></th>
<th valign="top" align="left"><bold>AA size</bold></th>
<th valign="top" align="left"><bold>Upstream features within 1 Kbp</bold></th>
<th valign="top" align="left"><bold>Downstream features within 1 Kbp</bold></th>
</tr>
</thead>
<tbody>
<tr style="border-bottom: thin solid #000000;">
<td valign="top" align="left">EE_GmV-97</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center" colspan="8">NA</td>
</tr> <tr>
<td valign="top" align="left">RICE_GmV-41</td>
<td valign="top" align="left">41-1</td>
<td valign="top" align="left">ch1: 2,477,260</td>
<td valign="top" align="left">Incomplete.</td>
<td valign="top" align="center">&#x0003E;43</td>
<td valign="top" align="left">Intergenic</td>
<td valign="top" align="left">None</td>
<td valign="top" align="left">NA</td>
<td valign="top" align="left">Phatr3_J8770 (protein coding)</td>
<td valign="top" align="left">Phatr3_J54066 (protein coding)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">41-11</td>
<td valign="top" align="left">ch11: 316,959&#x02013;317,016</td>
<td valign="top" align="left">Complete. (Single read spanned island).</td>
<td valign="top" align="center">&#x0007E;10</td>
<td valign="top" align="left">Intergenic</td>
<td valign="top" align="left">None</td>
<td valign="top" align="left">NA</td>
<td valign="top" align="left">Phatr3_J46733 (protein coding)</td>
<td valign="top" align="left">Phatr3_EG00809 (protein coding)</td>
</tr>
<tr>
<td valign="top" align="left">RICE_GmV-47</td>
<td valign="top" align="left">47-9</td>
<td valign="top" align="left">ch9: 865,083&#x02013;865,119</td>
<td valign="top" align="left">Incomplete.</td>
<td valign="top" align="center">&#x0003E;124</td>
<td valign="top" align="left">Phatr3_J46300 (single exon protein coding gene).</td>
<td valign="top" align="left">CM000612 Genomic DNA Translation: EEC47937.1</td>
<td valign="top" align="left">402 aa</td>
<td valign="top" align="left">NA</td>
<td valign="top" align="left">Phatr3_J46301 (protein coding)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">47-10</td>
<td valign="top" align="left">ch10: 609,260&#x02013;609,276</td>
<td valign="top" align="left">Incomplete.</td>
<td valign="top" align="center">&#x0003E;87</td>
<td valign="top" align="left">Phatr3_J46528 (single exon protein coding gene).</td>
<td valign="top" align="left">CM000613 Genomic DNA Translation: EEC47687.1</td>
<td valign="top" align="left">429 aa</td>
<td valign="top" align="left">NA</td>
<td valign="top" align="left">NA</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>NA, Not applicable</italic>.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Graphic representation of exogenous DNA constructs in extrachromosomal and chromosomal DNA of the transgenic cell lines, based on long-read sequencing. Only reads aligning to both exogenous DNA and wild type <italic>P. tricornutum</italic> genome, and not those aligning to the genome alone are depicted. <bold>(A)</bold> EE_GmV-97 transformant showed no reads which aligned to both exogenous episomal DNA <italic>pPtPBR11_AP1p_CrGES-mVenus</italic> (yellow), and the reference <italic>P. tricornutum</italic> genome, indicating that no exogenous DNA was integrated into the genome. Instead, some reads showed alignment (red) only to episome DNA, suggesting these reads came from episomal DNA which was extracted and analysed with genomic DNA. <bold>(B)</bold> Transformant RICE_GmV-41 generated by biolistic bombardment showed reads which aligned to both exogenous RICE plasmid, <italic>pUC19_AP1p_CrGES-mVenus</italic>, and the reference <italic>P. tricornutum</italic> genome, indicating a frequency of only two integration islands, 41-1 and 41-11, occurring throughout the whole genome. Island 41-1 occurred on chromosome 1 where the longest left border read (LB-R) and right border read (RB-R) collectively indicated that this island was a minimum of 43 Kbp in size. Island 41-11 occurred on chromosome 11 and was spanned by a single read, left-right border read (LRB-R), which aligned to the reference genome at both left and right borders (light blue), as well as the exogenous RICE plasmid. Red indicates alignment in sense orientation and dark blue indicates alignment in antisense orientation, representing the highly concatenated integration events observed. <bold>(C)</bold> RICE_GmV-47 transformant showed reads which aligned to both exogenous RICE plasmid, <italic>pUC19_AP1p_CrGES-mVenus</italic>, and the reference <italic>P. tricornutum</italic> genome, indicating a frequency of only two integration islands, 47-9 and 47-10, occurring throughout the whole genome. Island 47-9 occurred on chromosome 9 where the longest left border read (LB-R) and right border read (RB-R) collectedly indicate that this island is a minimum of 124 kB in size. Island 47-10 occurred on chromosome 10 where the longest left border read (LB-R) and right border read (RB-R) collectedly indicate that this island is a minimum of 87 Kbp in size.</p></caption>
<graphic xlink:href="fbioe-08-00513-g0002.tif"/>
</fig>
<p>In the RICE lines, we identified only two independent integration loci in both RICE_GmV-41 and RICE_GmV-47, respectively (<xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T2">2</xref>; <xref ref-type="fig" rid="F2">Figures 2B,C</xref>). These four sites were detected and confirmed in both biological replicates (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 2</xref>). The integration events associated with these four independent loci consisted of concatenations of various fragments of the <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid (<xref ref-type="fig" rid="F2">Figures 2B,C</xref>). The genomic features and details associated with each of the four integration events are summarised in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<p>DNA extracted from each cell line does not come from a single cell, but instead a clonal population, and it is plausible that endogenous genomic regions around the insertion site are not stable (Kohli et al., <xref ref-type="bibr" rid="B58">1998</xref>, <xref ref-type="bibr" rid="B57">2006</xref>). Therefore, it was not possible to resolve every integration event to the single nucleotide level, but only at &#x0003C; 60 bp range. For example, Kohli demonstrated that endogenous DNA concatenations can be assembled prior to or during integration in rice crop species, and suggested recombination could occur even after integration (Kohli et al., <xref ref-type="bibr" rid="B58">1998</xref>, <xref ref-type="bibr" rid="B57">2006</xref>). It is also possible that insertions, deletions, or a combination of both (INDELs) can occur at the borders of an exogenous DNA integration event, driven by non-homologous integration (Shin et al., <xref ref-type="bibr" rid="B99">2016</xref>). Such INDELs would cause reads at this small border region to show no alignment to wild type genome, as seen in RICE_GmV-47 integration event 47-10 (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 2</xref>). Finally, the high sequencing error rate associated with Nanopore sequencing (15%) can also influence integration site determination.</p>
<p>In RICE_GmV-41, fragments of the <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid were inserted at two unique genomic loci, ch1: 2,477,260 (integration island 41-1) and ch11: 316,959&#x02013;317,016 (integration island 41-11) (<xref ref-type="table" rid="T2">Table 2</xref>). Both integration island 41-1 and 41-11 occur at intergenic regions in the genome; however, they are both flanked by predicted protein coding genes (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="fig" rid="F2">Figure 2B</xref>). Integration island 41-1 is situated 199 bp downstream of the 3&#x02032; end of <italic>Phatr3_J8770</italic> and 479 bp upstream of the 5&#x02032; start of <italic>Phatr3_J54066</italic> (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="fig" rid="F2">Figure 2B</xref>). <italic>Phatr3_J8770</italic> contains dynamin domains and <italic>Phatr3_J54066</italic> is putatively involved in vesicle trafficking functions according to HMMER (Finn et al., <xref ref-type="bibr" rid="B29">2011</xref>) searches. Integration island 41-11 occurs &#x0007E;900 bp downstream of the 3&#x02032; end of <italic>Phatr3_J46733</italic> and &#x0007E;100 bp upstream of the 5&#x02032; start of <italic>Phatr3_EG00809</italic> (<xref ref-type="table" rid="T2">Table 2</xref>; <xref ref-type="fig" rid="F2">Figure 2B</xref>). <italic>Phatr3_J46733</italic> contains a transmembrane feature at its C terminus (Uniprot) and a VAD1 Analog of StAR-related lipid transfer domain (VASt) according to HMMER (Finn et al., <xref ref-type="bibr" rid="B29">2011</xref>). <italic>Phatr3_EG00809</italic> showed no predicted functional annotations, nor similarity to known protein domains (Finn et al., <xref ref-type="bibr" rid="B29">2011</xref>).</p>
<p>Neither of these islands disrupted the protein coding regions of these neighbouring genes and we did not detect any growth defective phenotypes for these cell lines. However, the close proximity of the islands to these neighbouring genes means that the integration events may have affected their associated endogenous regulatory regions (<xref ref-type="table" rid="T2">Table 2</xref>, <xref ref-type="fig" rid="F2">Figure 2B</xref>).</p>
<p>In transformant RICE_GmV-47, two integration events were localised to ch9: 865,083&#x02013;865,119 (integration island 47-9) and ch10: 609,260&#x02013;609,276 (integration island 47-10) (<xref ref-type="table" rid="T2">Table 2</xref>, <xref ref-type="fig" rid="F2">Figure 2C</xref>). Both of these loci harbour predicted single-exon protein coding regions <italic>Phatr3_J46300</italic> and <italic>Phatr3_J46528</italic>, respectively, with no predicted functional annotations, nor similarity to known protein domains (Finn et al., <xref ref-type="bibr" rid="B29">2011</xref>).</p>
<p>Interestingly, all four integration events were contained within unique sites across the entire genome of both cell lines, instead of occurring in a more scattered arrangement at a high number of locations, as has been demonstrated following biolistic bombardment in the plants <italic>Oryza sativa</italic> and <italic>Zea mays</italic> (Liu et al., <xref ref-type="bibr" rid="B70">2018</xref>).</p>
</sec>
<sec>
<title>Biolistic Bombardment Results in Extremely Large Integration Islands Containing Highly Repetitive Arrangements of Exogenous DNA</title>
<p>Due to the size of the <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid (6.5 Kbp) and the length of the longer reads we obtained (up to 193.5 Kbp in length), we expected that single reads sequenced using this technology could span the entire integration site. We found this to be true for integration island 41-11, as demonstrated with left-right border read (LRB-R), 28.6 Kbp in length (<xref ref-type="fig" rid="F2">Figure 2B</xref>). This read revealed &#x0007E;10 Kbp aligned to the RICE plasmid and 4 Kbp upstream and 14 Kbp downstream of the &#x0201C;integration island&#x0201D; aligning to adjacent loci in the reference genome (<xref ref-type="fig" rid="F3">Figure 3A</xref>). This is the first visualisation of a complete integration event in any microalgae and clearly shows both the integration location in the genome as well as the integration event arrangement.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Graphic representation of rearrangements of exogenous DNA in <italic>P. tricornutum</italic> chromosomes, based on long-read sequencing. Red channels show alignment in sense orientation and blue channels show alignment in antisense orientation. Regions that are not highlighted did align to the plasmid, but with below-threshold for hit length of percent identity used for the visualisation, which was performed manually. <bold>(A)</bold> Alignment of a left-right border read (LRB-Read) (top) from integration event 41-11 to RICE plasmid <italic>pUC19_AP1p_CrGES-mVenus</italic> (bottom) and to the wild type <italic>P. tricornutum</italic> genome (green). <bold>(B)</bold> A single 97.5 Kbp read (bottom) with no regions of similarity to the <italic>P. tricornutum</italic> wild type reference genome aligned to the RICE plasmid <italic>pUC19_AP1p_CrGES-mVenus</italic> (top). <bold>(C)</bold> Integration island 47-9 made up by two reads; the left border read (LB-Read) (middle) contains approximately 42 Kbp aligned to the RICE plasmid (bottom) and 3 Kbp aligned to the <italic>P. tricornutum</italic> wild type reference genome (top). The right border read (RB-Read) (middle) contains approximately 82 Kbp of aligned to the RICE plasmid and 2 Kbp aligned to the <italic>P. tricornutum</italic> wild type reference genome. <bold>(D)</bold> Alignments of left and right border reads to each other for integration island 41-1, 47-9, and 47-10. These reads do not align to each other to &#x0201C;close&#x0201D; the integration island, suggesting that some &#x0201C;filler&#x0201D; reads are missing.</p></caption>
<graphic xlink:href="fbioe-08-00513-g0003.tif"/>
</fig>
<p>However, this was the only integration event which was detected on a single read. We did not expect that every other integration event would be so large that it would not be detectable within a single long read, but instead only align to one border of the integration island. This is demonstrated by representative reads 47-9_LB-R and 47-9_RB-R that contained a short 5&#x02032; flank aligning to ch9: 860,407&#x02013;865,083 and a 3&#x02032; flank aligning to ch9: 865,119&#x02013;867,673, respectively (<xref ref-type="fig" rid="F3">Figure 3C</xref>). The majority portion of these two reads (42 Kbp for 47-9_RB-R and 82 Kbp for 47-9_RB-R) in fact aligned to RICE plasmid and were found to contain a high frequency of adjacent concatenations of <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid, in both sense and antisense orientations (<xref ref-type="fig" rid="F3">Figure 3C</xref>). This type of &#x0201C;bordered&#x0201D; configuration was detected around the integration islands 41-1, 47-9, and 47-10.</p>
<p>The highly repetitive, shuffled structure of the integration islands may be responsible for the high <italic>CrGES-mVenus</italic> expression associated with these cell lines compared to others in this library. Alternatively or in addition to this, it is plausible that either high intact transcriptional unit copy number, or highly repetitive, chimeric promoter and/or terminator arrangements of the <italic>CrGES-mVenus</italic> cassette might act as enhancers for the expression of this construct. Given that we were able to detect mVenus fluorescence and geraniol production (<xref ref-type="fig" rid="F4">Figures 4B,C</xref>), we can infer that some of these fragments contained functional transgene cassettes. However, the &#x0007E;15% error rate of Nanopore sequencing technology (Jain et al., <xref ref-type="bibr" rid="B47">2018</xref>) and the inability to detect single nucleotide polymorphisms make it near impossible to infer the exact number of functional copies present. Furthermore, it is unclear how stable these large integration islands are, if they are prone to recombination events, or what molecular mechanisms occurred to generate them.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Geraniol production in three selected RICE_GmV and EE_GmV cell lines. <italic>N</italic> = 3, error bars represent SEM, statistical comparisons were made using one-way ANOVA and Tukey&#x00027;s multiple comparisons <italic>post-hoc</italic> test. Significance is demonstrated with asterisks, where &#x0002A;&#x0002A;&#x0002A;&#x0002A;<italic>p</italic> &#x02264; 0.0001; &#x0002A;&#x0002A;&#x0002A;<italic>p</italic> &#x02264; 0.001; and &#x0002A;<italic>p</italic> &#x02264; 0.05. <bold>(A)</bold> Growth curve for all cell lines. <bold>(B)</bold> mVenus fluorescence intensity 24 h after induction. <bold>(C)</bold> geraniol produced after 72 h induction.</p></caption>
<graphic xlink:href="fbioe-08-00513-g0004.tif"/>
</fig>
<p>Together, the left and right border reads for integration islands 41-1, 47-9, and 47-9 indicate that these islands are a minimum of 43, 124, and 87 Kbp, respectively. We then looked to assess whether these left and right border reads for these three integration islands aligned to each other to &#x0201C;close&#x0201D; the integration island, but were unable to due to their repetitive nature (<xref ref-type="fig" rid="F3">Figure 3D</xref>).</p>
<p>Interestingly, over 80% of the reads we identified that aligned to the <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid did so in their entirety (<xref ref-type="table" rid="T1">Table 1</xref>); i.e., these reads contained no flanks which aligned to the genome at all (<xref ref-type="fig" rid="F3">Figure 3B</xref>). Theoretically, these large reads with no genome-aligning flanks could sit between the left and right border reads of the integration islands. Although we cannot align these highly concatenated reads to each other to confirm this, we speculate that islands 41-1, 47-9, and 47-10 could certainly be hundreds of kilobase pairs in size. Given that RICE_GmV-41 transformant has only two integration events and we knew the size of integration island 41-11 (&#x0007E;10 Kbp) and the sizes of all the &#x0201C;RICE plasmid only&#x0201D; aligning reads together (1,808,244 nt), we can roughly estimate the size of integration island 41-1 through</p>
<disp-formula id="E1"><mml:math id="M3"><mml:mtable columnalign="left"><mml:mtr><mml:mtd><mml:mfrac><mml:mrow><mml:mi>n</mml:mi><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>l</mml:mi></mml:mrow><mml:mrow><mml:mn>11</mml:mn></mml:mrow></mml:msub><mml:mtext>&#x000A0;</mml:mtext><mml:mo>*</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mi>c</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>ni</italic> is the number of nucleotides in &#x0201C;RICE plasmid only&#x0201D; aligned reads, <italic>l</italic><sub>11</sub> is the length of integration island 41-11, and <italic>c</italic> is the estimated coverage. Following this the estimated length of the second integration island 41-1 is &#x0007E;250 Kpbs. This correlates to 38 hypothetical back-to-back integrations of the full <italic>pUC19_AP1p_CrGES-mVenus</italic> RICE plasmid. Such large hypothetical islands are supported by similar results in transgenic rice and maize lines obtained following biolistic bombardment, in which large integration islands up to 1.6 Mbp in size were reported (Liu et al., <xref ref-type="bibr" rid="B70">2018</xref>). Jupe et al. (<xref ref-type="bibr" rid="B51">2019</xref>) also used whole-genome sequencing to elucidate transgene integration structure following <italic>Agrobacterium</italic>-mediated transformation in <italic>Arabidopsis thaliana</italic>. They reported between one and seven integration islands of between 20 and 230 Kbp per strain. While these results are from higher plant species, they confirm that these huge, highly concatenated islands are not specific to biolistic transformation, nor to diatoms.</p>
<p>As previously described, mechanisms involved with RICE are not well-understood but have been investigated in plants (Kohli et al., <xref ref-type="bibr" rid="B57">2006</xref>). Our results highlight a need to explore mechanisms driving the assembly and maintenance of these islands in diatoms, especially given the range of sizes possible that suggest numerous strategies may be at play. This is the first insight into how nuclear integration occurs in diatoms with widespread implications for existing understanding and future studies. Previous short-read sequencing techniques such as targeted gene-walking (Parker et al., <xref ref-type="bibr" rid="B84">1991</xref>), and inverse PCR or TAIL-PCR (Huang et al., <xref ref-type="bibr" rid="B45">2000</xref>; Liu and Chen, <xref ref-type="bibr" rid="B72">2007</xref>; Johansson et al., <xref ref-type="bibr" rid="B50">2019</xref>) have been useful for identifying integration loci, but would not have been able to detect such large integration events of hundreds of kilobase pairs in size, nor would it be able to detect the complex transgene rearrangements (Nicholls et al., <xref ref-type="bibr" rid="B79">2019</xref>) that we unveiled through long-read sequencing. Furthermore, highly concatenated integrations contain many repeated sequences (<xref ref-type="fig" rid="F3">Figure 3</xref>), which nested primers used in TAIL-PCR are able to anneal to. Consequently, PCR strategies would result in many non-specific amplicons and difficulty in determining an unknown integration location. Whilst Southern blotting is a useful approach for determining gene copy number, it does not provide information about the integration site. Hence, our results demonstrate that long-read whole-genome sequencing is an ideal, rapid and affordable approach for determining highly complex transgene integration events.</p>
</sec>
<sec>
<title>RICE <italic>CrGES-mVenus</italic> Transformants Are Associated With Higher Expression and Higher Geraniol Yield</title>
<p>We previously demonstrated that wild type <italic>P. tricornutum</italic> does not naturally produce geraniol, but that it can be efficiently engineered to extrachromosomally express <italic>CrGES-mV</italic> to produce it heterologously (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>). In order to determine whether extrachromosomal or chromosomal-integrated expression of the fusion construct <italic>CrGES-mVenus</italic> affected the heterologous production of monoterpenoids, we quantified the amount of geraniol produced in three independent RICE_GmV transformant lines and three EE_GmV exconjugant lines. Using mVenus fluorescence as a proxy for GES expression, we selected six transformants (RICE_GmV-24,&#x02212;41,&#x02212;24; and EE_GmV-97,&#x02212;98 and&#x02212;99, respectively) based on their mean fluorescence intensity and stability (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 1</xref>). With the aim of enriching the clonal populations associated with higher mean mVenus fluorescence, RICE_mV-74, RICE_GmV-24,&#x02212;41 and&#x02212;47 and EE_mV-97, EE_GmV-97,&#x02212;98 and&#x02212;99 were induced and sorted based on mVenus fluorescence using fluorescence activated cell sorting (FACS). A preliminary screen of a pooled sample of the top eight RICE_mV transformants was used to define the mVenus positive gate which did not overlap with the wild type control. During FACS, one thousand cells from each transformant that fell into this gate were collected and scaled up. We concluded that cell sorting did not enrich phenotypic populations, and in this specific case did not improve mean mVenus intensity (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 3</xref>).</p>
<p>Geraniol production in transgenic cell lines was evaluated in a bi-phasic, batch fermentation experiment (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>). All <italic>CrGES-mVenus</italic> transformants and exconjugants were induced by resuspension in lower volumes (30 ml) of phosphate-free media and showed similar growth to <italic>mVenus</italic> transformant and exconjugant controls prior to induction, indicating no identifiable loss of fitness in <italic>CrGES-mVenus</italic> expressing lines (<xref ref-type="fig" rid="F4">Figure 4A</xref>).</p>
<p>We tracked mVenus fluorescence daily and quantified the accumulated geraniol 72 h after inducing the expression of the <italic>CrGES-mVenus</italic> fusion gene. The RICE_GmV transformants, which showed increased mVenus fluorescence 24 h after induction, produced more geraniol than the episomal exconjugant equivalents (<xref ref-type="fig" rid="F4">Figures 4B,C</xref>). Chromosome-integrated transformant production yields were 150.9 ng/10<sup>7</sup> cells (0.37 mg/L), 304.4 ng/10<sup>7</sup> cells (0.89 mg/L) and 235.3 ng/10<sup>7</sup> cells (0.61 mg/L) for RICE_GmV-24, RICE_GmV-41 and RICE_GmV-47, respectively (<xref ref-type="fig" rid="F4">Figure 4C</xref>). Conversely, non-integrated exconjugants demonstrated consistently lower yields that were 49.1 ng/10<sup>7</sup> cells (0.10 mg/L), 61.4 ng/10<sup>7</sup> cells (0.15 mg/L) and 72.5 ng/10<sup>7</sup> cells (0.15 mg/L) for EE_GmV-97, EE_GmV-98, and EE_GmV-99, respectively (<xref ref-type="fig" rid="F4">Figure 4C</xref>). Neither EE nor RICE mVenus controls showed any detectable geraniol (<xref ref-type="fig" rid="F4">Figure 4C</xref>). These results indicate that mVenus fluorescence is a reliable proxy for geraniol production, as mVenus fluorescence correlated with geraniol yields. Also, the high geraniol yields achieved support the hypothesis that <italic>P. tricornutum</italic> may have an available free pool of cytosolic geranyl diphosphate (GPP), the prenylphosphate precursor that CrGES converts into geraniol (Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>), and that the heterologous synthesis of this monoterpenoid might not be limited by substrate availability in these settings. In light of these results, strategies involving promoter optimisation and targeted integration&#x02014;both currently being evaluated in our laboratory&#x02014;could further increase the geraniol production. Together, our results warrant the development of <italic>P. tricornutum</italic> for enhanced monoterpenoid production and suggest that it would be possible to improve production levels further by optimising <italic>CrGES</italic> expression at the genetic level.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s4">
<title>Conclusions</title>
<p>Within the emerging application of monoterpenoid engineering in diatoms, we set out to generate specific knowledge to inform genetic optimisation strategies, including multi-gene approaches for synthetic biology and pathway engineering. We provided for the first time a comprehensive comparative analysis of two main types of transgene expression in diatoms, the conventional RICE and in the newly developed EE, using large-scale, high-throughput phenotyping, which allowed us to uncover details of these genetic resources between and within cell lines. The genetic differences between EE and RICE were reflected by the varied yields of the relevant monoterpenoid geraniol in a selection of transgenic diatom cell lines expressing a CrGES-mVenus fusion enzyme. The geraniol yield was more than 4-fold higher in the best RICE transformant, reaching the titre of 304.4 ng/10<sup>7</sup> cells (0.89 mg/L), compared to the best EE exconjugants, reaching 72.47 ng/10<sup>7</sup> cells (0.15 mg/L). Thus, this work evaluated how previously unexplored genetic strategies can improve heterologous production of geraniol in <italic>P. tricornutum</italic>, in addition to more conventional strategies such as metabolic engineering or bioprocessing. While diatom engineering for terpenoid production has only recently been demonstrated, our results show that <italic>P. tricornutum</italic> is a promising photosynthetic microbial factory (D&#x00027;Adamo et al., <xref ref-type="bibr" rid="B19">2018</xref>; Fabris et al., <xref ref-type="bibr" rid="B25">2020</xref>).</p>
<p>We report profound differences in the phenotypes of RICE and EE <italic>P. tricornutum</italic> cell lines in terms of expression levels, phenotypic consistency and sub-clonal population composition. Non-integrative episomes are associated with much more consistent phenotypes in the scenarios we tested regarding overexpression and EE exconjugants do not seem to require extensive screening. Furthermore, bacterial conjugation tends to result in more clones in a shorter amount of time than biolistic bombardment. Altogether, these results indicate that EE will be an invaluable resource for genetic parts validation and modular assembly, and even automation of the design-build-test-learn cycle. These aspects of more complex synthetic biology strategies are crucial for heterologous production of high-value products such as monoterpenoids. Our results highlighted the particularly limited knowledge available on key aspects of EE in diatoms. This included stability, copy number and segregation patterns, and episome re-arrangement, which we did not observe but has been reported (Slattery et al., <xref ref-type="bibr" rid="B100">2018</xref>). Such characterisations still need to be addressed to fully exploit EE as a synthetic biology platform in diatoms.</p>
<p>In contrast, RICE cell lines are associated with high variability and overall higher expression levels, and by using large-scale screening it is possible to isolate particularly high expressing lines. These superior diatom cell lines bear highly concatenated arrangements of exogenous DNA, present as vast islands within or nearby predicted protein-coding genes. This raises a concern about this widespread method of generating transgenic diatom cell lines, as disrupting numerous protein coding regions can introduce unknown changes to <italic>P. tricornutum</italic> physiology that may not be easily detected. This is a particularly relevant issue in functional genetics studies involving overexpression, knock-down or knock-out constructs, which are traditionally delivered by biolistics, and randomly integrated in the genome of diatoms. On the contrary, it is not yet known if such large, highly concatenated integration events might be a factor in transgene stability and expression. In such scenario, RICE via biolistic bombardment, might be preferable over EE for obtaining high expressing cell lines. Finally, although it has been shown that high copy number and transgene tandem repeats can cause transcriptional silencing of transgene cassettes in other organisms (Kaufman et al., <xref ref-type="bibr" rid="B54">2008</xref>; Moritz et al., <xref ref-type="bibr" rid="B77">2016</xref>), our findings highlight the need to explore copy number and transgene arrangement optimisation in more detail, as this may well not be the case in <italic>P. tricornutum</italic>.</p>
<p>Whilst it is generally accepted that exogenous DNA delivered by biolistic bombardment randomly integrates in diatom chromosomes, the implications of this may have previously been overlooked, particularly at a time when CRISPR-Cas9 technology is being developed. While there is a general concern in CRISPR research to monitor and prevent off-target cutting by CRISPR-Cas9 itself, our results demonstrate that off-target effects from random integration of exogenous constructs such as vector backbone and DNA-encoded CRISPR-Cas9 components, could be just as much cause for concern. In this way, generating a precise knock-in or knock-out genotype by randomly integrating CRISPR-Cas9 components is suboptimal. As suggested by other works (Sharma et al., <xref ref-type="bibr" rid="B97">2018</xref>; Stukenberg et al., <xref ref-type="bibr" rid="B102">2018</xref>), our findings clearly demonstrate the need to move toward non-integrative alternatives, such as episomal expression (Slattery et al., <xref ref-type="bibr" rid="B100">2018</xref>) and ribonucleoprotein delivery (Serif et al., <xref ref-type="bibr" rid="B95">2018</xref>).</p>
<p>This research also identified putative safe-harbour or neutral loci that could be tested for targeted integration in <italic>P. tricornutum</italic>. Neutral sites in cyanobacteria have been used for targeted integration in metabolic engineering for multigene pathway assembly (Bentley et al., <xref ref-type="bibr" rid="B7">2014</xref>) and dual knock-in knock-out modifications (Li et al., <xref ref-type="bibr" rid="B68">2016</xref>). Synthetic &#x0201C;landing pads&#x0201D; are useful for gene stacking via &#x0201C;domino cloning,&#x0201D; but depend on the knowledge of robust, reliable safe harbour loci prior to being feasibly applied to diatoms (Karas et al., <xref ref-type="bibr" rid="B53">2015b</xref>). While some of the loci we identified harbour predicted protein coding regions, this is not unusual for safe harbours, as seen in human cell lines (e.g., CCR5 and ROSA26 loci) and mouse cell lines (e.g., Rosa26 locus), which all occur within protein coding regions. Furthermore, regions that may currently appear to be intergenic or non-functional may be re-categorised in the future, as more information about &#x0201C;junk DNA,&#x0201D; transcripts without function (TUFs) and unannotated regulatory regions are discovered (Gingeras, <xref ref-type="bibr" rid="B34">2007</xref>).</p>
<p>Finally, to the best of our knowledge, this work reports for the first time the suitability and utility of third generation long-read whole-genome sequencing to reveal the previously unknown nature of chromosomal integration sites, that would not have been feasible with conventional short-read sequencing. Future work investigating trans-genomes, such as low expression RICE cell lines or epigenetic modifications including DNA methylation patterns (Jain et al., <xref ref-type="bibr" rid="B47">2018</xref>; Jupe et al., <xref ref-type="bibr" rid="B51">2019</xref>), would build upon this knowledge to help uncover mechanisms driving transgene integration in diatoms. Such knowledge is important for developing better functional genomics tools including targeted genome editing. Our research primarily aimed at tracking specific, known transgenic constructs in EE and RICE transgenic diatoms cell lines. Long-read whole-genome sequencing technology can also be used to identify changes to the genome independent of an integration event, such as large translocations (Jupe et al., <xref ref-type="bibr" rid="B51">2019</xref>) and deletions (Nicholls et al., <xref ref-type="bibr" rid="B79">2019</xref>), purely due to the disruptive nature of the DNA delivery method. Our work lays the basis for future research efforts specifically focused on these relevant aspects, to investigate the impact of biolistic bombardment itself on genome integrity.</p>
<p>In conclusion, advancing synthetic pathway construction in <italic>P. tricornutum</italic> would ideally combine the reproducibility of EE with the high expression achievable through RICE, which could be achieved by targeted chromosomal integration. This work lays key groundwork for these developments which are crucial for extending knowledge on diatom biology and elevating model species such as <italic>P. tricornutum</italic> as a widely used synthetic biology chassis organism in a broad array of biotechnological applications.</p>
</sec>
<sec sec-type="data-availability-statement" id="s5">
<title>Data Availability Statement</title>
<p>The datasets generated for this study can be found in NCBI BioProject repository under the ID PRJNA593624.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>JG designed the study, conducted the experiments, analysed data and wrote the manuscript. TK performed DNA extraction and sequencing, analysed data, and contributed writing the manuscript. RA analysed data and contributed writing the manuscript. UK performed GC-MS analyses and contributed writing the manuscript. PR revised and edited the manuscript. MF designed the study, conducted and supervised the experiments, and contributed writing the manuscript.</p>
</sec>
<sec id="s7">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack><p>The authors would like to thank Kun Xiao and Taya Lapshina for technical assistance.</p>
</ack><sec sec-type="supplementary-material" id="s8">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fbioe.2020.00513/full&#x00023;supplementary-material">https://www.frontiersin.org/articles/10.3389/fbioe.2020.00513/full&#x00023;supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.docx" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ainley</surname> <given-names>W. M.</given-names></name> <name><surname>Sastry-Dent</surname> <given-names>L.</given-names></name> <name><surname>Welter</surname> <given-names>M. E.</given-names></name> <name><surname>Murray</surname> <given-names>M. G.</given-names></name> <name><surname>Zeitler</surname> <given-names>B.</given-names></name> <name><surname>Amora</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Trait stacking via targeted genome editing</article-title>. <source>Plant Biotechnol. J.</source> <volume>11</volume>, <fpage>1126</fpage>&#x02013;<lpage>1134</lpage>. <pub-id pub-id-type="doi">10.1111/pbi.12107</pub-id><pub-id pub-id-type="pmid">23953646</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alhaji</surname> <given-names>S. Y.</given-names></name> <name><surname>Ngai</surname> <given-names>S. C.</given-names></name> <name><surname>Abdullah</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Silencing of transgene expression in mammalian cells by DNA methylation and histone modifications in gene therapy perspective</article-title>. <source>Biotechnol. Genet. Eng. Rev.</source> <volume>35</volume>, <fpage>1</fpage>&#x02013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.1080/02648725.2018.1551594</pub-id><pub-id pub-id-type="pmid">30514178</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allen</surname> <given-names>A. E.</given-names></name> <name><surname>Dupont</surname> <given-names>C. L.</given-names></name> <name><surname>Oborn&#x000ED;k</surname> <given-names>M.</given-names></name> <name><surname>Hor&#x000E1;k</surname> <given-names>A.</given-names></name> <name><surname>Nunes-Nesi</surname> <given-names>A.</given-names></name> <name><surname>McCrow</surname> <given-names>J. P.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Evolution and metabolic significance of the urea cycle in photosynthetic diatoms</article-title>. <source>Nature</source> <volume>473</volume>, <fpage>203</fpage>&#x02013;<lpage>207</lpage>. <pub-id pub-id-type="doi">10.1038/nature10074</pub-id><pub-id pub-id-type="pmid">21562560</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allen</surname> <given-names>M.</given-names></name> <name><surname>Poggiali</surname> <given-names>D.</given-names></name> <name><surname>Whitaker</surname> <given-names>K.</given-names></name> <name><surname>Marshall</surname> <given-names>T. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Raincloud plots: a multi-platform tool for robust data visualization</article-title>. <source>Wellcome Open Res.</source> <volume>4</volume>:<fpage>63</fpage>. <pub-id pub-id-type="doi">10.7287/peerj.preprints.27137</pub-id><pub-id pub-id-type="pmid">31069261</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Armbrust</surname> <given-names>E. V.</given-names></name></person-group> (<year>2009</year>). <article-title>The life of diatoms in the world&#x00027;s oceans</article-title>. <source>Nature</source> <volume>459</volume>, <fpage>185</fpage>&#x02013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1038/nature08057</pub-id><pub-id pub-id-type="pmid">19444204</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashworth</surname> <given-names>J.</given-names></name> <name><surname>Turkarslan</surname> <given-names>S.</given-names></name> <name><surname>Harris</surname> <given-names>M.</given-names></name> <name><surname>Orellana</surname> <given-names>M. V.</given-names></name> <name><surname>Baliga</surname> <given-names>N. S.</given-names></name></person-group> (<year>2016</year>). <article-title>Pan-transcriptomic analysis identifies coordinated and orthologous functional modules in the diatoms <italic>Thalassiosira pseudonana</italic> and <italic>Phaeodactylum tricornutum</italic></article-title>. <source>Mar. Genomics</source> <volume>26</volume>, <fpage>21</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1016/j.margen.2015.10.011</pub-id><pub-id pub-id-type="pmid">26560047</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bentley</surname> <given-names>F. K.</given-names></name> <name><surname>Zurbriggen</surname> <given-names>A.</given-names></name> <name><surname>Melis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Heterologous expression of the mevalonic acid pathway in cyanobacteria enhances endogenous carbon partitioning to isoprene</article-title>. <source>Mol. Plant</source> <volume>7</volume>, <fpage>71</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1093/mp/sst134</pub-id><pub-id pub-id-type="pmid">24157609</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berges</surname> <given-names>J. A.</given-names></name> <name><surname>Franklin</surname> <given-names>D. J.</given-names></name> <name><surname>Harrison</surname> <given-names>P. J.</given-names></name></person-group> (<year>2001</year>). <article-title>Evolution of an artificial seawater medium: improvements in enriched seawater, artificial water over the last two decades</article-title>. <source>J. Phycol.</source> <volume>37</volume>, <fpage>1138</fpage>&#x02013;<lpage>1145</lpage>. <pub-id pub-id-type="doi">10.1046/j.1529-8817.2001.01052.x</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bian</surname> <given-names>G.</given-names></name> <name><surname>Deng</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name></person-group> (<year>2017</year>). <article-title>Strategies for terpenoid overproduction and new terpenoid discovery</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>48</volume>, <fpage>234</fpage>&#x02013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1016/j.copbio.2017.07.002</pub-id><pub-id pub-id-type="pmid">28779606</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bouton</surname> <given-names>A. H.</given-names></name> <name><surname>Smith</surname> <given-names>M. M.</given-names></name></person-group> (<year>1986</year>). <article-title>Fine-structure analysis of the DNA sequence requirements for autonomous replication of <italic>Saccharomyces cerevisiae</italic> plasmids</article-title>. <source>Mol. Cell. Biol.</source> <volume>6</volume>, <fpage>2354</fpage>&#x02013;<lpage>2363</lpage>. <pub-id pub-id-type="doi">10.1128/MCB.6.7.2354</pub-id><pub-id pub-id-type="pmid">3023929</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cantos</surname> <given-names>C.</given-names></name> <name><surname>Francisco</surname> <given-names>P.</given-names></name> <name><surname>Trijatmiko</surname> <given-names>K. R.</given-names></name> <name><surname>Slamet-Loedin</surname> <given-names>I.</given-names></name> <name><surname>Chadha-Mohanty</surname> <given-names>P. K.</given-names></name></person-group> (<year>2014</year>). <article-title>Identification of &#x0201C;safe harbor&#x0201D; loci in indica rice genome by harnessing the property of zinc-finger nucleases to induce DNA damage and repair</article-title>. <source>Front. Plant Sci.</source> <volume>5</volume>:<fpage>302</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2014.00302</pub-id><pub-id pub-id-type="pmid">25018764</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caputi</surname> <given-names>L.</given-names></name> <name><surname>Franke</surname> <given-names>J.</given-names></name> <name><surname>Farrow</surname> <given-names>S. C.</given-names></name> <name><surname>Chung</surname> <given-names>K.</given-names></name> <name><surname>Payne</surname> <given-names>R. M. E.</given-names></name> <name><surname>Nguyen</surname> <given-names>T. D.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Missing enzymes in the biosynthesis of the anticancer drug vinblastine in <italic>Madagascar periwinkle</italic></article-title>. <source>Science</source> <volume>360</volume>, <fpage>1235</fpage>&#x02013;<lpage>1239</lpage>. <pub-id pub-id-type="doi">10.1126/science.aat4100</pub-id><pub-id pub-id-type="pmid">29724909</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carver</surname> <given-names>T. J.</given-names></name> <name><surname>Rutherford</surname> <given-names>K. M.</given-names></name> <name><surname>Berriman</surname> <given-names>M.</given-names></name> <name><surname>Rajandream</surname> <given-names>M. A.</given-names></name> <name><surname>Barrell</surname> <given-names>B. G.</given-names></name> <name><surname>Parkhill</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>ACT: the artemis comparison tool</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>3422</fpage>&#x02013;<lpage>3423</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti553</pub-id><pub-id pub-id-type="pmid">15976072</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cerutti</surname> <given-names>H.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Msanne</surname> <given-names>J.</given-names></name> <name><surname>Repas</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>RNA-mediated silencing in algae: biological roles and tools for analysis of gene function</article-title>. <source>Eukaryotic Cell</source> <volume>10</volume>, <fpage>1164</fpage>&#x02013;<lpage>1172</lpage>. <pub-id pub-id-type="doi">10.1128/EC.05106-11</pub-id><pub-id pub-id-type="pmid">21803865</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Viljoen</surname> <given-names>A. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Geraniol - a review of a commercially important fragrance material</article-title>. <source>South Afr. J. Bot.</source> <volume>76</volume>, <fpage>643</fpage>&#x02013;<lpage>651</lpage>. <pub-id pub-id-type="doi">10.1016/j.sajb.2010.05.008</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>J. K.</given-names></name> <name><surname>Lewis</surname> <given-names>A. M.</given-names></name> <name><surname>Kim</surname> <given-names>D. S.</given-names></name> <name><surname>Dyess</surname> <given-names>T.</given-names></name> <name><surname>Alper</surname> <given-names>H. S.</given-names></name></person-group> (<year>2016</year>). <article-title>Identifying and retargeting transcriptional hot spots in the human genome</article-title>. <source>Biotechnol. J.</source> <volume>11</volume>, <fpage>1100</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1002/biot.201600015</pub-id><pub-id pub-id-type="pmid">27311394</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clarke</surname> <given-names>L.</given-names></name> <name><surname>Carbon</surname> <given-names>J.</given-names></name></person-group> (<year>1976</year>). <article-title>A colony bank containing synthetic Col El hybrid plasmids representative of the entire <italic>E. coli</italic> genome</article-title>. <source>Cell</source> <volume>9</volume>, <fpage>91</fpage>&#x02013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1016/0092-8674(76)90055-6</pub-id><pub-id pub-id-type="pmid">788919</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Daboussi</surname> <given-names>F.</given-names></name> <name><surname>Leduc</surname> <given-names>S.</given-names></name> <name><surname>Mar&#x000E9;chal</surname> <given-names>A.</given-names></name> <name><surname>Dubois</surname> <given-names>G.</given-names></name> <name><surname>Guyot</surname> <given-names>V.</given-names></name> <name><surname>Perez-Michaut</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Genome engineering empowers the diatom <italic>Phaeodactylum tricornutum</italic> for biotechnology</article-title>. <source>Nat. Commun.</source> <volume>5</volume>:<fpage>3831</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms4831</pub-id><pub-id pub-id-type="pmid">24871200</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>D&#x00027;Adamo</surname> <given-names>S.</given-names></name> <name><surname>Schiano di Visconte</surname> <given-names>G.</given-names></name> <name><surname>Lowe</surname> <given-names>G.</given-names></name> <name><surname>Szaub-Newton</surname> <given-names>J.</given-names></name> <name><surname>Beacham</surname> <given-names>T.</given-names></name> <name><surname>Landels</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Engineering the unicellular alga phaeodactylum tricornutum for high-value plant triterpenoid production</article-title>. <source>Plant Biotechnol. J.</source> <volume>17</volume>, <fpage>75</fpage>&#x02013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1111/pbi.12948</pub-id><pub-id pub-id-type="pmid">29754445</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>A. M.</given-names></name> <name><surname>Iovinella</surname> <given-names>M.</given-names></name> <name><surname>James</surname> <given-names>S.</given-names></name> <name><surname>Robshaw</surname> <given-names>T.</given-names></name> <name><surname>Dodson</surname> <given-names>R.</given-names></name> <name><surname>Herrero-davila</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2016</year>). Using MinION nanopore sequencing to generate. <source>bioRxiv</source>. <pub-id pub-id-type="doi">10.1101/076208</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delvigne</surname> <given-names>F.</given-names></name> <name><surname>Zune</surname> <given-names>Q.</given-names></name> <name><surname>Lara</surname> <given-names>A. R.</given-names></name> <name><surname>Al-Soud</surname> <given-names>W.</given-names></name> <name><surname>S&#x000F8;rensen</surname> <given-names>S. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Metabolic variability in bioprocessing: implications of microbial phenotypic heterogeneity</article-title>. <source>Trends Biotechnol</source>. <volume>32</volume>, <fpage>608</fpage>&#x02013;<lpage>616</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibtech.2014.10.002</pub-id><pub-id pub-id-type="pmid">25457387</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diner</surname> <given-names>R. E.</given-names></name> <name><surname>Bielinski</surname> <given-names>V. A.</given-names></name> <name><surname>Dupont</surname> <given-names>C. L.</given-names></name> <name><surname>Allen</surname> <given-names>A. E.</given-names></name> <name><surname>Weyman</surname> <given-names>P. D.</given-names></name></person-group> (<year>2016</year>). <article-title>Refinement of the diatom episome maintenance sequence and improvement of conjugation-based DNA delivery methods</article-title>. <source>Front. Bioeng. Biotechnol.</source> <volume>4</volume>:<fpage>65</fpage>. <pub-id pub-id-type="doi">10.3389/fbioe.2016.00065</pub-id><pub-id pub-id-type="pmid">27551676</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doron</surname> <given-names>L.</given-names></name> <name><surname>Segal</surname> <given-names>N.</given-names></name> <name><surname>Shapira</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Transgene expression in microalgae&#x02014;from tools to applications</article-title>. <source>Front. Plant Sci.</source> <volume>7</volume>:<fpage>505</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2016.00505</pub-id><pub-id pub-id-type="pmid">27148328</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Elgin</surname> <given-names>S. C. R.</given-names></name></person-group> (<year>1996</year>). <article-title>Heterochromatin and gene regulation in <italic>Drosophila</italic></article-title>. <source>Curr. Opin. Genet. Dev.</source> <volume>6</volume>, <fpage>193</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1016/S0959-437X(96)80050-5</pub-id><pub-id pub-id-type="pmid">8722176</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fabris</surname> <given-names>M.</given-names></name> <name><surname>George</surname> <given-names>J.</given-names></name> <name><surname>Kuzhiumparambil</surname> <given-names>U.</given-names></name> <name><surname>Lawson</surname> <given-names>C. A.</given-names></name> <name><surname>Jaramillo Madrid</surname> <given-names>A. C.</given-names></name> <name><surname>Abbriano</surname> <given-names>R. M.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Extrachromosomal genetic engineering of the marine diatom Phaeodactylum tricornutum enables the heterologous production of monoterpenoids</article-title>. <source>ACS Synth. Biol.</source> <volume>9</volume>, <fpage>598</fpage>&#x02013;<lpage>612</lpage>. <pub-id pub-id-type="doi">10.1021/acssynbio.9b00455</pub-id><pub-id pub-id-type="pmid">32032487</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fabris</surname> <given-names>M.</given-names></name> <name><surname>Matthijs</surname> <given-names>M.</given-names></name> <name><surname>Carbonelle</surname> <given-names>S.</given-names></name> <name><surname>Moses</surname> <given-names>T.</given-names></name> <name><surname>Pollier</surname> <given-names>J.</given-names></name> <name><surname>Dasseville</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Tracking the sterol biosynthesis pathway of the diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>N. Phytol.</source> <volume>204</volume>, <fpage>521</fpage>&#x02013;<lpage>535</lpage>. <pub-id pub-id-type="doi">10.1111/nph.12917</pub-id><pub-id pub-id-type="pmid">24996048</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fabris</surname> <given-names>M.</given-names></name> <name><surname>Matthijs</surname> <given-names>M.</given-names></name> <name><surname>Rombauts</surname> <given-names>S.</given-names></name> <name><surname>Vyverman</surname> <given-names>W.</given-names></name> <name><surname>Goossens</surname> <given-names>A.</given-names></name> <name><surname>Baart</surname> <given-names>G. J. E.</given-names></name></person-group> (<year>2012</year>). <article-title>The metabolic blueprint of <italic>Phaeodactylum tricornutum</italic> reveals a eukaryotic entner-doudoroff glycolytic pathway</article-title>. <source>Plant J.</source> <volume>70</volume>, <fpage>1004</fpage>&#x02013;<lpage>1014</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2012.04941.x</pub-id><pub-id pub-id-type="pmid">22332784</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Falciatore</surname> <given-names>A.</given-names></name> <name><surname>Casotti</surname> <given-names>R.</given-names></name> <name><surname>Leblanc</surname> <given-names>C.</given-names></name> <name><surname>Abrescia</surname> <given-names>C.</given-names></name> <name><surname>Bowler</surname> <given-names>C.</given-names></name></person-group> (<year>1999</year>). <article-title>Transformation of nonselectable reporter genes in marine diatoms</article-title>. <source>Mar. Biotechnol.</source> <volume>1</volume>, <fpage>239</fpage>&#x02013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1007/PL00011773</pub-id><pub-id pub-id-type="pmid">10383998</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finn</surname> <given-names>R. D.</given-names></name> <name><surname>Clements</surname> <given-names>J.</given-names></name> <name><surname>Eddy</surname> <given-names>S. R.</given-names></name></person-group> (<year>2011</year>). <article-title>HMMER web server: interactive sequence similarity searching</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>29</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr367</pub-id><pub-id pub-id-type="pmid">21593126</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fournier</surname> <given-names>T.</given-names></name> <name><surname>Gounot</surname> <given-names>J. S.</given-names></name> <name><surname>Freel</surname> <given-names>K.</given-names></name> <name><surname>Cruaud</surname> <given-names>C.</given-names></name> <name><surname>Lemainque</surname> <given-names>A.</given-names></name> <name><surname>Aury</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>High-quality <italic>de novo</italic> genome assembly of the dekkera bruxellensis yeast using nanopore MinION sequencing</article-title>. <source>G3 (Bethesda)</source> <volume>7</volume>, <fpage>3243</fpage>&#x02013;<lpage>3250</lpage>. <pub-id pub-id-type="doi">10.1534/g3.117.300128</pub-id><pub-id pub-id-type="pmid">28983066</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gangl</surname> <given-names>D.</given-names></name> <name><surname>Zedler</surname> <given-names>J. A. Z.</given-names></name> <name><surname>Rajakumar</surname> <given-names>P. D.</given-names></name> <name><surname>Martinez</surname> <given-names>E. M. R.</given-names></name> <name><surname>Riseley</surname> <given-names>A.</given-names></name> <name><surname>Wodarczyk</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Biotechnological exploitation of microalgae</article-title>. <source>J. Exp. Bot.</source> <volume>66</volume>, <fpage>6975</fpage>&#x02013;<lpage>6990</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/erv426</pub-id><pub-id pub-id-type="pmid">26400987</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gehlen</surname> <given-names>L. R.</given-names></name> <name><surname>Nagai</surname> <given-names>S.</given-names></name> <name><surname>Shimada</surname> <given-names>K.</given-names></name> <name><surname>Meister</surname> <given-names>P.</given-names></name> <name><surname>Taddei</surname> <given-names>A.</given-names></name> <name><surname>Gasser</surname> <given-names>S. M.</given-names></name></person-group> (<year>2011</year>). <article-title>Nuclear geometry and rapid mitosis ensure asymmetric episome segregation in yeast</article-title>. <source>Curr. Biol.</source> <volume>21</volume>, <fpage>25</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2010.12.016</pub-id><pub-id pub-id-type="pmid">21194950</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ghosh</surname> <given-names>S. K.</given-names></name> <name><surname>Hajra</surname> <given-names>S.</given-names></name> <name><surname>Jayaram</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>Faithful segregation of the multicopy yeast plasmid through cohesin-mediated recognition of sisters</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>104</volume>, <fpage>13034</fpage>&#x02013;<lpage>13039</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0702996104</pub-id><pub-id pub-id-type="pmid">17670945</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gingeras</surname> <given-names>T.</given-names></name></person-group> (<year>2007</year>). <article-title>Origin of phenotypes: genes and transcripts</article-title>. <source>Genome Res.</source> <volume>17</volume>, <fpage>682</fpage>&#x02013;<lpage>690</lpage>. <pub-id pub-id-type="doi">10.1101/gr.6525007</pub-id><pub-id pub-id-type="pmid">17567989</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Goyal</surname> <given-names>A.</given-names></name> <name><surname>Bhowmik</surname> <given-names>P. K.</given-names></name> <name><surname>Basu</surname> <given-names>S. K.</given-names></name></person-group> (<year>2009</year>). <article-title>Minichromosomes : the second generation genetic engineering tool</article-title>. <source>Plant Omics J.</source> <volume>2</volume>, <fpage>1</fpage>&#x02013;<lpage>8</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.pomics.com/Saikat_January2009_2_1_1_8.pdf">https://www.pomics.com/Saikat_January2009_2_1_1_8.pdf</ext-link></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Greiner</surname> <given-names>A.</given-names></name> <name><surname>Kelterborn</surname> <given-names>S.</given-names></name> <name><surname>Evers</surname> <given-names>H.</given-names></name> <name><surname>Kreimer</surname> <given-names>G.</given-names></name> <name><surname>Sizova</surname> <given-names>I.</given-names></name> <name><surname>Hegemann</surname> <given-names>P.</given-names></name></person-group> (<year>2017</year>). <article-title>Targeting of photoreceptor genes in <italic>Chlamydomonas reinhardtii</italic> via zinc- finger nucleases and CRISPR/Cas9</article-title>. <source>Plant Cell</source> <volume>29</volume>, <fpage>2498</fpage>&#x02013;<lpage>2518</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.17.00659</pub-id><pub-id pub-id-type="pmid">28978758</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gu</surname> <given-names>Y.</given-names></name> <name><surname>Gao</surname> <given-names>J.</given-names></name> <name><surname>Cao</surname> <given-names>M.</given-names></name> <name><surname>Dong</surname> <given-names>C.</given-names></name> <name><surname>Lian</surname> <given-names>J.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Construction of a series of episomal plasmids and their application in the development of an efficient CRISPR/Cas9 system in <italic>Pichia pastoris</italic></article-title>. <source>World J. Microbiol. Biotechnol.</source> <volume>35</volume>:<fpage>79</fpage>. <pub-id pub-id-type="doi">10.1007/s11274-019-2654-5</pub-id><pub-id pub-id-type="pmid">31134410</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Hallmann</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>Algal transgenics and biotechnology</article-title>. <source>Transgenic Plant J</source>, <volume>1</volume>, <fpage>81</fpage>&#x02013;<lpage>98</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.globalsciencebooks.info/Online/GSBOnline/images/0706/TPJ_1(1)/TPJ_1(1)81-98o.pdf">http://www.globalsciencebooks.info/Online/GSBOnline/images/0706/TPJ_1(1)/TPJ_1(1)81-98o.pdf</ext-link></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamilton</surname> <given-names>M. L.</given-names></name> <name><surname>Haslam</surname> <given-names>R. P.</given-names></name> <name><surname>Napier</surname> <given-names>J. A.</given-names></name> <name><surname>Sayanova</surname> <given-names>O.</given-names></name></person-group> (<year>2014</year>). <article-title>Metabolic engineering of <italic>Phaeodactylum tricornutum</italic> for the enhanced accumulation of omega-3 long chain polyunsaturated fatty acids</article-title>. <source>Metab. Eng.</source> <volume>22</volume>, <fpage>3</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1016/j.ymben.2013.12.003</pub-id><pub-id pub-id-type="pmid">24333273</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamilton</surname> <given-names>M. L.</given-names></name> <name><surname>Warwick</surname> <given-names>J.</given-names></name> <name><surname>Terry</surname> <given-names>A.</given-names></name> <name><surname>Allen</surname> <given-names>M. J.</given-names></name> <name><surname>Napier</surname> <given-names>J. A.</given-names></name> <name><surname>Sayanova</surname> <given-names>O.</given-names></name></person-group> (<year>2015</year>). <article-title>Towards the industrial production of omega-3 long chain polyunsaturated fatty acids from a genetically modified diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>PLoS ONE</source> <volume>10</volume>:<fpage>e0144054</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0144054</pub-id><pub-id pub-id-type="pmid">26658738</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hempel</surname> <given-names>F.</given-names></name> <name><surname>Bozarth</surname> <given-names>A. S.</given-names></name> <name><surname>Lindenkamp</surname> <given-names>N.</given-names></name> <name><surname>Klingl</surname> <given-names>A.</given-names></name> <name><surname>Zauner</surname> <given-names>S.</given-names></name> <name><surname>Linne</surname> <given-names>U.</given-names></name> <etal/></person-group>. (<year>2011a</year>). <article-title>Microalgae as bioreactors for bioplastic production</article-title>. <source>Microb. Cell Fact.</source> <volume>10</volume>:<fpage>81</fpage>. <pub-id pub-id-type="doi">10.1186/1475-2859-10-81</pub-id><pub-id pub-id-type="pmid">22004563</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hempel</surname> <given-names>F.</given-names></name> <name><surname>Lau</surname> <given-names>J.</given-names></name> <name><surname>Klingl</surname> <given-names>A.</given-names></name> <name><surname>Maier</surname> <given-names>U. G.</given-names></name></person-group> (<year>2011b</year>). <article-title>Algae as protein factories: expression of a human antibody and the respective antigen in the diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e28424</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0028424</pub-id><pub-id pub-id-type="pmid">22164289</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hempel</surname> <given-names>F.</given-names></name> <name><surname>Maier</surname> <given-names>U. G.</given-names></name></person-group> (<year>2012</year>). <article-title>An engineered diatom acting like a plasma cell secreting human IgG antibodies with high efficiency</article-title>. <source>Microb. Cell Fact.</source> <volume>11</volume>:<fpage>126</fpage>. <pub-id pub-id-type="doi">10.1186/1475-2859-11-126</pub-id><pub-id pub-id-type="pmid">22970838</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hopes</surname> <given-names>A.</given-names></name> <name><surname>Nekrasov</surname> <given-names>V.</given-names></name> <name><surname>Kamoun</surname> <given-names>S.</given-names></name> <name><surname>Mock</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <article-title>Editing of the urease gene by CRISPR-Cas in the diatom <italic>Thalassiosira pseudonana</italic></article-title>. <source>Plant Methods</source> <volume>12</volume>:<fpage>49</fpage>. <pub-id pub-id-type="doi">10.1186/s13007-016-0148-0</pub-id><pub-id pub-id-type="pmid">27904648</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Birch</surname> <given-names>R. G.</given-names></name></person-group> (<year>2000</year>). <article-title>Rapid amplification and cloning of Tn5 flanking fragments by inverse PCR</article-title>. <source>Lett. Appl. Microbiol.</source> <volume>31</volume>, <fpage>149</fpage>&#x02013;<lpage>153</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-2672.2000.00781.x</pub-id><pub-id pub-id-type="pmid">10972718</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>W.</given-names></name> <name><surname>Daboussi</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>Genetic and metabolic engineering in diatoms</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci.</source> <volume>372</volume>:<fpage>20160411</fpage>. <pub-id pub-id-type="doi">10.1098/rstb.2016.0411</pub-id><pub-id pub-id-type="pmid">28717021</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jain</surname> <given-names>M.</given-names></name> <name><surname>Koren</surname> <given-names>S.</given-names></name> <name><surname>Miga</surname> <given-names>K. H.</given-names></name> <name><surname>Quick</surname> <given-names>J.</given-names></name> <name><surname>Rand</surname> <given-names>A. C.</given-names></name> <name><surname>Sasani</surname> <given-names>T. A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Nanopore sequencing and assembly of a human genome with ultra-long reads</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume>, <fpage>338</fpage>&#x02013;<lpage>345</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.4060</pub-id><pub-id pub-id-type="pmid">29431738</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jeon</surname> <given-names>S.</given-names></name> <name><surname>Lim</surname> <given-names>J-M.</given-names></name> <name><surname>Lee</surname> <given-names>H-G.</given-names></name> <name><surname>Shin</surname> <given-names>S-E.</given-names></name> <name><surname>Kang</surname> <given-names>N. K.</given-names></name> <name><surname>Park</surname> <given-names>Y-I.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Current status and perspectives of genome editing technology for microalgae</article-title>. <source>Biotechnol. Biofuels</source> <volume>10</volume>:<fpage>267</fpage>. <pub-id pub-id-type="doi">10.1186/s13068-017-0957-z</pub-id><pub-id pub-id-type="pmid">29163669</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>G. Z.</given-names></name> <name><surname>Yao</surname> <given-names>M. D.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Zhou</surname> <given-names>L.</given-names></name> <name><surname>Song</surname> <given-names>T. Q.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Manipulation of GES and ERG20 for geraniol overproduction in <italic>Saccharomyces cerevisiae</italic></article-title>. <source>Metab. Eng.</source> <volume>41</volume>, <fpage>57</fpage>&#x02013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.1016/j.ymben.2017.03.005</pub-id><pub-id pub-id-type="pmid">28359705</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johansson</surname> <given-names>O. N.</given-names></name> <name><surname>T&#x000F6;pel</surname> <given-names>M.</given-names></name> <name><surname>Pinder</surname> <given-names>M. I. M.</given-names></name> <name><surname>Kourtchenko</surname> <given-names>O.</given-names></name> <name><surname>Blomberg</surname> <given-names>A.</given-names></name> <name><surname>Godhe</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Skeletonema marinoi as a new genetic model for marine chain-forming diatoms</article-title>. <source>Sci. Rep.</source> <volume>9</volume>:<fpage>5392</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-41085-5</pub-id><pub-id pub-id-type="pmid">30940823</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jupe</surname> <given-names>F.</given-names></name> <name><surname>Rivkin</surname> <given-names>A. C.</given-names></name> <name><surname>Michael</surname> <given-names>T. P.</given-names></name> <name><surname>Zander</surname> <given-names>M.</given-names></name> <name><surname>Motley</surname> <given-names>S. T.</given-names></name> <name><surname>Sandoval</surname> <given-names>J. P.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>The complex architecture and epigenomic impact of plant T-DNA insertions</article-title>. <source>PLoS Genetics</source> <volume>15</volume>:<fpage>e1007819</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1007819</pub-id><pub-id pub-id-type="pmid">30657772</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karas</surname> <given-names>B. J.</given-names></name> <name><surname>Diner</surname> <given-names>R. E.</given-names></name> <name><surname>Lefebvre</surname> <given-names>S. C.</given-names></name> <name><surname>McQuaid</surname> <given-names>J.</given-names></name> <name><surname>Phillips</surname> <given-names>A. P. R.</given-names></name> <name><surname>Noddings</surname> <given-names>C. M.</given-names></name> <etal/></person-group>. (<year>2015a</year>). <article-title>Designer diatom episomes delivered by bacterial conjugation</article-title>. <source>Nat. Commun.</source> <volume>6</volume>:<fpage>6925</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms7925</pub-id><pub-id pub-id-type="pmid">25897682</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karas</surname> <given-names>B. J.</given-names></name> <name><surname>Suzuki</surname> <given-names>Y.</given-names></name> <name><surname>Weyman</surname> <given-names>P. D.</given-names></name></person-group> (<year>2015b</year>). <article-title>Strategies for cloning and manipulating natural and synthetic chromosomes</article-title>. <source>Chromosome Res.</source> <volume>23</volume>, <fpage>57</fpage>&#x02013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1007/s10577-014-9455-3</pub-id><pub-id pub-id-type="pmid">25596826</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaufman</surname> <given-names>W. L.</given-names></name> <name><surname>Kocman</surname> <given-names>I.</given-names></name> <name><surname>Agrawal</surname> <given-names>V.</given-names></name> <name><surname>Rahn</surname> <given-names>H. P.</given-names></name> <name><surname>Besser</surname> <given-names>D.</given-names></name> <name><surname>Gossen</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Homogeneity and persistence of transgene expression by omitting antibiotic selection in cell line isolation</article-title>. <source>Nucleic Acids Research</source>, <volume>36</volume>:<fpage>e111</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn508</pub-id><pub-id pub-id-type="pmid">18682524</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>E. J.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Cerutti</surname> <given-names>H.</given-names></name></person-group> (<year>2015</year>). <article-title>Gene silencing in microalgae: mechanisms and biological roles</article-title>. <source>Bioresour. Technol.</source> <volume>184</volume>, <fpage>23</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1016/j.biortech.2014.10.119</pub-id><pub-id pub-id-type="pmid">25466994</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kohli</surname> <given-names>A.</given-names></name> <name><surname>Christou</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Stable transgenes bear fruit</article-title>. <source>Nat. Biotechnol.</source> <volume>26</volume>, <fpage>653</fpage>&#x02013;<lpage>654</lpage>. <pub-id pub-id-type="doi">10.1038/nbt0608-653</pub-id><pub-id pub-id-type="pmid">18536688</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kohli</surname> <given-names>A.</given-names></name> <name><surname>Gonz&#x000E1;lez-Melendi</surname> <given-names>P.</given-names></name> <name><surname>Abranches</surname> <given-names>R.</given-names></name> <name><surname>Capell</surname> <given-names>T.</given-names></name> <name><surname>Stoger</surname> <given-names>E.</given-names></name> <name><surname>Christou</surname> <given-names>P.</given-names></name></person-group> (<year>2006</year>). <article-title>The quest to understand the basis and mechanisms that control expression of introduced transgenes in crop plants</article-title>. <source>Plant Signal. Behav.</source> <volume>1</volume>, <fpage>185</fpage>&#x02013;<lpage>195</lpage>. <pub-id pub-id-type="doi">10.4161/psb.1.4.3195</pub-id><pub-id pub-id-type="pmid">19521484</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kohli</surname> <given-names>A.</given-names></name> <name><surname>Leech</surname> <given-names>M.</given-names></name> <name><surname>Vain</surname> <given-names>P.</given-names></name> <name><surname>Laurie</surname> <given-names>D. A.</given-names></name> <name><surname>Christou</surname> <given-names>P.</given-names></name></person-group> (<year>1998</year>). <article-title>Transgene organization in rice engineered through direct DNA transfer supports a two-phase integration mechanism mediated by the establishment of integration hot spots</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>95</volume>, <fpage>7203</fpage>&#x02013;<lpage>7208</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.95.12.7203</pub-id><pub-id pub-id-type="pmid">9618563</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kohli</surname> <given-names>A.</given-names></name> <name><surname>Miro</surname> <given-names>B.</given-names></name> <name><surname>Twyman</surname> <given-names>R. M.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Transgene integration, expression and stability in plants : strategies for improvements,&#x0201D;</article-title> in <source>Transgenic Crop Plants</source>, eds C. Kole, C. H. Michler, A. G. Abbott, T. C. Hall (<publisher-loc>Berlin, Heidelberg: Springer</publisher-loc>), <fpage>201</fpage>&#x02013;<lpage>237</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-04809-8_7</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kremers</surname> <given-names>G. J.</given-names></name> <name><surname>Goedhart</surname> <given-names>J.</given-names></name> <name><surname>Van Munster</surname> <given-names>E. B.</given-names></name> <name><surname>Gadella</surname> <given-names>T. W. J.</given-names></name></person-group> (<year>2006</year>). <article-title>Cyan and yellow super fluorescent proteins with improved brightness, protein folding, and FRET f&#x000F6;rster radius</article-title>. <source>Biochemistry</source> <volume>45</volume>, <fpage>6570</fpage>&#x02013;<lpage>6580</lpage>. <pub-id pub-id-type="doi">10.1021/bi0516273</pub-id><pub-id pub-id-type="pmid">16716067</pub-id></citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kroth</surname> <given-names>P. G.</given-names></name></person-group> (<year>2007</year>). <article-title>Genetic transformation: a tool to study protein targeting in diatoms</article-title>. <source>Methods Mol. Biol.</source> <volume>390</volume>, <fpage>257</fpage>&#x02013;<lpage>267</lpage>. <pub-id pub-id-type="doi">10.1385/1-59745-466-4:257</pub-id><pub-id pub-id-type="pmid">17951693</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kroth</surname> <given-names>P. G.</given-names></name> <name><surname>Chiovitti</surname> <given-names>A.</given-names></name> <name><surname>Gruber</surname> <given-names>A.</given-names></name> <name><surname>Martin-Jezeque</surname> <given-names>V.</given-names></name> <name><surname>Mock</surname> <given-names>T.</given-names></name> <name><surname>Parker</surname> <given-names>M. S.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>A model for carbohydrate metabolism in the diatom <italic>Phaeodactylum tricornutum</italic> deduced from comparative whole genome analysis</article-title>. <source>PLoS ONE</source> <volume>3</volume>:<fpage>e1426</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0001426</pub-id><pub-id pub-id-type="pmid">18183306</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lavaud</surname> <given-names>J.</given-names></name> <name><surname>Materna</surname> <given-names>A. C.</given-names></name> <name><surname>Sturm</surname> <given-names>S.</given-names></name> <name><surname>Vugrinec</surname> <given-names>S.</given-names></name> <name><surname>Kroth</surname> <given-names>P. G.</given-names></name></person-group> (<year>2012</year>). <article-title>Silencing of the violaxanthin de-epoxidase gene in the diatom <italic>Phaeodactylum tricornutum</italic> reduces diatoxanthin synthesis and non-photochemical quenching</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e36806</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0036806</pub-id><pub-id pub-id-type="pmid">22629333</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J. S.</given-names></name> <name><surname>Kallehauge</surname> <given-names>T. B.</given-names></name> <name><surname>Pedersen</surname> <given-names>L. E.</given-names></name> <name><surname>Kildegaard</surname> <given-names>H. F.</given-names></name></person-group> (<year>2015</year>). <article-title>Site-specific integration in CHO cells mediated by CRISPR/Cas9 and homology-directed DNA repair pathway</article-title>. <source>Sci. Rep.</source> <volume>5</volume>:<fpage>8572</fpage>. <pub-id pub-id-type="doi">10.1038/srep08572</pub-id><pub-id pub-id-type="pmid">25712033</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le&#x000F3;n-Ba&#x000F1;ares</surname> <given-names>R.</given-names></name> <name><surname>Gonz&#x000E1;lez-Ballester</surname> <given-names>D.</given-names></name> <name><surname>Galv&#x000E1;n</surname> <given-names>A.</given-names></name> <name><surname>Fern&#x000E1;ndez</surname> <given-names>E.</given-names></name></person-group> (<year>2004</year>). <article-title>Transgenic microalgae as green cell-factories</article-title>. <source>Trends Biotechnol</source>. <volume>22</volume>, <fpage>45</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibtech.2003.11.003</pub-id><pub-id pub-id-type="pmid">14690622</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Fast and accurate long-read alignment with burrows-wheeler transform</article-title>. <source>Bioinformatics</source> <volume>26</volume>, <fpage>589</fpage>&#x02013;<lpage>595</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp698</pub-id><pub-id pub-id-type="pmid">20080505</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2078</fpage>&#x02013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Shen</surname> <given-names>C. R.</given-names></name> <name><surname>Huang</surname> <given-names>C-H.</given-names></name> <name><surname>Sung</surname> <given-names>L-Y.</given-names></name> <name><surname>Wu</surname> <given-names>M-Y.</given-names></name> <name><surname>Hu</surname> <given-names>Y-C.</given-names></name></person-group> (<year>2016</year>). <article-title>CRISPR-Cas9 for the genome engineering of cyanobacteria and succinate production</article-title>. <source>Metab. Eng.</source> <volume>38</volume>, <fpage>293</fpage>&#x02013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1016/j.ymben.2016.09.006</pub-id><pub-id pub-id-type="pmid">27693320</pub-id></citation></ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>H. Y.</given-names></name> <name><surname>Yen</surname> <given-names>S. C.</given-names></name> <name><surname>Kuo</surname> <given-names>P. C.</given-names></name> <name><surname>Chung</surname> <given-names>C. Y.</given-names></name> <name><surname>Yeh</surname> <given-names>K. L.</given-names></name> <name><surname>Huang</surname> <given-names>C. H.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Alkaline phosphatase promoter as an efficient driving element for exogenic recombinant in the marine diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>Algal Res.</source> <volume>23</volume>, <fpage>58</fpage>&#x02013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1016/j.algal.2017.01.007</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Nannas</surname> <given-names>N. J.</given-names></name> <name><surname>Fu</surname> <given-names>F.</given-names></name> <name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Aspinwall</surname> <given-names>B.</given-names></name> <name><surname>Parrott</surname> <given-names>W. A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Genome-scale sequence disruption following biolistic transformation in rice and maize</article-title>. <source>Plant Cell</source> <volume>31</volume>, <fpage>368</fpage>&#x02013;<lpage>383</lpage>. <pub-id pub-id-type="doi">10.1101/379644</pub-id><pub-id pub-id-type="pmid">30651345</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Hempel</surname> <given-names>F.</given-names></name> <name><surname>Stork</surname> <given-names>S.</given-names></name> <name><surname>Bolte</surname> <given-names>K.</given-names></name> <name><surname>Moog</surname> <given-names>D.</given-names></name> <name><surname>Heimerl</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Addressing various compartments of the diatom model organism <italic>Phaeodactylum tricornutum</italic> via sub-cellular marker proteins</article-title>. <source>Algal Res.</source> <volume>20</volume>, <fpage>249</fpage>&#x02013;<lpage>257</lpage>. <pub-id pub-id-type="doi">10.1016/j.algal.2016.10.018</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y. G.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name></person-group> (<year>2007</year>). <article-title>High-efficiency thermal asymmetric interlaced PCR for amplification of unknown flanking sequences</article-title>. <source>BioTechniques</source> <volume>43</volume>, <fpage>649</fpage>&#x02013;<lpage>656</lpage>. <pub-id pub-id-type="doi">10.2144/000112601</pub-id><pub-id pub-id-type="pmid">18072594</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y-T.</given-names></name> <name><surname>Sau1</surname> <given-names>S.</given-names></name> <name><surname>Ma1</surname> <given-names>C-H.</given-names></name> <name><surname>Kachroo1</surname> <given-names>A. H.</given-names></name> <name><surname>Rowley1</surname> <given-names>P. A.</given-names></name> <name><surname>Chang</surname> <given-names>K-M.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>The partitioning and copy number control systems of the selfish yeast plasmid: an optimized molecular design for stable persistence in host cells</article-title>. <source>Microbiol. Spectr</source>. <volume>23</volume>, <fpage>1</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1128/microbiolspec.PLAS-0003-2013</pub-id><pub-id pub-id-type="pmid">25541598</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Longworth</surname> <given-names>J.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Huete-Ortega</surname> <given-names>M.</given-names></name> <name><surname>Wright</surname> <given-names>P. C.</given-names></name> <name><surname>Vaidyanathan</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Proteome response of <italic>Phaeodactylum tricornutum</italic>, during lipid accumulation induced by nitrogen depletion</article-title>. <source>Algal Res.</source> <volume>18</volume>, <fpage>213</fpage>&#x02013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1016/j.algal.2016.06.015</pub-id><pub-id pub-id-type="pmid">27812494</pub-id></citation></ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McBride</surname> <given-names>A. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Chapter 4 replication and partitioning of papillomavirus genomes</article-title>. <source>Adv. Virus Res.</source> <volume>72</volume>, <fpage>155</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.1016/S0065-3527(08)00404-1</pub-id><pub-id pub-id-type="pmid">19081491</pub-id></citation></ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyer</surname> <given-names>P.</given-names></name></person-group> (<year>1995</year>). <article-title>Understanding and controlling transgene expression</article-title>. <source>Trends Biotechnol.</source> <volume>13</volume>, <fpage>332</fpage>&#x02013;<lpage>337</lpage>. <pub-id pub-id-type="doi">10.1016/S0167-7799(00)88977-5</pub-id></citation></ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moritz</surname> <given-names>B.</given-names></name> <name><surname>Woltering</surname> <given-names>L.</given-names></name> <name><surname>Becker</surname> <given-names>P. B.</given-names></name> <name><surname>G&#x000F6;pfert</surname> <given-names>U.</given-names></name></person-group> (<year>2016</year>). <article-title>High levels of histone H3 acetylation at the CMV promoter are predictive of stable expression in Chinese hamster ovary cells</article-title>. <source>Biotechnol. Prog.</source> <volume>32</volume>, <fpage>776</fpage>&#x02013;<lpage>786</lpage>. <pub-id pub-id-type="doi">10.1002/btpr.2271</pub-id><pub-id pub-id-type="pmid">27073121</pub-id></citation></ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Teruyuki</surname> <given-names>N.</given-names></name> <name><surname>Noguchi</surname> <given-names>R.</given-names></name> <name><surname>Ito</surname> <given-names>Y.</given-names></name> <name><surname>Watanabe</surname> <given-names>T.</given-names></name> <name><surname>Nishiyama</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>A stable, autonomously replicating plasmid vector containing pichia pastoris centromeric <italic>DNA</italic></article-title>. <source>Appl. Environ. Microbiol.</source> <volume>84</volume>:<fpage>e02882</fpage>&#x02013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.02882-17</pub-id><pub-id pub-id-type="pmid">29802190</pub-id></citation></ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicholls</surname> <given-names>P. K.</given-names></name> <name><surname>Bellott</surname> <given-names>D. W.</given-names></name> <name><surname>Cho</surname> <given-names>T. J.</given-names></name> <name><surname>Pyntikova</surname> <given-names>T.</given-names></name> <name><surname>Page</surname> <given-names>D. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Locating and characterizing a transgene integration site by nanopore sequencing</article-title>. <source>G3 (Bethesda)</source> <volume>9</volume>, <fpage>1481</fpage>&#x02013;<lpage>1486</lpage>. <pub-id pub-id-type="doi">10.1534/g3.119.300582</pub-id><pub-id pub-id-type="pmid">30837263</pub-id></citation></ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Norrander</surname> <given-names>J.</given-names></name> <name><surname>Kempe</surname> <given-names>T.</given-names></name> <name><surname>Messing</surname> <given-names>J.</given-names></name></person-group> (<year>1983</year>). <article-title>Construction of improved Ml3 vectors using oligodeoxynucleotidedirected mutagenesis (Ml3 cloning; synthetic DNA primers; gene synthesis; phosphoramidites; DNA polymerase I Klenow frag-ment)</article-title>. <source>Bren Road East</source> <volume>26</volume>, <fpage>935</fpage>&#x02013;<lpage>7335</lpage>. <pub-id pub-id-type="pmid">6323249</pub-id></citation></ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nymark</surname> <given-names>M.</given-names></name> <name><surname>Sharma</surname> <given-names>A. K.</given-names></name> <name><surname>Sparstad</surname> <given-names>T.</given-names></name> <name><surname>Bones</surname> <given-names>A. M.</given-names></name> <name><surname>Winge</surname> <given-names>P.</given-names></name></person-group> (<year>2016</year>). <article-title>A CRISPR/Cas9 system adapted for gene editing in marine algae</article-title>. <source>Sci. Rep.</source> <volume>6</volume>:<fpage>24951</fpage>. <pub-id pub-id-type="doi">10.1038/srep24951</pub-id><pub-id pub-id-type="pmid">27108533</pub-id></citation></ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paddon</surname> <given-names>C. J.</given-names></name> <name><surname>Westfall</surname> <given-names>P. J.</given-names></name> <name><surname>Pitera</surname> <given-names>D. J.</given-names></name> <name><surname>Benjamin</surname> <given-names>K.</given-names></name> <name><surname>Fisher</surname> <given-names>K.</given-names></name> <name><surname>McPhee</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>High-level semi-synthetic production of the potent antimalarial artemisinin</article-title>. <source>Nature</source> <volume>496</volume>, <fpage>528</fpage>&#x02013;<lpage>532</lpage>. <pub-id pub-id-type="doi">10.1038/nature12051</pub-id><pub-id pub-id-type="pmid">23575629</pub-id></citation></ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Papapetrou</surname> <given-names>E. P.</given-names></name> <name><surname>Schambach</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>Gene insertion into genomic safe harbors for human gene therapy</article-title>. <source>Mol. Ther.</source> <volume>24</volume>, <fpage>678</fpage>&#x02013;<lpage>684</lpage>. <pub-id pub-id-type="doi">10.1038/mt.2016.38</pub-id><pub-id pub-id-type="pmid">26867951</pub-id></citation></ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parker</surname> <given-names>J. D.</given-names></name> <name><surname>Rabinovitch</surname> <given-names>P. S.</given-names></name> <name><surname>Burmer</surname> <given-names>G. C.</given-names></name></person-group> (<year>1991</year>). <article-title>Walking polymerase chain reaction</article-title>. <source>Gene</source> <volume>19</volume>, <fpage>3055</fpage>&#x02013;<lpage>3060</lpage>. <pub-id pub-id-type="doi">10.1093/nar/19.11.3055</pub-id><pub-id pub-id-type="pmid">2057362</pub-id></citation></ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinder</surname> <given-names>M. I. M.</given-names></name> <name><surname>Johansson</surname> <given-names>O. N.</given-names></name> <name><surname>Almstedt</surname> <given-names>A.</given-names></name> <name><surname>Kourtchenko</surname> <given-names>O.</given-names></name> <name><surname>Clarke</surname> <given-names>A. K.</given-names></name> <name><surname>Godhe</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Genome sequence of kordia sp. strain SMS9 identified in a non-axenic culture of the diatom <italic>Skeletonema marinoi</italic></article-title>. <source>J. Genomics</source> <volume>7</volume>, <fpage>46</fpage>&#x02013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.7150/jgen.35061</pub-id><pub-id pub-id-type="pmid">31171940</pub-id></citation></ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinto</surname> <given-names>F.</given-names></name> <name><surname>Pacheco</surname> <given-names>C. C.</given-names></name> <name><surname>Oliveira</surname> <given-names>P.</given-names></name> <name><surname>Montagud</surname> <given-names>A.</given-names></name> <name><surname>Landels</surname> <given-names>A.</given-names></name> <name><surname>Couto</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Improving a Synechocystis-based photoautotrophic chassis through systematic genome mapping and validation of neutral sites</article-title>. <source>DNA Res.</source> <volume>22</volume>, <fpage>425</fpage>&#x02013;<lpage>437</lpage>. <pub-id pub-id-type="doi">10.1093/dnares/dsv024</pub-id><pub-id pub-id-type="pmid">26490728</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pollak</surname> <given-names>B.</given-names></name> <name><surname>Matute</surname> <given-names>T.</given-names></name> <name><surname>Nunez</surname> <given-names>I.</given-names></name> <name><surname>Cerda</surname> <given-names>A.</given-names></name> <name><surname>Lopez</surname> <given-names>C.</given-names></name> <name><surname>Kan</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Universal Loop assembly (uLoop): open, efficient, and species- agnostic DNA fabrication</article-title>. <source>Biorxiv.</source> &#x02018; <pub-id pub-id-type="doi">10.1101/744854</pub-id><pub-id pub-id-type="pmid">32161816</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rajeevkumar</surname> <given-names>S.</given-names></name> <name><surname>Anunanthini</surname> <given-names>P.</given-names></name> <name><surname>Sathishkumar</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>). <article-title>Epigenetic silencing in transgenic plants</article-title>. <source>Front. Plant Sci.</source> <volume>6</volume>:<fpage>693</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2015.00693</pub-id><pub-id pub-id-type="pmid">26442010</pub-id></citation></ref>
<ref id="B89">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Remmers</surname> <given-names>I. M.</given-names></name> <name><surname>D&#x00027;Adamo</surname> <given-names>S.</given-names></name> <name><surname>Martens</surname> <given-names>D. E.</given-names></name> <name><surname>de Vos</surname> <given-names>R. C. H.</given-names></name> <name><surname>Mumm</surname> <given-names>R.</given-names></name> <name><surname>America</surname> <given-names>A. H. P.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Orchestration of transcriptome, proteome and metabolome in the diatom <italic>Phaeodactylum tricornutum</italic> during nitrogen limitation</article-title>. <source>Algal Res.</source> <volume>35</volume>, <fpage>33</fpage>&#x02013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1016/j.algal.2018.08.012</pub-id></citation></ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robertsen</surname> <given-names>E. M.</given-names></name> <name><surname>Kahlke</surname> <given-names>T.</given-names></name> <name><surname>Raknes</surname> <given-names>I. A.</given-names></name> <name><surname>Pedersen</surname> <given-names>E.</given-names></name> <name><surname>Semb</surname> <given-names>E. K.</given-names></name> <name><surname>Ernstsen</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2016</year>). META-pipe - pipeline annotation, analysis and visualization of marine metagenomic sequence data. <fpage>1</fpage>&#x02013;<lpage>22</lpage>. arXiv:1604.04103.</citation></ref>
<ref id="B91">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>J. T.</given-names></name> <name><surname>Thorvaldsd&#x000F3;ttir</surname> <given-names>H.</given-names></name> <name><surname>Winckler</surname> <given-names>W.</given-names></name> <name><surname>Guttman</surname> <given-names>M.</given-names></name> <name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Getz</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Integrative genomics viewer</article-title>. <source>OECD Observ.</source> <volume>29</volume>, <fpage>23</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.1754</pub-id><pub-id pub-id-type="pmid">21221095</pub-id></citation></ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salsman</surname> <given-names>J.</given-names></name> <name><surname>Dellaire</surname> <given-names>G.</given-names></name></person-group> (<year>2016</year>). <article-title>Precision Genome Editing in the CRISPR Era</article-title>. <source>Biochem. Cell Biol.</source> <volume>95</volume>, <fpage>187</fpage>&#x02013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.1139/bcb-2016-0137</pub-id><pub-id pub-id-type="pmid">28177771</pub-id></citation></ref>
<ref id="B93">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sau</surname> <given-names>S.</given-names></name> <name><surname>Ghosh</surname> <given-names>S. K.</given-names></name> <name><surname>Liu</surname> <given-names>Y. T.</given-names></name> <name><surname>Ma</surname> <given-names>C. H.</given-names></name> <name><surname>Jayaram</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Hitchhiking on chromosomes: a persistence strategy shared by diverse selfish DNA elements</article-title>. <source>Plasmid</source> <volume>102</volume>, <fpage>19</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1016/j.plasmid.2019.01.004</pub-id><pub-id pub-id-type="pmid">30726706</pub-id></citation></ref>
<ref id="B94">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scaife</surname> <given-names>M. A.</given-names></name> <name><surname>Smith</surname> <given-names>A. G.</given-names></name></person-group> (<year>2016</year>). <article-title>Towards developing algal synthetic biology</article-title>. <source>Biochem. Soc. Transac.</source> <volume>44</volume>, <fpage>716</fpage>&#x02013;<lpage>722</lpage>. <pub-id pub-id-type="doi">10.1042/BST20160061</pub-id><pub-id pub-id-type="pmid">27284033</pub-id></citation></ref>
<ref id="B95">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serif</surname> <given-names>M.</given-names></name> <name><surname>Dubois</surname> <given-names>G.</given-names></name> <name><surname>Finoux</surname> <given-names>A. L.</given-names></name> <name><surname>Teste</surname> <given-names>M. A.</given-names></name> <name><surname>Jallet</surname> <given-names>D.</given-names></name> <name><surname>Daboussi</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>One-step generation of multiple gene knock-outs in the diatom <italic>Phaeodactylum tricornutum</italic> by DNA-free genome editing</article-title>. <source>Nat. Commun.</source> <volume>9</volume>:<fpage>3924</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-018-06378-9</pub-id><pub-id pub-id-type="pmid">30254261</pub-id></citation></ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serif</surname> <given-names>M.</given-names></name> <name><surname>Lepetit</surname> <given-names>B.</given-names></name> <name><surname>Wei&#x000DF;ert</surname> <given-names>K.</given-names></name> <name><surname>Kroth</surname> <given-names>P. G.</given-names></name> <name><surname>Bartulos</surname> <given-names>C. R.</given-names></name></person-group> (<year>2017</year>). <article-title>A fast and reliable strategy to generate TALEN-mediated gene knockouts in the diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>Algal Res.</source> <volume>23</volume>, <fpage>186</fpage>&#x02013;<lpage>195</lpage>. <pub-id pub-id-type="doi">10.1016/j.algal.2017.02.005</pub-id></citation></ref>
<ref id="B97">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>A. K.</given-names></name> <name><surname>Nymark</surname> <given-names>M.</given-names></name> <name><surname>Sparstad</surname> <given-names>T.</given-names></name> <name><surname>Bones</surname> <given-names>A. M.</given-names></name> <name><surname>Winge</surname> <given-names>P.</given-names></name></person-group> (<year>2018</year>). <article-title>Transgene-free genome editing in marine algae by bacterial conjugation &#x02013; comparison with biolistic CRISPR/Cas9 transformation</article-title>. <source>Sci. Rep.</source> <volume>8</volume>:<fpage>14401</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-32342-0</pub-id><pub-id pub-id-type="pmid">30258061</pub-id></citation></ref>
<ref id="B98">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sheff</surname> <given-names>M. A.</given-names></name> <name><surname>Thorn</surname> <given-names>K. S.</given-names></name></person-group> (<year>2004</year>). <article-title>Optimized cassettes for fluorescent protein tagging in <italic>Saccharomyces cerevisiae</italic></article-title>. <source>Yeast</source> <volume>21</volume>, <fpage>661</fpage>&#x02013;<lpage>670</lpage>. <pub-id pub-id-type="doi">10.1002/yea.1130</pub-id><pub-id pub-id-type="pmid">15197731</pub-id></citation></ref>
<ref id="B99">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shin</surname> <given-names>S-E.</given-names></name> <name><surname>Lim</surname> <given-names>J-M.</given-names></name> <name><surname>Koh</surname> <given-names>H. G.</given-names></name> <name><surname>Kim</surname> <given-names>E. K.</given-names></name> <name><surname>Kang</surname> <given-names>N. K.</given-names></name> <name><surname>Jeon</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>CRISPR/Cas9-induced knockout and knock-in mutations in <italic>Chlamydomonas reinhardtii</italic> SUPP</article-title>. <source>Sci. Rep.</source> <volume>6</volume>:<fpage>27810</fpage>. <pub-id pub-id-type="doi">10.1038/srep27810</pub-id><pub-id pub-id-type="pmid">27291619</pub-id></citation></ref>
<ref id="B100">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slattery</surname> <given-names>S. S.</given-names></name> <name><surname>Diamond</surname> <given-names>A.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Therrien</surname> <given-names>J. A.</given-names></name> <name><surname>Lant</surname> <given-names>J. T.</given-names></name> <name><surname>Jazey</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>An expanded plasmid-based genetic toolbox enables Cas9 genome editing and stable maintenance of synthetic pathways in <italic>Phaeodactylum tricornutum</italic></article-title>. <source>ACS Synth. Biol.</source> <volume>7</volume>, <fpage>328</fpage>&#x02013;<lpage>338</lpage>. <pub-id pub-id-type="doi">10.1021/acssynbio.7b00191</pub-id><pub-id pub-id-type="pmid">29298053</pub-id></citation></ref>
<ref id="B101">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>S. R.</given-names></name> <name><surname>Dupont</surname> <given-names>C. L.</given-names></name> <name><surname>McCarthy</surname> <given-names>J. K.</given-names></name> <name><surname>Broddrick</surname> <given-names>J. T.</given-names></name> <name><surname>Oborn&#x000ED;k</surname> <given-names>M.</given-names></name> <name><surname>Hor&#x000E1;k</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Evolution and regulation of nitrogen flux through compartmentalized metabolic networks in a marine diatom</article-title>. <source>Nat. Commun.</source> <volume>10</volume>:<fpage>4552</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-12407-y</pub-id><pub-id pub-id-type="pmid">31591397</pub-id></citation></ref>
<ref id="B102">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stukenberg</surname> <given-names>D.</given-names></name> <name><surname>Zauner</surname> <given-names>S.</given-names></name> <name><surname>Aquila</surname> <given-names>G. D.</given-names></name> <name><surname>Maier</surname> <given-names>U. G.</given-names></name></person-group> (<year>2018</year>). <article-title>Optimizing CRISPR/Cas9 for the diatom <italic>Phaeodactylum tricornutum</italic></article-title>. <source>Front. Plant Sci.</source> <volume>9</volume>:<fpage>740</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2018.00740</pub-id><pub-id pub-id-type="pmid">29928285</pub-id></citation></ref>
<ref id="B103">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanwar</surname> <given-names>A.</given-names></name> <name><surname>Sharma</surname> <given-names>S.</given-names></name> <name><surname>Kumar</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>Targeted genome editing in algae using CRISPR/Cas9</article-title>. <source>Indian J. Plant Physiol.</source> <volume>23</volume>, <fpage>653</fpage>&#x02013;<lpage>669</lpage>. <pub-id pub-id-type="doi">10.1007/s40502-018-0423-3</pub-id></citation></ref>
<ref id="B104">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Moerkercke</surname> <given-names>A.</given-names></name> <name><surname>Fabris</surname> <given-names>M.</given-names></name> <name><surname>Pollier</surname> <given-names>J.</given-names></name> <name><surname>Baart</surname> <given-names>G. J. E.</given-names></name> <name><surname>Rombauts</surname> <given-names>S.</given-names></name> <name><surname>Hasnain</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>CathaCyc, a metabolic pathway database built from catharanthus roseus RNA-seq data</article-title>. <source>Plant Cell Physiol.</source> <volume>54</volume>, <fpage>673</fpage>&#x02013;<lpage>685</lpage>. <pub-id pub-id-type="doi">10.1093/pcp/pct039</pub-id><pub-id pub-id-type="pmid">23493402</pub-id></citation></ref>
<ref id="B105">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vavitsas</surname> <given-names>K.</given-names></name> <name><surname>Fabris</surname> <given-names>M.</given-names></name> <name><surname>Vickers</surname> <given-names>C. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Terpenoid metabolic engineering in photosynthetic microorganisms</article-title>. <source>Genes</source> <volume>9</volume>:<fpage>520</fpage>. <pub-id pub-id-type="doi">10.3390/genes9110520</pub-id><pub-id pub-id-type="pmid">30360565</pub-id></citation></ref>
<ref id="B106">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>C.</given-names></name> <name><surname>Zada</surname> <given-names>B.</given-names></name> <name><surname>Wei</surname> <given-names>G.</given-names></name> <name><surname>Kim</surname> <given-names>S. W.</given-names></name></person-group> (<year>2017</year>). <article-title>Metabolic engineering and synthetic biology approaches driving isoprenoid production in <italic>Escherichia coli</italic></article-title>. <source>Bioresour. Technol.</source> <volume>241</volume>, <fpage>430</fpage>&#x02013;<lpage>438</lpage>. <pub-id pub-id-type="doi">10.1016/j.biortech.2017.05.168</pub-id><pub-id pub-id-type="pmid">28599221</pub-id></citation></ref>
<ref id="B107">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weyman</surname> <given-names>P. D.</given-names></name> <name><surname>Beeri</surname> <given-names>K.</given-names></name> <name><surname>Lefebvre</surname> <given-names>S. C.</given-names></name> <name><surname>Rivera</surname> <given-names>J.</given-names></name> <name><surname>Mccarthy</surname> <given-names>J. K.</given-names></name> <name><surname>Heuberger</surname> <given-names>A. L.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Inactivation of <italic>Phaeodactylum tricornutum</italic> urease gene using transcription activator-like effector nuclease-based targeted mutagenesis</article-title>. <source>Plant Biotechnol. J.</source> <volume>13</volume>, <fpage>460</fpage>&#x02013;<lpage>470</lpage>. <pub-id pub-id-type="doi">10.1111/pbi.12254</pub-id><pub-id pub-id-type="pmid">25302562</pub-id></citation></ref>
<ref id="B108">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yao</surname> <given-names>Y.</given-names></name> <name><surname>Lu</surname> <given-names>Y.</given-names></name> <name><surname>Peng</surname> <given-names>K. T.</given-names></name> <name><surname>Huang</surname> <given-names>T.</given-names></name> <name><surname>Niu</surname> <given-names>Y. F.</given-names></name> <name><surname>Xie</surname> <given-names>W. H.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Glycerol and neutral lipid production in the oleaginous marine diatom <italic>Phaeodactylum tricornutum</italic> promoted by overexpression of glycerol-3-phosphate dehydrogenase</article-title>. <source>Biotechnol. Biofuels</source> <volume>7</volume>:<fpage>110</fpage>. <pub-id pub-id-type="doi">10.1186/1754-6834-7-110</pub-id></citation></ref>
<ref id="B109">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zurbriggen</surname> <given-names>A.</given-names></name> <name><surname>Kirst</surname> <given-names>H.</given-names></name> <name><surname>Melis</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Isoprene production via the mevalonic acid pathway in <italic>Escherichia coli</italic> (Bacteria)</article-title>. <source>Bioenergy Res.</source> <volume>5</volume>, <fpage>814</fpage>&#x02013;<lpage>828</lpage>. <pub-id pub-id-type="doi">10.1007/s12155-012-9192-4</pub-id></citation></ref>
</ref-list>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>EE</term>
<def><p>Episomal expression</p></def></def-item>
<def-item><term>RICE</term>
<def><p>Randomly integrated chromosomal expression.</p></def></def-item>
</def-list>
</glossary>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This work was funded by the University of Technology Sydney and the CSIRO Synthetic Biology Future Science Platform. JG was supported by a UTS Doctoral Scholarship. MF was supported by a CSIRO Synthetic Biology Future Science Platform Fellowship co-funded by CSIRO and the University of Technology Sydney.</p>
</fn>
</fn-group>
</back>
</article> 