<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2021.655286</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Easymap: A User-Friendly Software Package for Rapid Mapping-by-Sequencing of Point Mutations and Large Insertions</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Lup</surname> <given-names>Samuel Daniel</given-names></name>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/500494/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wilson-S&#x00E1;nchez</surname> <given-names>David</given-names></name>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x2021;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1308601/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Andreu-S&#x00E1;nchez</surname> <given-names>Sergio</given-names></name>
<xref ref-type="author-notes" rid="fn003"><sup>&#x2021;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1077004/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Micol</surname> <given-names>Jos&#x00E9; Luis</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/27992/overview"/>
</contrib>
</contrib-group>
<aff><institution>Instituto de Bioingenier&#x00ED;a, Universidad Miguel Hern&#x00E1;ndez de Elche</institution>, <addr-line>Elche</addr-line>, <country>Spain</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Marcos Egea-Cortines, Universidad Polit&#x00E9;cnica de Cartagena, Spain</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Unai L&#x00F3;pez De Heredia, Polytechnic University of Madrid, Spain; Hiroyuki Koga, The University of Tokyo, Japan</p></fn>
<corresp id="c001">&#x002A;Correspondence: Jos&#x00E9; Luis Micol, <email>jlmicol@umh.es</email></corresp>
<fn fn-type="other" id="fn002"><p><sup>&#x2020;</sup>These authors have contributed equally to this work</p></fn>
<fn fn-type="other" id="fn003"><p><sup>&#x2021;</sup>Present address: David Wilson-S&#x00E1;nchez, Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany; Sergio Andreu-S&#x00E1;nchez, Department of Pediatrics and Genetics, University of Groningen, Groningen, Netherlands</p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Technical Advances in Plant Science, a section of the journal Frontiers in Plant Science</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>05</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>655286</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>01</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>03</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Lup, Wilson-S&#x00E1;nchez, Andreu-S&#x00E1;nchez and Micol.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Lup, Wilson-S&#x00E1;nchez, Andreu-S&#x00E1;nchez and Micol</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Mapping-by-sequencing strategies combine next-generation sequencing (NGS) with classical linkage analysis, allowing rapid identification of the causal mutations of the phenotypes exhibited by mutants isolated in a genetic screen. Computer programs that analyze NGS data obtained from a mapping population of individuals derived from a mutant of interest to identify a causal mutation are available; however, the installation and usage of such programs requires bioinformatic skills, modifying or combining pieces of existing software, or purchasing licenses. To ease this process, we developed Easymap, an open-source program that simplifies the data analysis workflows from raw NGS reads to candidate mutations. Easymap can perform bulked segregant mapping of point mutations induced by ethyl methanesulfonate (EMS) with DNA-seq or RNA-seq datasets, as well as tagged-sequence mapping for large insertions, such as transposons or T-DNAs. The mapping analyses implemented in Easymap have been validated with experimental and simulated datasets from different plant and animal model species. Easymap was designed to be accessible to all users regardless of their bioinformatics skills by implementing a user-friendly graphical interface, a simple universal installation script, and detailed mapping reports, including informative images and complementary data for assessment of the mapping results. Easymap is available at <ext-link ext-link-type="uri" xlink:href="http://genetics.edu.umh.es/resources/easymap">http://genetics.edu.umh.es/resources/easymap</ext-link>; its Quickstart Installation Guide details the recommended procedure for installation.</p>
</abstract>
<kwd-group>
<kwd>bioinformatics</kwd>
<kwd>NGS</kwd>
<kwd>mapping-by-sequencing</kwd>
<kwd>candidate mutations</kwd>
<kwd>forward genetics</kwd>
<kwd>linkage analysis mapping</kwd>
<kwd>bulked segregant analysis</kwd>
</kwd-group>
<contract-sponsor id="cn001">Ministerio de Ciencia e Innovaci&#x00F3;n<named-content content-type="fundref-id">10.13039/501100004837</named-content></contract-sponsor>
<contract-sponsor id="cn002">Generalitat Valenciana<named-content content-type="fundref-id">10.13039/501100003359</named-content></contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="51"/>
<page-count count="10"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Forward genetic screens consist of random mutagenesis followed by the isolation of mutants exhibiting a phenotype of interest, and genetic analysis of these mutants to identify the mutations that cause their phenotypes. Two commonly used mutagenesis strategies are the induction of G&#x2192;A substitutions using the chemical mutagen ethyl methanesulfonate (EMS) (<xref ref-type="bibr" rid="B37">Neuffer and Ficsor, 1963</xref>; <xref ref-type="bibr" rid="B17">James and Dooner, 1990</xref>; <xref ref-type="bibr" rid="B19">Jansen et al., 1997</xref>) and the disruption of genes by insertional mutagens such as transposons or T-DNA (<xref ref-type="bibr" rid="B5">Cooley et al., 1988</xref>; <xref ref-type="bibr" rid="B3">Alonso et al., 2003</xref>; <xref ref-type="bibr" rid="B11">Fr&#x00F8;kj&#x00E6;r-Jensen et al., 2014</xref>). Linkage analysis of molecular markers in segregant mapping populations is the classically preferred approach to map the point mutations induced by a chemical mutagen, carried by mutants isolated in a genetic screen (<xref ref-type="bibr" rid="B33">Michelmore et al., 1991</xref>; <xref ref-type="bibr" rid="B41">Ponce et al., 1999</xref>). By contrast, localization of insertional mutations has relied on methods to capture the genomic sequences present at their flanks (<xref ref-type="bibr" rid="B12">Gasch et al., 1992</xref>; <xref ref-type="bibr" rid="B32">Medford et al., 1992</xref>; <xref ref-type="bibr" rid="B30">Liu et al., 1995</xref>; <xref ref-type="bibr" rid="B40">Ponce et al., 1998</xref>; <xref ref-type="bibr" rid="B39">O&#x2019;Malley et al., 2007</xref>).</p>
<p>The preliminary identification and subsequent validation of the mutations that cause a phenotype of interest is the most laborious and time-consuming step of a forward genetic screen. Next-generation sequencing (NGS) of DNA has facilitated and revitalized such approaches, through the so-called mapping-by-sequencing methods, which combine NGS with linkage analysis (<xref ref-type="bibr" rid="B44">Schneeberger and Weigel, 2011</xref>; <xref ref-type="bibr" rid="B14">Hartwig et al., 2012</xref>; <xref ref-type="bibr" rid="B18">James et al., 2013</xref>; <xref ref-type="bibr" rid="B4">Candela et al., 2015</xref>). Mapping-by-sequencing approaches for the identification of causal mutations are much faster than previous methods but can be hampered by the lack of computing resources and/or accessible software. Currently available programs for mapping-by-sequencing data analysis suffer from one or several of the following issues: they require the purchase of licenses (<xref ref-type="bibr" rid="B45">Smith, 2015</xref>); they require a certain level of bioinformatics skills to use (<xref ref-type="bibr" rid="B1">Abe et al., 2012</xref>; <xref ref-type="bibr" rid="B10">Fekih et al., 2013</xref>; <xref ref-type="bibr" rid="B21">Jiang et al., 2015</xref>; <xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>; <xref ref-type="bibr" rid="B6">Ecovoiu et al., 2016</xref>; <xref ref-type="bibr" rid="B48">Wachsman et al., 2017</xref>; see also <ext-link ext-link-type="uri" xlink:href="https://sourceforge.net/projects/mimodd/">https://sourceforge.net/projects/mimodd/</ext-link>); they only do a part of the computing tasks required for a mapping-by-sequencing experiment (<xref ref-type="bibr" rid="B25">Li et al., 2009</xref>; <xref ref-type="bibr" rid="B24">Langmead and Salzberg, 2012</xref>; <xref ref-type="bibr" rid="B16">Hill et al., 2013</xref>); they are designed for a specific type of mutation or mapping strategy (<xref ref-type="bibr" rid="B13">Gonzalez et al., 2013</xref>; <xref ref-type="bibr" rid="B8">Ewing, 2015</xref>; <xref ref-type="bibr" rid="B15">H&#x00E9;naff et al., 2015</xref>; <xref ref-type="bibr" rid="B46">Solaimanpour et al., 2015</xref>; <xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>; <xref ref-type="bibr" rid="B48">Wachsman et al., 2017</xref>; <xref ref-type="bibr" rid="B23">Klein et al., 2018</xref>; <xref ref-type="bibr" rid="B20">Javorka et al., 2019</xref>); they are hosted at a public server but usage is limited (<xref ref-type="bibr" rid="B13">Gonzalez et al., 2013</xref>; <xref ref-type="bibr" rid="B2">Afgan et al., 2018</xref>); or they can no longer be accessed or used (<xref ref-type="bibr" rid="B34">Minevich et al., 2012</xref>; <xref ref-type="bibr" rid="B42">Pulido-Tamayo et al., 2016</xref>).</p>
<p>Here, we describe Easymap, an accessible graphical-interface program that analyses NGS data from mapping populations derived through a variety of experimental designs from either insertional or EMS-induced mutants. This software package avoids the aforementioned issues, enabling mapping-by-sequencing experiments to be conducted by researchers with minimal bioinformatics experience. Easymap features a web-based graphic interface, a simple installation script, robust mapping analyses for several experimental designs, and thorough user-oriented mapping reports.</p>
</sec>
<sec id="S2">
<title>Results</title>
<sec id="S2.SS1">
<title>Mapping Strategies for Which Easymap Can Be Used</title>
<p>Easymap offers the user two alternative workflows, to map point and insertional mutations (Workflows 1 and 2; see next section), as well as a set of complementary tasks common to all analyses. <xref ref-type="fig" rid="F1">Figure 1</xref> offers an overview of both mapping strategies, from the initial selection of the mutants of interest to the output that Easymap generates. To ease both mapping scenarios, Easymap automates the whole data analysis process requiring no user intervention with alignment, variant-calling, or filtering parameters.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Overview of two typical mapping-by-sequencing experiments with Easymap in <italic>Arabidopsis</italic>. <bold>(A)</bold> Experimental design. For EMS-induced mutants, an outcross or backcross is first performed. The F<sub>1</sub> plants derived from the cross are selfed, and the resulting F<sub>2</sub> is screened for the mutant phenotype to create a phenotypically mutant mapping population. A control sample is required for the mapping analysis, which can be either one of the parental individuals crossed or, alternatively, a pool of phenotypically wild-type F<sub>2</sub> individuals. For mapping of large insertions, the DNA of different insertional mutant lines can be sequenced individually or pooled, and no control sample is required. <bold>(B)</bold> Input files. Easymap takes NGS paired-end or single-end short reads as input. The remaining mandatory input files are available on public databases for each model species. <bold>(C)</bold> Easymap workflows. The user selects the experimental design used for mutation mapping from a variety of options for both EMS mutation mapping (backcross and outcross strategies, alternative control samples) and tagged-sequence mapping (paired-end and single-end reads). <bold>(D)</bold> Output. Easymap produces comprehensive mapping reports with organized tabular data to ease interpretation of the results. As an example of EMS-induced mutations, data from the <italic>Arabidopsis suppressor of overexpression of CONSTANS 1-2</italic> (<italic>soc1-2</italic>) mutant (<xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>) was used for this figure. Allele frequency (AF) versus position plots are drawn for each chromosome containing the polymorphisms used for the analysis. A candidate region is highlighted in pink; all putative EMS-type mutations contained in this region are regarded as candidates, and their position and relevant information are presented in a table, such as the gene affected by the mutation. For each gene affected by a candidate mutation, a gene plot is made in which the position of the mutation is shown, followed by further information (genotyping primers, flanking sequences, functional annotation, etc.). As an example of large insertion mapping, the figure includes data from an unpublished mapping experiment made in our laboratory (see <xref ref-type="table" rid="T2">Table 2</xref>). A genomic overview is drawn showing the positions of the insertions found. Read depth (RD) histograms are generated for each read cluster pointing to an insertion site showing the information supporting the insertion. Finally, a gene plot is made for each gene interrupted by an insertion.</p></caption>
<graphic xlink:href="fpls-12-655286-g001.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Validation of point-mutation mapping strategies using published, real experimental data.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Genetic background of the mutant</bold></td>
<td valign="top" align="center"><bold>Mapping strategy</bold></td>
<td valign="top" align="left"><bold>Control sample</bold></td>
<td valign="top" align="center"><bold>Mutant</bold></td>
<td valign="top" align="left"><bold>Sample type</bold></td>
<td valign="top" align="left"><bold>Results obtained by Easymap</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Same as reference sequence</td>
<td valign="top" align="center">Backcross/M<sub>2</sub></td>
<td valign="top" align="left">Wild-type parental strain</td>
<td valign="top" align="center"><italic>hasty</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM<sup><italic>a</italic></sup> is the only candidate<sup><italic>b</italic></sup> within the CI<sup><italic>c</italic></sup></td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>ago1-25</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and 2 more candidates within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>alp1-3</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and one more candidate within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>gl3</italic></td>
<td valign="top" align="left">RNA</td>
<td valign="top" align="left">Correct CI, CM undetectable with our methods</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="left">Phenotypically wild-type siblings</td>
<td valign="top" align="center"><italic>atgr2</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and 3 more candidates within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>shortroot</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and 2 more candidates within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>rth6</italic></td>
<td valign="top" align="left">RNA</td>
<td valign="top" align="left">Correct CI, CM unknown in the original paper</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">Outcross</td>
<td valign="top" align="left">Wild-type parental strain</td>
<td valign="top" align="center"><italic>rol-6(su1006)</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">Correct CI, CM unknown in the original paper</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="left">Genetically distant strain crossed to the mutant</td>
<td valign="top" align="center"><italic>soc1-2</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and 6 more candidates within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>rol-6(su1006)</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">Correct CI, CM unknown in the original paper</td>
</tr>
<tr>
<td valign="top" align="left">Different from reference sequence</td>
<td valign="top" align="center">Backcross/M<sub>2</sub></td>
<td valign="top" align="left">Phenotypically wild-type siblings</td>
<td valign="top" align="center"><italic>icu11-1</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM is the only candidate within the CI</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>jj410</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">Correct CI, CM lost due to very low coverage</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>gl13</italic></td>
<td valign="top" align="left">RNA</td>
<td valign="top" align="left">Correct CI, CM is fine-mapped in later experiments</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">Outcross</td>
<td valign="top" align="left">Wild-type parental strain</td>
<td valign="top" align="center"><italic>HKT2.4</italic> allele</td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">Correct CI, CM lost due to experimental design</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center"><italic>ten</italic></td>
<td valign="top" align="left">DNA</td>
<td valign="top" align="left">CM and 70 more candidates within the CI</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic><sup><italic>a</italic></sup>CM, causal mutation. <sup><italic>b</italic></sup>Candidate: only protein-altering mutations are considered. <sup><italic>c</italic></sup>CI: candidate interval defined by Easymap. See <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref> for more detailed information.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Validation of large-insertion mapping strategies with real experimental data.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>References</bold></td>
<td valign="top" align="left"><bold>Sequenced sample</bold></td>
<td valign="top" align="left"><bold>Results obtained by Easymap</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B50">Wilson-S&#x00E1;nchez et al. (2014)</xref></td>
<td valign="top" align="left">Pool of 10 SALK mutants</td>
<td valign="top" align="left">17 of 19 known insertions were detected by Easymap; the remaining 2 were filtered out as false positives due to very few supporting reads (average RD per sample was 4.5&#x00D7;, instead of the recommended 10&#x00D7;)</td>
</tr>
<tr>
<td valign="top" align="left">Lup and Micol, unpublished</td>
<td valign="top" align="left">Pool of 6 SALK mutants</td>
<td valign="top" align="left">9 insertions detected</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B51">Yang et al. (2013)</xref></td>
<td valign="top" align="left">Mutagenized T1c-19 rice</td>
<td valign="top" align="left">2 of 2 insertions detected; 2 clearly distinguishable false positives</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="left">Mutagenized TT51-1 rice</td>
<td valign="top" align="left">2 of 2 insertions detected; 47 clearly distinguishable false positives due to the presence of an endogenous sequence in the insertion sequence</td>
</tr>
<tr>
<td valign="top" align="left"><xref ref-type="bibr" rid="B28">Li W.X. et al. (2016)</xref></td>
<td valign="top" align="left">Mutant T027</td>
<td valign="top" align="left">2 of 2 insertions detected; 1 false positive common to all lines from this article (omitted below)</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="left">Mutant T182</td>
<td valign="top" align="left">1 of 1 insertion detected; 1 false positive</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="left">Mutant T204</td>
<td valign="top" align="left">1 of 1 insertion detected</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="left">Mutant T273</td>
<td valign="top" align="left">1 of 1 insertion detected split in two clusters due to a large deletion in the mutagenized genome</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="left">Mutant T400</td>
<td valign="top" align="left">1 of 1 insertion detected</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>See <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref> for more detailed information.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<sec id="S2.SS1.SSS1">
<title>Workflow 1: Linkage Analysis Mapping</title>
<p>Mapping of causal EMS-induced mutations is typically achieved by linkage analysis in a bulked segregant population. The user must obtain a mapping population by phenotyping the M<sub>2</sub> offspring of an M<sub>1</sub> individual or the F<sub>2</sub> offspring of a backcross (in which an M<sub>2</sub> mutant is crossed to an individual genetically identical to the parent subjected to mutagenesis) or an outcross (in which an M<sub>2</sub> mutant is crossed to an individual unrelated and genetically polymorphic to the parent that was subjected to mutagenesis).</p>
<p>Easymap includes the splice-aware aligner HISAT2 (<xref ref-type="bibr" rid="B22">Kim et al., 2019</xref>), which is three times faster than commonly used aligners in default conditions with no impact on memory usage or sensitivity. The implementation of HISAT2 allows the user to input RNA-seq and DNA-seq reads indistinctly for point mutation mapping. Easymap requires NGS reads from test and control samples. The test sample consists of NGS reads obtained from a population of individuals exhibiting the mutant phenotype of interest, hence homozygous for a recessive mutation that causes that phenotype. The control sample can be pooled M<sub>2</sub> or F<sub>2</sub> phenotypically wild-type individuals, or individuals genetically identical to the parent that was subjected to mutagenesis, or the strain to which the mutant is outcrossed. A minimum coverage of 25&#x00D7; is recommended for each sample, although since coverage directly correlates to the reliability of the results, higher coverages are encouraged. <xref ref-type="table" rid="T1">Table 1</xref> and the Easymap Documentation (<xref ref-type="supplementary-material" rid="DS2">Supplementary Data Sheet 2</xref>) describe the different experimental designs supported by Easymap, four of which are detailed in <xref ref-type="fig" rid="F2">Figure 2</xref>. Easymap first calls the single-nucleotide polymorphisms (SNPs) between the reads obtained by the user and the reference sequence for the genome of the species under study, and identifies high-confidence SNPs that are informative for mapping. The allele frequencies (AFs) of these biallelic markers are then averaged in overlapping sliding windows throughout the genome, and the average AF values are corrected by comparing adjacent windows using a weighted average approach to reduce noise and randomly generated peaks, taking into account up to six adjacent windows for each correction. Easymap then analyses the revised AF values to select the window with the highest value, which is the most likely to contain the causal mutation. The center of the window defines the center of the mapping region, and a candidate interval of 4, 10, or 20 Mb is set depending on the size of the input genome. The SNPs in the candidate region are then analyzed and reported as candidates to be the mutation causing the phenotype under study. Although Easymap has been designed to map EMS mutations, its mapping report has a link to a file containing all the mutations found in the genome and their potential impact on genes and the products encoded by these genes. This file includes mutations that are not those typically produced by EMS. The Easymap documentation (<xref ref-type="supplementary-material" rid="DS2">Supplementary Data Sheet 2</xref>) contains more detailed information about the SNP selection and mapping algorithms implemented.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Some strategies for EMS-induced mutation mapping implemented in Easymap. <bold>(A&#x2013;D)</bold> The input reads are processed into control and test SNP lists. The lists are contrasted to determine the SNPs that can be informative for mapping, which are subjected to an allelic frequency (AF) analysis to find the mapping region. A candidate region is defined around the center of the mapping region, and the potentially causal SNPs within the candidate region are collected as candidate SNPs. <bold>(A)</bold> For a mutant strain obtained in the reference genetic background, a backcross is performed to obtain the mapping population and the control sample used is the parental of the mutagenized line. <bold>(B)</bold> For a mutant obtained in the reference genetic background, an outcross is performed to obtain the mapping population, and the control sample is the polymorphic wild-type parent. <bold>(C)</bold> For a mutant obtained in a non-reference strain, a backcross is performed to obtain the mapping population, and the control sample used is a pool of phenotypically wild-type F<sub>2</sub> individuals. <bold>(D)</bold> For a mutant obtained in a non-reference strain, an outcross is performed to obtain the mapping population, and the control sample is the parent of the mutagenized line. <bold>(E&#x2013;H)</bold> Selection of the experimental design corresponding to panels <bold>(A&#x2013;D)</bold> in the multiple-choice selectors of the graphic interface of Easymap.</p></caption>
<graphic xlink:href="fpls-12-655286-g002.tif"/>
</fig>
</sec>
<sec id="S2.SS1.SSS2">
<title>Workflow 2: Tagged Sequence Mapping</title>
<p>Easymap uses a tagged-sequence strategy to map the positions of large DNA insertions of known sequence (<xref ref-type="fig" rid="F3">Figure 3</xref>). The user has to obtain paired-end (e.g., Illumina-like) or single-end (e.g., Ion Proton-like) NGS reads from a mutant carrying an insertion of partially or completely known sequence. In previous works, we used simulations to assess that a 5&#x00D7; read depth can be sufficient to map most insertions in a sample using the methods that we implemented in Easymap (<xref ref-type="bibr" rid="B49">Wilson-S&#x00E1;nchez et al., 2019</xref>); however, as higher read depths directly translate to more reliable results, a minimum read depth of 10&#x00D7; is advisable. Easymap can also use reads from multiple mutants pooled into a single DNA sample, in which case the minimum read depth recommended is 10&#x00D7; per pooled mutant. The program finds reads that overlap the left and right junctions of a given insertion, as well as unpaired alignments neighboring the insertion site; then it uses them as probes against the whole genome sequence (<xref ref-type="fig" rid="F3">Figure 3</xref>). Bowtie2 is used as the alignment tool for this workflow, because local alignment of reads is currently not compatible with HISAT2, which favors spliced alignment of reads. If several hits accumulate around a locus, its physical position is reported as a putative insertion site.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Large insertion mapping with Easymap. <bold>(A&#x2013;C)</bold> Local alignment analysis. <bold>(A)</bold> The DNA insert appears in blue, over genomic DNA in gray. Individual reads are taken from the mutant genome. <bold>(B)</bold> The reads are aligned to the insertion sequence. Locally aligned reads (e.g., 1) are selected and sorted according to the end that is truncated (in blue and green). <bold>(C)</bold> The selected reads are aligned to the genomic reference sequence. The blue triangle indicates the position of the insertion in the mutant genome. <bold>(D&#x2013;F)</bold> Paired-read analysis. <bold>(D)</bold> Paired reads are taken from the mutant genome. <bold>(E)</bold> The reads are aligned to the insertion sequence. Unaligned reads with aligned mates (e.g., 2) are selected and sorted according to their position in relation to the insertion (in blue and green). <bold>(F)</bold> The selected reads are aligned to the reference sequence, delimiting a candidate region for the insertion site. <bold>(G,H)</bold> Read depth histograms for examples of local alignment <bold>(G)</bold> and paired-read analyses <bold>(H)</bold>. <bold>(G)</bold> False-positive insertion, characterized by low overall read depths and disorganized data. <bold>(H)</bold> True-positive insertion, characterized by high read depths and organized data.</p></caption>
<graphic xlink:href="fpls-12-655286-g003.tif"/>
</fig>
<p>If more than one insertional event is detected in a given mutant, the aforementioned experimental design and analysis workflow cannot discriminate the insertion causing the mutant phenotype. Such a mutant, however, can also be crossed and analyzed as described in Workflow 1. The mapping report from Workflow 2 includes a histogram that shows the distribution of the data supporting each putative insertion (<xref ref-type="fig" rid="F1">Figure 1D</xref>). The user can inspect the histograms visually to easily discern false positives [disorganized clusters with very low accumulated read depths (RDs)] from genuine insertions (clusters of organized data with a high number of accumulated RDs).</p>
</sec>
<sec id="S2.SS1.SSS3">
<title>Complementary Tasks</title>
<p>An Easymap run automatically performs several essential tasks such as input data quality controls, including the verification of the FastQ encoding and quality, and assessment of the RD distribution for each sample. After each analysis, Easymap creates a comprehensive report containing high-resolution images and tabular data to assist the user in interpreting the mapping results (e.g., allele frequency plots for EMS-induced mutations, RD histograms for insertional mutations, and gene plots for each putatively damaged gene; <xref ref-type="fig" rid="F1">Figure 1D</xref>), a prediction of the functional effect of the candidate mutations, the flanking sequences of each mutation, and the sequences of oligonucleotide primers to genotype such mutation.</p>
<p>Next-generation sequencing experiment simulations can be helpful for optimizing the design of effective mapping experiments (<xref ref-type="bibr" rid="B18">James et al., 2013</xref>; <xref ref-type="bibr" rid="B49">Wilson-S&#x00E1;nchez et al., 2019</xref>). Therefore, Easymap includes a built-in experiment simulator that allows the user to simulate NGS data to test different mapping designs and parameters.</p>
</sec>
</sec>
<sec id="S2.SS2">
<title>Assessment of Easymap Performance With Simulated and Real Data</title>
<p>We tested Easymap with tens of simulated datasets for each mapping strategy and analysis workflow supported by the program. This allowed us to hard-code appropriate values for analysis variables (e.g., SNP filtering thresholds) under different experimental conditions, saving the user from having to set complex parameters. However, to add more flexibility to the analysis, Workflow 1 allows the user to choose between two levels of stringency for SNP selection. We tested Easymap with data from real mapping-by-sequencing experiments; <xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T2">2</xref> show the results that we obtained when analyzing reads from a range of previously published mutants. We reproduced previously published results, demonstrating the reliability of Easymap even under extreme conditions with average read depths as low as 5&#x00D7; (<xref ref-type="bibr" rid="B38">Obholzer et al., 2012</xref>; <xref ref-type="bibr" rid="B50">Wilson-S&#x00E1;nchez et al., 2014</xref>). The mapping reports for each of these experiments are available in our preview Easymap installation<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> and additional information about these experiments is provided in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>. Among the data used for testing the Easymap linkage analysis mapping workflows, we employed NGS reads obtained from mutants of <italic>Arabidopsis thaliana</italic> (<xref ref-type="bibr" rid="B35">Morel et al., 2002</xref>; <xref ref-type="bibr" rid="B14">Hartwig et al., 2012</xref>; <xref ref-type="bibr" rid="B43">Rishmawi et al., 2014</xref>; <xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>; <xref ref-type="bibr" rid="B48">Wachsman et al., 2017</xref>; <xref ref-type="bibr" rid="B31">Mateo-Bonmat&#x00ED; et al., 2018</xref>), <italic>Zea mays</italic> (<xref ref-type="bibr" rid="B29">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="B27">Li et al., 2013</xref>; <xref ref-type="bibr" rid="B26">Li L. et al., 2016</xref>; <xref ref-type="bibr" rid="B23">Klein et al., 2018</xref>), <italic>Caenorhabditis elegans</italic> (<xref ref-type="bibr" rid="B9">Fay and Spencer, 2006</xref>), and <italic>Danio rerio</italic> (<xref ref-type="bibr" rid="B38">Obholzer et al., 2012</xref>), which included F<sub>2</sub> (<xref ref-type="bibr" rid="B14">Hartwig et al., 2012</xref>; <xref ref-type="bibr" rid="B38">Obholzer et al., 2012</xref>; <xref ref-type="bibr" rid="B43">Rishmawi et al., 2014</xref>; <xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>; <xref ref-type="bibr" rid="B23">Klein et al., 2018</xref>; <xref ref-type="bibr" rid="B31">Mateo-Bonmat&#x00ED; et al., 2018</xref>) and M<sub>2</sub>/M<sub>3</sub> (<xref ref-type="bibr" rid="B48">Wachsman et al., 2017</xref>) mapping populations obtained to identify recessive mutations, as well as dominant mutations mapped in F<sub>2</sub> after an F<sub>3</sub> screening (<xref ref-type="bibr" rid="B9">Fay and Spencer, 2006</xref>), and RNA-seq datasets (<xref ref-type="bibr" rid="B29">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="B27">Li et al., 2013</xref>; <xref ref-type="bibr" rid="B26">Li L. et al., 2016</xref>; <xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). Among the mentioned datasets, we included the two experiments used for the validation of the mapping tool SHOREmap v3.0 (<xref ref-type="bibr" rid="B47">Sun and Schneeberger, 2015</xref>), a backcross and an outcross approach corresponding to the <italic>hasty</italic> and <italic>soc1</italic> alleles, respectively (<xref ref-type="table" rid="T1">Table 1</xref>). The candidates reported by Easymap are comparable with the ones produced by SHOREmap. With Easymap, we identified two candidate mutations for <italic>hasty</italic>, while SHOREmap reported three (the missing candidate is outside the candidate region delimited by Easymap). Whereas SHOREmap reported two mutations of interest for <italic>soc1-2</italic>, Easymap identified seven mutations within a broader candidate region.</p>
<p>For tagged sequence mapping, we reproduced previous results for <italic>Arabidopsis thaliana</italic> (<xref ref-type="bibr" rid="B50">Wilson-S&#x00E1;nchez et al., 2014</xref>; <xref ref-type="bibr" rid="B28">Li W.X. et al., 2016</xref>) and <italic>Oryza sativa</italic> (<xref ref-type="bibr" rid="B51">Yang et al., 2013</xref>) mutants; we also analyzed an unpublished dataset obtained in our laboratory (<xref ref-type="table" rid="T2">Table 2</xref> and <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>).</p>
</sec>
<sec id="S2.SS3">
<title>Easymap Architecture</title>
<p>We developed Easymap for UNIX-based operating systems since high-performance machines most commonly run Linux, and several tasks in Easymap are performed by third-party software that has already been extensively tested and used in Linux. These open-source, publicly available programs obtained by previous authors are listed in <xref ref-type="table" rid="T3">Table 3</xref>.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Third-party software packages included in Easymap.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Workflow</bold></td>
<td valign="top" align="center"><bold>Software</bold></td>
<td valign="top" align="left"><bold>References</bold></td>
<td valign="top" align="left"><bold>General usage and parameters</bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Linkage analysis mapping</td>
<td valign="top" align="center">HISAT2</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B22">Kim et al. (2019)</xref></td>
<td valign="top" align="left">hisat2-build for genome indexing; hisat2 with default options for paired-end or single-end read alignment</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">SAMtools</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B25">Li et al. (2009)</xref></td>
<td valign="top" align="left">samtools sort to convert BAM files to SAM; samtools mpileup for first step in variant calling; with arguments: -t DP, ADF, ADR for specific output formatting of VCF file -C50 to fix overestimated mapping qualities (high stringency mode)</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">BCFtools</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B36">Narasimhan et al. (2016)</xref></td>
<td valign="top" align="left">bcftools call for second step in variant calling, with argument -mv to report only polymorphic sites</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">HTSlib</td>
<td valign="top" align="left"><ext-link ext-link-type="uri" xlink:href="http://www.htslib.org/">http://www.htslib.org/</ext-link></td>
<td valign="top" align="left">A dependency for BCFtools</td>
</tr>
<tr>
<td valign="top" align="left">Tagged sequence mapping</td>
<td valign="top" align="center">Bowtie 2</td>
<td valign="top" align="left"><xref ref-type="bibr" rid="B24">Langmead and Salzberg (2012)</xref></td>
<td valign="top" align="left">bowtie2-build for genome indexing; bowtie2 with default options for paired-end read alignment; bowtie2 -local for local alignment of paired-end or single-end reads</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Easymap comprises a software stack consisting of a controller layer, a workflow layer (representing linkage analysis mapping, tagged sequence mapping, and common processes), and a tasks layer (representing custom and third-party programs). The controller exposes a simple application programming interface (API) with which the web and command line scripts interact. This allows Easymap to be installed either locally or remotely while offering simultaneous command-line and graphical interface access (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figure 1</xref>).</p>
<p>To simplify Easymap installation, a single script compiles and/or installs all required software (Python2, Python Imaging Library, Virtualenv, Bowtie2, HISAT2, HTSlib, SAMtools, and BCFtools; <xref ref-type="table" rid="T3">Table 3</xref>) within the Easymap directory. All third-party software is included within the Easymap package so that no dependencies are required. For installations in shared environments, usage (memory and number of concurrent jobs) can be limited by the system administrator through a simple configuration file. The installation script sets up a dedicated HTTP server to run as a background process using the port chosen by the user. Easymap implements chunked file transfers for reliable HTTP transfer of large read files. Further installation setups and usage indications can be found in the Easymap documentation (<xref ref-type="supplementary-material" rid="DS2">Supplementary Data Sheet 2</xref>).</p>
</sec>
</sec>
<sec id="S3">
<title>Discussion</title>
<p>After the advent of NGS, mapping-by-sequencing strategies quickly became the most attractive methods for mutation mapping. Not few software tools have been developed for this purpose, most of which are not easy to manage for researchers without a background in bioinformatics. We attempted to identify the main accessibility issues of such tools and developed Easymap, a program for mapping-by-sequencing that can be used reliably by as many researchers as possible, irrespective of their computer skills, and in as many experimental designs as possible.</p>
<p>The main accessibility features of Easymap are the following: it is free and open source; a single command installs the software and launches the server for the graphical interface; it is easy to use, as it has a graphical interface and workflows that smoothly convert raw data into comprehensive yet simple reports; it is polyvalent, because it can be used for a wide variety of experimental setups (<xref ref-type="table" rid="T4">Table 4</xref>); it is flexible, because it can be installed locally or remotely (on a server) while maintaining its graphical interface (<xref ref-type="supplementary-material" rid="PS1">Supplementary Figures 1</xref>, <xref ref-type="supplementary-material" rid="PS2">2</xref>).</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Experimental designs supported by different open-source programs used for mapping-by-sequencing.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"><bold>Mutations under study</bold></td>
<td valign="top" align="center"><bold>Mapping design</bold></td>
<td valign="top" align="left"><bold>Control sample</bold></td>
<td valign="top" align="center" colspan="5"><bold>Software</bold><hr/></td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="left"><bold>SHOREmap<sup>1</sup> artMAP<sup>2</sup></bold></td>
<td valign="top" align="left"><bold>SIMPLE<sup>3</sup></bold></td>
<td valign="top" align="left"><bold>CandiSNP<sup>4</sup></bold></td>
<td valign="top" align="left"><bold>Jitterbug<sup>5</sup> ITIS<sup>6</sup></bold></td>
<td valign="top" align="left"><bold>Easymap<sup>7</sup></bold></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Point mutations</td>
<td valign="top" align="center">Backcross</td>
<td valign="top" align="left">Parental line</td>
<td valign="top" align="left">D</td>
<td valign="top" align="justify"/>
<td valign="top" align="center">D</td>
<td valign="top" align="justify"/>
<td valign="top" align="center">D/R</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">Phenotypically wild-type F<sub>2</sub> or M<sub>2</sub></td>
<td valign="top" align="justify"/>
<td valign="top" align="center">D</td>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">D/R</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="center">Outcross</td>
<td valign="top" align="left">Parental line</td>
<td valign="top" align="center">D</td>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">D/R</td>
</tr>
<tr>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">Phenotypically wild-type F<sub>2</sub> or M<sub>2</sub></td>
<td valign="top" align="justify"/>
<td valign="top" align="center">D</td>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">D/R</td>
</tr>
<tr>
<td valign="top" align="left">Large insertions</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="left">&#x2013;</td>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="justify"/>
<td valign="top" align="center">D</td>
<td valign="top" align="center">D</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic>The capabilities of some current mapping tools are compared, D and R indicating an experimental design supported by the software (with DNA-seq or RNA-seq data, respectively). <sup>1</sup><xref ref-type="bibr" rid="B47">Sun and Schneeberger (2015)</xref>. <sup>2</sup><xref ref-type="bibr" rid="B20">Javorka et al. (2019)</xref>. <sup>3</sup><xref ref-type="bibr" rid="B48">Wachsman et al. (2017)</xref>. <sup>4</sup><xref ref-type="bibr" rid="B7">Etherington et al. (2014)</xref>. <sup>5</sup><xref ref-type="bibr" rid="B15">H&#x00E9;naff et al. (2015)</xref>. <sup>6</sup><xref ref-type="bibr" rid="B21">Jiang et al. (2015)</xref>. <sup>7</sup>This work.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>The implementation of the HISAT2 aligner allows the use of RNA-seq data for mutation mapping in large genomes, making Easymap the first program of its class to allow performing mapping-by-sequencing with large genomes for which whole genome sequencing may not be affordable. Mapping-by-sequencing approaches with RNA-seq datasets are generally limited to the identification of a candidate region because it is considered likely that the causal mutation of the phenotype under study will not be represented in the sequenced transcriptome, as a consequence of low or null transcription of the mutated gene.</p>
<p>Easymap proved to be reliable under a wide variety of experimental designs, in five different plant (<italic>A. thaliana</italic>, <italic>Z. mays</italic>, and <italic>O. sativa</italic>) and animal (<italic>D. rerio</italic> and <italic>C. elegans</italic>) species and a total of 28 experiments showing unprecedented versatility and adaptability to the input data. While Easymap facilitates mutation mapping, the identification of a causal mutation will inevitably require validation of the candidates via experimental approaches such as allelic complementation tests and phenotypic rescue assays. Furthermore, the linkage analysis workflow will point to the mapping region but will not report any insertions, deletions, or genomic rearrangements. If the causal mutation under study is not a typical point mutation, additional analyses will be required to map such mutations within the candidate region.</p>
<p>In conclusion, here we introduce Easymap, a novel analysis tool for mapping-by-sequencing of large insertions and point mutations, designed to broaden the access to mapping-by-sequencing approaches to a larger number of potential users. Easymap features a web-based graphic interface, a simple installation script, robust mapping analyses for several experimental designs, and thorough user-oriented mapping reports. A preview instance of Easymap is available at <ext-link ext-link-type="uri" xlink:href="http://atlas.umh.es/genetics">http://atlas.umh.es/genetics</ext-link>. This preview contains the mapping reports for all the experiments used during the validation of our program (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>); however, it is not a functional installation of Easymap. The Easymap source code is available for download at <ext-link ext-link-type="uri" xlink:href="https://github.com/MicolLab/easymap">https://github.com/MicolLab/easymap</ext-link> and <ext-link ext-link-type="uri" xlink:href="http://genetics.edu.umh.es/resources/easymap">http://genetics.edu.umh.es/resources/easymap</ext-link>. However, we recommend to install the program by following our Quickstart Installation Guide (<xref ref-type="supplementary-material" rid="DS1">Supplementary Data Sheet 1</xref>).</p>
</sec>
<sec id="S4">
<title>Methods</title>
<sec id="S4.SS1">
<title>Programming Languages and Utilities</title>
<p>Easymap was designed as a modular program, so that each module can be used and modified independently. Modules include custom Python2 scripts and third-party software packages (<xref ref-type="table" rid="T3">Table 3</xref>). Modules are run sequentially by different Bash scripts, or workflows, attending to the user preferences as defined in the web interface or the command line interface.</p>
<p>Python source and libraries are installed within a Virtualenv virtual environment so that previous software installations are not disturbed. The Easymap server is launched using the Python2 CGIHTTPServer function to set up the web interface. The Pillow imaging library is used for the generation of the graphic output.</p>
</sec>
<sec id="S4.SS2">
<title>Testing</title>
<p>We tested Easymap in several operating systems, including different distributions of Linux such as Ubuntu, Fedora, Red Hat, and AMI. Easymap can also run within the Ubuntu apps available in the Windows 10 Microsoft Store. Easymap runs on regular desktop computers and high-performance machines, in local machines and remote instances (e.g., the Amazon Elastic Compute Cloud service), and also in virtual machines running UNIX-based OS within Windows or Mac OS. Appendix D of the Easymap user manual (<xref ref-type="supplementary-material" rid="DS2">Supplementary Data Sheet 2</xref>) includes detailed information for different installation setups.</p>
<p>Easymap&#x2019;s performance obviously depends on the performance of the machine where it is installed, as well as on the size of the genome being analyzed and the size of input reads. A typical Easymap run can take from hours to days, depending on these variables. As a general recommendation, we suggest using machines with a minimum of 4 Gb of RAM and twice the size of all input reads of available disk storage. Analysis of a typical <italic>Arabidopsis</italic> backcross dataset with 50&#x00D7; test and control reads will take around 6 h to complete and use 4 Gb of RAM memory in a desktop computer. Easymap allows executing multiple projects simultaneously, but this can affect the overall performance of a desktop machine. Easymap does not allow multithreading by default for the sake of simplifying its usage; however, advanced users can find instructions to implement multithreading within the Easymap Documentation.</p>
</sec>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="S8">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>JLM obtained funding, provided resources, and supervised the work. DW-S, SDL, and JLM conceived and designed the program and wrote the article. DW-S and SDL developed the program. SA-S contributed a number of Python scripts. SDL tested the software with real datasets. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work was supported by grants from the Ministerio de Ciencia e Innovaci&#x00F3;n of Spain [PGC2018-093445-B-I00 (MCI/AEI/FEDER, UE)] and the Generalitat Valenciana (PROMETEO/2019/117), to JLM. SDL held a predoctoral fellowship (ACIF/2018/005) from the Generalitat Valenciana.</p>
</fn>
</fn-group>
<sec id="S8" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2021.655286/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpls.2021.655286/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Presentation_1.PDF" id="PS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 1</label>
<caption><p>Easymap architecture overview.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Presentation_1.PDF" id="PS2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 2</label>
<caption><p>Easymap graphic interface to design and run a new project.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Table_1.XLSX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Table 1</label>
<caption><p>Results obtained in the validation of Easymap using multiple NGS datasets.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="DS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Data Sheet 1</label>
<caption><p>Easymap Quickstart Guide.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_2.pdf" id="DS2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Data Sheet 2</label>
<caption><p>Easymap documentation.</p></caption>
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abe</surname> <given-names>A.</given-names></name> <name><surname>Kosugi</surname> <given-names>S.</given-names></name> <name><surname>Yoshida</surname> <given-names>K.</given-names></name> <name><surname>Natsume</surname> <given-names>S.</given-names></name> <name><surname>Takagi</surname> <given-names>H.</given-names></name> <name><surname>Kanzaki</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Genome sequencing reveals agronomically important loci in rice using MutMap.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>30</volume> <fpage>174</fpage>&#x2013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2095</pub-id> <pub-id pub-id-type="pmid">22267009</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afgan</surname> <given-names>E.</given-names></name> <name><surname>Baker</surname> <given-names>D.</given-names></name> <name><surname>Batut</surname> <given-names>B.</given-names></name> <name><surname>Van den Beek</surname> <given-names>M.</given-names></name> <name><surname>Bouvier</surname> <given-names>D.</given-names></name> <name><surname>&#x010C;ech</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>W537</fpage>&#x2013;<lpage>W544</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alonso</surname> <given-names>J. M.</given-names></name> <name><surname>Stepanova</surname> <given-names>A. N.</given-names></name> <name><surname>Leisse</surname> <given-names>T. J.</given-names></name> <name><surname>Kim</surname> <given-names>C. J.</given-names></name> <name><surname>Chen</surname> <given-names>H.</given-names></name> <name><surname>Shinn</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Genome-wide insertional mutagenesis of <italic>Arabidopsis thaliana</italic>.</article-title> <source><italic>Science</italic></source> <volume>301</volume> <fpage>653</fpage>&#x2013;<lpage>657</lpage>. <pub-id pub-id-type="doi">10.1126/science.1086391</pub-id> <pub-id pub-id-type="pmid">12893945</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Candela</surname> <given-names>H.</given-names></name> <name><surname>Casanova-S&#x00E1;ez</surname> <given-names>R.</given-names></name> <name><surname>Micol</surname> <given-names>J. L.</given-names></name></person-group> (<year>2015</year>). <article-title>Getting started in mapping-by-sequencing.</article-title> <source><italic>J. Integr. Plant Biol.</italic></source> <volume>57</volume> <fpage>606</fpage>&#x2013;<lpage>612</lpage>. <pub-id pub-id-type="doi">10.1111/jipb.12305</pub-id> <pub-id pub-id-type="pmid">25359627</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cooley</surname> <given-names>L.</given-names></name> <name><surname>Kelley</surname> <given-names>R.</given-names></name> <name><surname>Spradling</surname> <given-names>A.</given-names></name></person-group> (<year>1988</year>). <article-title>Insertional mutagenesis of the <italic>Drosophila</italic> genome with single P elements.</article-title> <source><italic>Science</italic></source> <volume>239</volume> <fpage>1121</fpage>&#x2013;<lpage>1128</lpage>. <pub-id pub-id-type="doi">10.1126/science.2830671</pub-id> <pub-id pub-id-type="pmid">2830671</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ecovoiu</surname> <given-names>A. A.</given-names></name> <name><surname>Ghionoiu</surname> <given-names>I. C.</given-names></name> <name><surname>Ciuca</surname> <given-names>A. M.</given-names></name> <name><surname>Ratiu</surname> <given-names>A. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Genome ARTIST: a robust, high-accuracy aligner tool for mapping transposon insertions and self-insertions.</article-title> <source><italic>Mob. DNA</italic></source> <volume>7</volume>:<fpage>3</fpage>.</citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Etherington</surname> <given-names>G. J.</given-names></name> <name><surname>Monaghan</surname> <given-names>J.</given-names></name> <name><surname>Zipfel</surname> <given-names>C.</given-names></name> <name><surname>MacLean</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Mapping mutations in plant genomes with the user-friendly web application CandiSNP.</article-title> <source><italic>Plant Methods</italic></source> <volume>10</volume>:<fpage>41</fpage>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ewing</surname> <given-names>A. D.</given-names></name></person-group> (<year>2015</year>). <article-title>Transposable element detection from whole genome sequence data.</article-title> <source><italic>Mob. DNA</italic></source> <volume>6</volume>:<fpage>24</fpage>.</citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fay</surname> <given-names>D.</given-names></name> <name><surname>Spencer</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <source><italic>Genetic Mapping and Manipulation: Chapter 8-Dominant Mutations (February 17, 2006), WormBook</italic>. The <italic>C. elegans</italic> Research Community</source>. <publisher-loc>Madison, WI</publisher-loc>: <publisher-name>WormBook</publisher-name>, <pub-id pub-id-type="doi">10.1895/wormbook.1.97.1</pub-id> <pub-id pub-id-type="pmid">18050456</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fekih</surname> <given-names>R.</given-names></name> <name><surname>Takagi</surname> <given-names>H.</given-names></name> <name><surname>Tamiru</surname> <given-names>M.</given-names></name> <name><surname>Abe</surname> <given-names>A.</given-names></name> <name><surname>Natsume</surname> <given-names>S.</given-names></name> <name><surname>Yaegashi</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>MutMap+: genetic mapping and mutant identification without crossing in rice.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<fpage>e68529</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0068529</pub-id> <pub-id pub-id-type="pmid">23874658</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fr&#x00F8;kj&#x00E6;r-Jensen</surname> <given-names>C.</given-names></name> <name><surname>Davis</surname> <given-names>M. W.</given-names></name> <name><surname>Sarov</surname> <given-names>M.</given-names></name> <name><surname>Taylor</surname> <given-names>J.</given-names></name> <name><surname>Flibotte</surname> <given-names>S.</given-names></name> <name><surname>LaBella</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Random and targeted transgene insertion in <italic>Caenorhabditis elegans</italic> using a modified Mos1 transposon.</article-title> <source><italic>Nat. Methods</italic></source> <volume>11</volume> <fpage>529</fpage>&#x2013;<lpage>534</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2889</pub-id> <pub-id pub-id-type="pmid">24820376</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gasch</surname> <given-names>A.</given-names></name> <name><surname>Aoyama</surname> <given-names>T.</given-names></name> <name><surname>Foster</surname> <given-names>R.</given-names></name> <name><surname>Chua</surname> <given-names>N.-H.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>Gene isolation with the polymerase chain reaction</article-title>,&#x201D; in <source><italic>Methods in Arabidopsis Research</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Koncz</surname> <given-names>C.</given-names></name></person-group> (<publisher-loc>Singapore</publisher-loc>: <publisher-name>World Scientific</publisher-name>), <fpage>342</fpage>&#x2013;<lpage>356</lpage>. <pub-id pub-id-type="doi">10.1142/9789814439701_0014</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez</surname> <given-names>M. A.</given-names></name> <name><surname>Lebrigio</surname> <given-names>R. F.</given-names></name> <name><surname>Van Booven</surname> <given-names>D.</given-names></name> <name><surname>Ulloa</surname> <given-names>R. H.</given-names></name> <name><surname>Powell</surname> <given-names>E.</given-names></name> <name><surname>Speziani</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>GEnomes Management application (GEM.app): a new software tool for large-scale collaborative genome analysis.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>34</volume> <fpage>842</fpage>&#x2013;<lpage>846</lpage>. <pub-id pub-id-type="doi">10.1002/humu.22305</pub-id> <pub-id pub-id-type="pmid">23463597</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hartwig</surname> <given-names>B.</given-names></name> <name><surname>James</surname> <given-names>G. V.</given-names></name> <name><surname>Konrad</surname> <given-names>K.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Turck</surname> <given-names>F.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast isogenic mapping-by-sequencing of ethyl methanesulfonate-induced mutant bulks.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>160</volume> <fpage>591</fpage>&#x2013;<lpage>600</lpage>. <pub-id pub-id-type="doi">10.1104/pp.112.200311</pub-id> <pub-id pub-id-type="pmid">22837357</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>H&#x00E9;naff</surname> <given-names>E.</given-names></name> <name><surname>Zapata</surname> <given-names>L.</given-names></name> <name><surname>Casacuberta</surname> <given-names>J. M.</given-names></name> <name><surname>Ossowski</surname> <given-names>S.</given-names></name></person-group> (<year>2015</year>). <article-title>Jitterbug: somatic and germline transposon insertion detection at single-nucleotide resolution.</article-title> <source><italic>BMC Genomics</italic></source> <volume>16</volume>:<fpage>768</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-015-1975-5</pub-id> <pub-id pub-id-type="pmid">26459856</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname> <given-names>J. T.</given-names></name> <name><surname>Demarest</surname> <given-names>B. L.</given-names></name> <name><surname>Bisgrove</surname> <given-names>B. W.</given-names></name> <name><surname>Gorsi</surname> <given-names>B.</given-names></name> <name><surname>Su</surname> <given-names>Y. C.</given-names></name> <name><surname>Yost</surname> <given-names>H. J.</given-names></name></person-group> (<year>2013</year>). <article-title>MMAPPR: mutation mapping analysis pipeline for pooled RNA-seq.</article-title> <source><italic>Genome Res.</italic></source> <volume>23</volume> <fpage>687</fpage>&#x2013;<lpage>697</lpage>. <pub-id pub-id-type="doi">10.1101/gr.146936.112</pub-id> <pub-id pub-id-type="pmid">23299975</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>James</surname> <given-names>D. W.</given-names> <suffix>Jr.</suffix></name> <name><surname>Dooner</surname> <given-names>H. K.</given-names></name></person-group> (<year>1990</year>). <article-title>Isolation of EMS-induced mutants in <italic>Arabidopsis</italic> altered in seed fatty acid composition.</article-title> <source><italic>Theor. Appl. Genet.</italic></source> <volume>80</volume> <fpage>241</fpage>&#x2013;<lpage>245</lpage>. <pub-id pub-id-type="doi">10.1007/bf00224393</pub-id> <pub-id pub-id-type="pmid">24220902</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>James</surname> <given-names>G. V.</given-names></name> <name><surname>Patel</surname> <given-names>V.</given-names></name> <name><surname>Nordstrom</surname> <given-names>K. J.</given-names></name> <name><surname>Klasen</surname> <given-names>J. R.</given-names></name> <name><surname>Salome</surname> <given-names>P. A.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>User guide for mapping-by-sequencing in <italic>Arabidopsis</italic>.</article-title> <source><italic>Genome Biol.</italic></source> <volume>14</volume>:<fpage>R61</fpage>.</citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jansen</surname> <given-names>G.</given-names></name> <name><surname>Hazendonk</surname> <given-names>E.</given-names></name> <name><surname>Thijssen</surname> <given-names>K. L.</given-names></name> <name><surname>Plasterk</surname> <given-names>R. H.</given-names></name></person-group> (<year>1997</year>). <article-title>Reverse genetics by chemical mutagenesis in <italic>Caenorhabditis elegans</italic>.</article-title> <source><italic>Nat. Genet.</italic></source> <volume>17</volume> <fpage>119</fpage>&#x2013;<lpage>121</lpage>. <pub-id pub-id-type="doi">10.1038/ng0997-119</pub-id> <pub-id pub-id-type="pmid">9288111</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Javorka</surname> <given-names>P.</given-names></name> <name><surname>Raxwal</surname> <given-names>V. K.</given-names></name> <name><surname>Najvarek</surname> <given-names>J.</given-names></name> <name><surname>Riha</surname> <given-names>K.</given-names></name></person-group> (<year>2019</year>). <article-title>artMAP: a user-friendly tool for mapping ethyl methanesulfonate-induced mutations in <italic>Arabidopsis</italic>.</article-title> <source><italic>Plant Direct</italic></source> <volume>3</volume>:<fpage>e00146</fpage>. <pub-id pub-id-type="doi">10.1002/pld3.146</pub-id> <pub-id pub-id-type="pmid">31245783</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jiang</surname> <given-names>C.</given-names></name> <name><surname>Chen</surname> <given-names>C.</given-names></name> <name><surname>Huang</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>R.</given-names></name> <name><surname>Verdier</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>ITIS, a bioinformatics tool for accurate identification of transposon insertion sites using next-generation sequencing data.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>16</volume>:<fpage>72</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-015-0507-2</pub-id> <pub-id pub-id-type="pmid">25887332</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>D.</given-names></name> <name><surname>Paggi</surname> <given-names>J. M.</given-names></name> <name><surname>Park</surname> <given-names>C.</given-names></name> <name><surname>Bennett</surname> <given-names>C.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2019</year>). <article-title>Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>37</volume> <fpage>907</fpage>&#x2013;<lpage>915</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-019-0201-4</pub-id> <pub-id pub-id-type="pmid">31375807</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>H.</given-names></name> <name><surname>Xiao</surname> <given-names>Y.</given-names></name> <name><surname>Conklin</surname> <given-names>P. A.</given-names></name> <name><surname>Govindarajulu</surname> <given-names>R.</given-names></name> <name><surname>Kelly</surname> <given-names>J. A.</given-names></name> <name><surname>Scanlon</surname> <given-names>M. J.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Bulked-segregant analysis coupled to whole genome sequencing (BSA-Seq) for rapid gene cloning in maize.</article-title> <source><italic>G3</italic></source> <volume>8</volume> <fpage>3583</fpage>&#x2013;<lpage>3592</lpage>. <pub-id pub-id-type="doi">10.1534/g3.118.200499</pub-id> <pub-id pub-id-type="pmid">30194092</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname> <given-names>B.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with Bowtie 2.</article-title> <source><italic>Nat. Methods</italic></source> <volume>9</volume> <fpage>357</fpage>&#x2013;<lpage>359</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id> <pub-id pub-id-type="pmid">22388286</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2078</fpage>&#x2013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id> <pub-id pub-id-type="pmid">19505943</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Hey</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name> <name><surname>McNinch</surname> <given-names>C.</given-names></name> <name><surname>Hu</surname> <given-names>H. C.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Characterization of maize <italic>roothairless6</italic> which encodes a D-type cellulose synthase and controls the switch from bulge formation to tip growth.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>6</volume>:<fpage>34395</fpage>.</citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name> <name><surname>Dietrich</surname> <given-names>C. R.</given-names></name> <name><surname>Hu</surname> <given-names>H. C.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>The maize <italic>glossy13</italic> gene, cloned via BSR-Seq and Seq-walking encodes a putative ABC transporter required for the normal accumulation of epicuticular waxes.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<fpage>e82333</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0082333</pub-id> <pub-id pub-id-type="pmid">24324772</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>W. X.</given-names></name> <name><surname>Wu</surname> <given-names>S. L.</given-names></name> <name><surname>Liu</surname> <given-names>Y. H.</given-names></name> <name><surname>Jin</surname> <given-names>G. L.</given-names></name> <name><surname>Zhao</surname> <given-names>H. J.</given-names></name> <name><surname>Fan</surname> <given-names>L. J.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Genome-wide profiling of genetic variation in <italic>Agrobacterium</italic>-transformed rice plants.</article-title> <source><italic>J. Zhejiang Univ. Sci. B</italic></source> <volume>17</volume> <fpage>992</fpage>&#x2013;<lpage>996</lpage>. <pub-id pub-id-type="doi">10.1631/jzus.b1600301</pub-id> <pub-id pub-id-type="pmid">27921404</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Yeh</surname> <given-names>C. T.</given-names></name> <name><surname>Tang</surname> <given-names>H. M.</given-names></name> <name><surname>Nettleton</surname> <given-names>D.</given-names></name> <name><surname>Schnable</surname> <given-names>P. S.</given-names></name></person-group> (<year>2012</year>). <article-title>Gene mapping via bulked segregant RNA-Seq (BSR-Seq).</article-title> <source><italic>PLoS One</italic></source> <volume>7</volume>:<fpage>e36406</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0036406</pub-id> <pub-id pub-id-type="pmid">22586469</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y. G.</given-names></name> <name><surname>Mitsukawa</surname> <given-names>N.</given-names></name> <name><surname>Oosumi</surname> <given-names>T.</given-names></name> <name><surname>Whittier</surname> <given-names>R. F.</given-names></name></person-group> (<year>1995</year>). <article-title>Efficient isolation and mapping of <italic>Arabidopsis thaliana</italic> T-DNA insert junctions by thermal asymmetric interlaced PCR.</article-title> <source><italic>Plant J.</italic></source> <volume>8</volume> <fpage>457</fpage>&#x2013;<lpage>463</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-313x.1995.08030457.x</pub-id> <pub-id pub-id-type="pmid">7550382</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mateo-Bonmat&#x00ED;</surname> <given-names>E.</given-names></name> <name><surname>Esteve-Bruna</surname> <given-names>D.</given-names></name> <name><surname>Juan-Vicente</surname> <given-names>L.</given-names></name> <name><surname>Nadi</surname> <given-names>R.</given-names></name> <name><surname>Candela</surname> <given-names>H.</given-names></name> <name><surname>Lozano</surname> <given-names>F. M.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title><italic>INCURVATA11</italic> and <italic>CUPULIFORMIS2</italic> are redundant genes that encode epigenetic machinery components in <italic>Arabidopsis</italic>.</article-title> <source><italic>Plant Cell</italic></source> <volume>30</volume> <fpage>1596</fpage>&#x2013;<lpage>1616</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.18.00300</pub-id> <pub-id pub-id-type="pmid">29915151</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Medford</surname> <given-names>J. I.</given-names></name> <name><surname>Behringer</surname> <given-names>F. J.</given-names></name> <name><surname>Callos</surname> <given-names>J. D.</given-names></name> <name><surname>Feldmann</surname> <given-names>K. A.</given-names></name></person-group> (<year>1992</year>). <article-title>Normal and abnormal development in the <italic>Arabidopsis</italic> vegetative shoot apex.</article-title> <source><italic>Plant Cell</italic></source> <volume>4</volume> <fpage>631</fpage>&#x2013;<lpage>643</lpage>. <pub-id pub-id-type="doi">10.2307/3869522</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michelmore</surname> <given-names>R. W.</given-names></name> <name><surname>Paran</surname> <given-names>I.</given-names></name> <name><surname>Kesseli</surname> <given-names>R. V.</given-names></name></person-group> (<year>1991</year>). <article-title>Identification of markers linked to disease-resistance genes by bulked segregant analysis: a rapid method to detect markers in specific genomic regions by using segregating populations.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>88</volume> <fpage>9828</fpage>&#x2013;<lpage>9832</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.88.21.9828</pub-id> <pub-id pub-id-type="pmid">1682921</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Minevich</surname> <given-names>G.</given-names></name> <name><surname>Park</surname> <given-names>D. S.</given-names></name> <name><surname>Blankenberg</surname> <given-names>D.</given-names></name> <name><surname>Poole</surname> <given-names>R. J.</given-names></name> <name><surname>Hobert</surname> <given-names>O.</given-names></name></person-group> (<year>2012</year>). <article-title>CloudMap: a cloud-based pipeline for analysis of mutant genome sequences.</article-title> <source><italic>Genetics</italic></source> <volume>192</volume> <fpage>1249</fpage>&#x2013;<lpage>1269</lpage>. <pub-id pub-id-type="doi">10.1534/genetics.112.144204</pub-id> <pub-id pub-id-type="pmid">23051646</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morel</surname> <given-names>J. B.</given-names></name> <name><surname>Godon</surname> <given-names>C.</given-names></name> <name><surname>Mourrain</surname> <given-names>P.</given-names></name> <name><surname>Beclin</surname> <given-names>C.</given-names></name> <name><surname>Boutet</surname> <given-names>S.</given-names></name> <name><surname>Feuerbach</surname> <given-names>F.</given-names></name><etal/></person-group> (<year>2002</year>). <article-title>Fertile hypomorphic ARGONAUTE (ago1) mutants impaired in post-transcriptional gene silencing and virus resistance.</article-title> <source><italic>Plant Cell</italic></source> <volume>14</volume> <fpage>629</fpage>&#x2013;<lpage>639</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.010358</pub-id> <pub-id pub-id-type="pmid">11910010</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Narasimhan</surname> <given-names>V.</given-names></name> <name><surname>Danecek</surname> <given-names>P.</given-names></name> <name><surname>Scally</surname> <given-names>A.</given-names></name> <name><surname>Xue</surname> <given-names>Y.</given-names></name> <name><surname>Tyler-Smith</surname> <given-names>C.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data.</article-title> <source><italic>Bioinformatics</italic></source> <volume>32</volume> <fpage>1749</fpage>&#x2013;<lpage>1751</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw044</pub-id> <pub-id pub-id-type="pmid">26826718</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neuffer</surname> <given-names>M. G.</given-names></name> <name><surname>Ficsor</surname> <given-names>G.</given-names></name></person-group> (<year>1963</year>). <article-title>Mutagenic action of ethyl methanesulfonate in maize.</article-title> <source><italic>Science</italic></source> <volume>139</volume> <fpage>1296</fpage>&#x2013;<lpage>1297</lpage>. <pub-id pub-id-type="doi">10.1126/science.139.3561.1296</pub-id> <pub-id pub-id-type="pmid">17757068</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Obholzer</surname> <given-names>N.</given-names></name> <name><surname>Swinburne</surname> <given-names>I. A.</given-names></name> <name><surname>Schwab</surname> <given-names>E.</given-names></name> <name><surname>Nechiporuk</surname> <given-names>A. V.</given-names></name> <name><surname>Nicolson</surname> <given-names>T.</given-names></name> <name><surname>Megason</surname> <given-names>S. G.</given-names></name></person-group> (<year>2012</year>). <article-title>Rapid positional cloning of zebrafish mutations by linkage and homozygosity mapping using whole-genome sequencing.</article-title> <source><italic>Development</italic></source> <volume>139</volume> <fpage>4280</fpage>&#x2013;<lpage>4290</lpage>. <pub-id pub-id-type="doi">10.1242/dev.083931</pub-id> <pub-id pub-id-type="pmid">23052906</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x2019;Malley</surname> <given-names>R. C.</given-names></name> <name><surname>Alonso</surname> <given-names>J. M.</given-names></name> <name><surname>Kim</surname> <given-names>C. J.</given-names></name> <name><surname>Leisse</surname> <given-names>T. J.</given-names></name> <name><surname>Ecker</surname> <given-names>J. R.</given-names></name></person-group> (<year>2007</year>). <article-title>An adapter ligation-mediated PCR method for high-throughput mapping of T-DNA inserts in the <italic>Arabidopsis</italic> genome.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>2</volume> <fpage>2910</fpage>&#x2013;<lpage>2917</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2007.425</pub-id> <pub-id pub-id-type="pmid">18007627</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ponce</surname> <given-names>M. R.</given-names></name> <name><surname>Quesada</surname> <given-names>V.</given-names></name> <name><surname>Micol</surname> <given-names>J. L.</given-names></name></person-group> (<year>1998</year>). <article-title>Rapid discrimination of sequences flanking and within T-DNA insertions in the <italic>Arabidopsis</italic> genome.</article-title> <source><italic>Plant J</italic>.</source> <volume>14</volume> <fpage>497</fpage>&#x2013;<lpage>501</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-313x.1998.00146.x</pub-id> <pub-id pub-id-type="pmid">9670564</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ponce</surname> <given-names>M. R.</given-names></name> <name><surname>Robles</surname> <given-names>P.</given-names></name> <name><surname>Micol</surname> <given-names>J. L.</given-names></name></person-group> (<year>1999</year>). <article-title>High-throughput genetic mapping in <italic>Arabidopsis thaliana</italic>.</article-title> <source><italic>Mol. Gen. Genet.</italic></source> <volume>261</volume> <fpage>408</fpage>&#x2013;<lpage>415</lpage>. <pub-id pub-id-type="doi">10.1007/s004380050982</pub-id> <pub-id pub-id-type="pmid">10102377</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pulido-Tamayo</surname> <given-names>S.</given-names></name> <name><surname>Duitama</surname> <given-names>J.</given-names></name> <name><surname>Marchal</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>EXPLoRA-web: linkage analysis of quantitative trait loci using bulk segregant analysis.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>W142</fpage>&#x2013;<lpage>W146</lpage>.</citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rishmawi</surname> <given-names>L.</given-names></name> <name><surname>Sun</surname> <given-names>H.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Hulskamp</surname> <given-names>M.</given-names></name> <name><surname>Schrader</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Rapid identification of a natural knockout allele of <italic>ARMADILLO REPEAT-CONTAINING KINESIN1</italic> that causes root hair branching by mapping-by-sequencing.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>166</volume> <fpage>1280</fpage>&#x2013;<lpage>1287</lpage>. <pub-id pub-id-type="doi">10.1104/pp.114.244046</pub-id> <pub-id pub-id-type="pmid">25248719</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Fast-forward genetics enabled by new sequencing technologies.</article-title> <source><italic>Trends Plant Sci.</italic></source> <volume>16</volume> <fpage>282</fpage>&#x2013;<lpage>288</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2011.02.006</pub-id> <pub-id pub-id-type="pmid">21439889</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>D. R.</given-names></name></person-group> (<year>2015</year>). <article-title>Buying in to bioinformatics: an introduction to commercial sequence analysis software.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>16</volume> <fpage>700</fpage>&#x2013;<lpage>709</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbu030</pub-id> <pub-id pub-id-type="pmid">25183247</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Solaimanpour</surname> <given-names>S.</given-names></name> <name><surname>Sarmiento</surname> <given-names>F.</given-names></name> <name><surname>Mr&#x00E1;zek</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>Tn-seq explorer: a tool for analysis of high-throughput sequencing data of transposon mutant libraries.</article-title> <source><italic>PLoS One</italic></source> <volume>10</volume>:<fpage>e0126070</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0126070</pub-id> <pub-id pub-id-type="pmid">25938432</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>H.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name></person-group> (<year>2015</year>). <article-title>SHOREmap v3.0: fast and accurate identification of causal mutations from forward genetic screens.</article-title> <source><italic>Methods Mol. Biol.</italic></source> <volume>1284</volume> <fpage>381</fpage>&#x2013;<lpage>395</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-2444-8_19</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wachsman</surname> <given-names>G.</given-names></name> <name><surname>Modliszewski</surname> <given-names>J. L.</given-names></name> <name><surname>Valdes</surname> <given-names>M.</given-names></name> <name><surname>Benfey</surname> <given-names>P. N.</given-names></name></person-group> (<year>2017</year>). <article-title>A SIMPLE pipeline for mapping point mutations.</article-title> <source><italic>Plant Physiol.</italic></source> <volume>174</volume> <fpage>1307</fpage>&#x2013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1104/pp.17.00415</pub-id> <pub-id pub-id-type="pmid">28546435</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wilson-S&#x00E1;nchez</surname> <given-names>D.</given-names></name> <name><surname>Lup</surname> <given-names>S. D.</given-names></name> <name><surname>Sarmiento-Ma&#x00F1;&#x00FA;s</surname> <given-names>R.</given-names></name> <name><surname>Ponce</surname> <given-names>M. R.</given-names></name> <name><surname>Micol</surname> <given-names>J. L.</given-names></name></person-group> (<year>2019</year>). <article-title>Next-generation forward genetic screens: using simulated data to improve the design of mapping-by-sequencing experiments in <italic>Arabidopsis</italic>.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume> <fpage>140</fpage>&#x2013;<lpage>143</lpage>.</citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wilson-S&#x00E1;nchez</surname> <given-names>D.</given-names></name> <name><surname>Rubio-D&#x00ED;az</surname> <given-names>S.</given-names></name> <name><surname>Mu&#x00F1;oz-Viana</surname> <given-names>R.</given-names></name> <name><surname>P&#x00E9;rez-P&#x00E9;rez</surname> <given-names>J. M.</given-names></name> <name><surname>Jover-Gil</surname> <given-names>S.</given-names></name> <name><surname>Ponce</surname> <given-names>M. R.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Leaf phenomics: a systematic reverse genetic screen for <italic>Arabidopsis</italic> leaf mutants.</article-title> <source><italic>Plant J.</italic></source> <volume>79</volume> <fpage>878</fpage>&#x2013;<lpage>891</lpage>. <pub-id pub-id-type="doi">10.1111/tpj.12595</pub-id> <pub-id pub-id-type="pmid">24946828</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>L.</given-names></name> <name><surname>Wang</surname> <given-names>C.</given-names></name> <name><surname>Holst-Jensen</surname> <given-names>A.</given-names></name> <name><surname>Morisset</surname> <given-names>D.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). <article-title>Characterization of GM events by insert knowledge adapted re-sequencing approaches.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>3</volume>:<fpage>2839</fpage>.</citation></ref>
</ref-list>
<glossary>
<title>Abbreviations</title>
<def-list id="DL1">
<def-item><term>AF</term><def><p>allele frequency</p></def></def-item>
<def-item><term>CV</term><def><p>coverage</p></def></def-item>
<def-item><term>EMS</term><def><p>ethyl methanesulfonate</p></def></def-item>
<def-item><term>NGS</term><def><p>next-generation sequencing</p></def></def-item>
<def-item><term>RD</term><def><p>read depth</p></def></def-item>
<def-item><term>SNP</term><def><p>single-nucleotide polymorphism.</p></def></def-item>
</def-list>
</glossary>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://atlas.umh.es/genetics/">http://atlas.umh.es/genetics/</ext-link></p></fn>
</fn-group>
</back>
</article>