<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2017.00091</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>RNA-Seq of Guar (<italic>Cyamopsis tetragonoloba</italic>, L. Taub.) Leaves: <italic>De novo</italic> Transcriptome Assembly, Functional Annotation and Development of Genomic Resources</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Tanwar</surname> <given-names>Umesh K.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/380382/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pruthi</surname> <given-names>Vikas</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/405920/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Randhawa</surname> <given-names>Gursharn S.</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/379432/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Biotechnology, Indian Institute of Technology Roorkee</institution> <country>Roorkee, India</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Michael Deyholos, University of British Columbia, Canada</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Manish Kumar Pandey, International Crops Research Institute for the Semi-Arid Tropics, India; Gunvant Baliram Patil, University of Missouri, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Gursharn S. Randhawa <email>SHARNFBS&#x00040;iitr.ac.in</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant Genetics and Genomics, a section of the journal Frontiers in Plant Science</p></fn></author-notes>
<pub-date pub-type="epub">
<day>02</day>
<month>02</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>91</elocation-id>
<history>
<date date-type="received">
<day>21</day>
<month>09</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>01</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Tanwar, Pruthi and Randhawa.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Tanwar, Pruthi and Randhawa</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Genetic improvement of the industrially important guar (<italic>Cyamopsis tetragonoloba</italic>, L. Taub.) crop has been hindered by the lack of sufficient genomic or transcriptomic resources. In this study, RNA-Seq technology was employed to characterize the transcriptome of leaf tissues from two guar varieties, namely, M-83 and RGC-1066. Approximately 30 million high-quality paired-end reads per variety, generated on the Illumina HiSeq platform, were used for <italic>de novo</italic> assembly with the Trinity program. A total of 62,146 non-redundant unigenes with an average length of 679 bp were obtained. The quality assessment of the assembled unigenes revealed 87.50% complete and 97.18% partial core eukaryotic genes (CEGs). Sequence similarity analyses and annotation of the unigenes against the non-redundant protein (Nr) and Gene Ontology (GO) databases identified 175,882 GO annotations. A total of 11,308 guar unigenes were annotated with various enzyme codes (EC) and grouped into six categories with 55 subclasses. The annotation of biochemical pathways assigned a total of 11,971 unigenes to 145 KEGG maps and 1759 enzyme codes. The species distribution analysis of the unigenes showed the highest similarity with <italic>Glycine max</italic> genes. A total of 5773 potential simple sequence repeats (SSRs) and 3594 high-quality single nucleotide polymorphisms (SNPs) were identified. Of 20 SSRs randomly selected for wet laboratory validation, 13 showed consistent PCR amplification in both guar varieties. <italic>In silico</italic> studies identified 145 polymorphic SSR markers between the two varieties. To the best of our knowledge, this is the first report on transcriptome analysis and SNP identification in guar.</p></abstract>
<kwd-group>
<kwd>next generation sequencing</kwd>
<kwd>transcriptome analysis</kwd>
<kwd>molecular markers</kwd>
<kwd>simple sequence repeats</kwd>
<kwd>single nucleotide polymorphisms</kwd>
</kwd-group>
<counts>
<fig-count count="8"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="84"/>
<page-count count="15"/>
<word-count count="9133"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Guar [<italic>Cyamopsis tetragonoloba</italic>, L. Taub.], also known as clusterbean, is an annual drought-tolerant legume crop belonging to the family Leguminosae. It is grown mainly in semiarid regions of India, Pakistan, and the United States. Guar has traditionally been used as a forage, green manure and vegetable crop (Dwivedi et al., <xref ref-type="bibr" rid="B18">1995</xref>). In recent times, it has attained the status of an economically important crop because of the gum contained in the endosperm of its seeds. Guar gum contains about 90% galactomannan and is one of the most cost-effective natural thickeners (Dhugga et al., <xref ref-type="bibr" rid="B15">2004</xref>). It is used in the textile, paper, petroleum, explosives, cosmetics, and pharmaceutical industries (Yadav et al., <xref ref-type="bibr" rid="B77">2013</xref>). Additionally, guar gum is used in the treatment of diarrhea, irritable bowel syndrome, diabetes, and high cholesterol (Slavin and Greenberg, <xref ref-type="bibr" rid="B57">2003</xref>; Giannini et al., <xref ref-type="bibr" rid="B20">2006</xref>; Butt et al., <xref ref-type="bibr" rid="B9">2007</xref>). Therefore, the demand for guar has increased globally in recent years, leading to its introduction in several countries with varied climates and seasons, including South Africa, Australia and Brazil (Undersander et al., <xref ref-type="bibr" rid="B62">1991</xref>). As a result, there is a need to develop, through breeding programs, improved guar varieties for a wide range of climatic conditions.</p>
<p>Molecular markers are useful in breeding programs involving marker-assisted selection, and their use has reduced the time and effort needed to develop improved varieties (Kesawat and Kumar, <xref ref-type="bibr" rid="B26">2009</xref>). These markers detect genetic polymorphism at specific loci and at the whole-genome level, and they facilitate marker-based gene tagging, genetic mapping, map-based cloning of agronomically important genes, genetic diversity studies, and phylogenetic analysis (Morgante et al., <xref ref-type="bibr" rid="B37">2002</xref>; Kesawat and Kumar, <xref ref-type="bibr" rid="B26">2009</xref>). Five types of molecular markers, namely, random amplified polymorphic DNA (RAPD), ribosomal DNA (rDNA), inter simple sequence repeat (ISSR), simple sequence repeat (SSR) and sequence characterized amplified region (SCAR), have been used to study molecular diversity in guar (Punia et al., <xref ref-type="bibr" rid="B50">2009</xref>; Pathak et al., <xref ref-type="bibr" rid="B47">2010</xref>, <xref ref-type="bibr" rid="B46">2011</xref>; Kuravadi et al., <xref ref-type="bibr" rid="B30">2013</xref>, <xref ref-type="bibr" rid="B31">2014</xref>; Sharma et al., <xref ref-type="bibr" rid="B56">2014</xref>; Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>). Among the various molecular markers, SSR and single nucleotide polymorphism (SNP) markers are considered to be very important in genetic and plant breeding applications (Hiremath et al., <xref ref-type="bibr" rid="B23">2012</xref>). However, only a limited number of SSR markers is available in guar (Kuravadi et al., <xref ref-type="bibr" rid="B31">2014</xref>; Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>) and no SNPs have yet been reported in this crop.</p>
<p>Next generation sequencing (NGS) offers novel opportunities in functional genomics, gene identification and development of molecular markers in non-model plants (Wang et al., <xref ref-type="bibr" rid="B72">2009</xref>). Massively parallel sequencing of RNA (RNA-Seq) is a powerful tool for transcriptome profiling, providing rapid access to a large collection of expressed sequences (the transcriptome). This sequencing approach is more efficient than traditional expressed sequence tag (EST) sequencing. RNA-Seq technology has been successfully applied in several organisms including model and non-model plants (Mortazavi et al., <xref ref-type="bibr" rid="B38">2008</xref>; Nagalakshmi et al., <xref ref-type="bibr" rid="B39">2008</xref>; Parchman et al., <xref ref-type="bibr" rid="B43">2010</xref>; Wang et al., <xref ref-type="bibr" rid="B74">2014</xref>). This technology can be used as a cost-effective source for the development of molecular markers such as SSRs and SNPs (Wang et al., <xref ref-type="bibr" rid="B73">2011</xref>, <xref ref-type="bibr" rid="B74">2014</xref>). These transcriptome-derived markers are expected to show greater transferability among closely related species than genomic markers because they lie in more conserved, transcribed regions of the genome (Cordeiro et al., <xref ref-type="bibr" rid="B14">2001</xref>). These markers can also be used for comparative mapping and evolutionary studies (Varshney et al., <xref ref-type="bibr" rid="B68">2005b</xref>).</p>
<p>At present, complete genome sequences of five legumes, namely, soybean, <italic>Lotus, Medicago</italic>, pigeonpea, and chickpea, are available (Sato et al., <xref ref-type="bibr" rid="B53">2008</xref>; Schmutz et al., <xref ref-type="bibr" rid="B54">2010</xref>; Young et al., <xref ref-type="bibr" rid="B82">2011</xref>; Varshney et al., <xref ref-type="bibr" rid="B65">2012</xref>, <xref ref-type="bibr" rid="B69">2013</xref>; Jain et al., <xref ref-type="bibr" rid="B24">2013</xref>). Neither genome sequencing nor transcriptome analysis of guar has yet been carried out. Only 16,476 ESTs from developing guar embryos are available in the National Center for Biotechnology Information (NCBI) database. Breeding programs in guar have been hindered by the limited availability of genomic resources for this crop. The development of genomic resources for guar is needed to support molecular genetics research at different levels. Therefore, the present study was undertaken to develop genomic resources based on the sequencing of cDNA pools from leaf tissues of two guar varieties (M-83 and RGC-1066), which were selected because of their contrasting characteristics.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Plant material and transcriptome sequencing</title>
<p>The seeds of two guar varieties, namely, M-83 and RGC-1066, were obtained from Rajasthan Agricultural Research Institute, Durgapura, Jaipur (India). M-83 is a vegetable variety with a glabrous leaf surface and white flowers. RGC-1066 is a commercial variety for gum production with a hairy leaf surface and purple flowers. The plants were grown under field conditions at Indian Institute of Technology Roorkee, India, and healthy leaves were collected from 3-week-old plants. The sequencing of the leaf transcriptome was outsourced to SciGenome Labs Pvt. Ltd., Cochin (India). Three technical and three biological replicates were used for library preparation and RNA-Seq. Total RNA from the leaves of each variety was extracted using the Sigma Spectrum&#x02122; Plant Total RNA Kit (Sigma-Aldrich, USA), and a cDNA library of each variety was prepared by the procedure described in Illumina&#x00027;s TruSeq&#x000AE; RNA sample preparation guide (Illumina, Inc., USA). Each cDNA library was sequenced on an Illumina HiSeq 2500 machine to obtain paired-end reads of 100 bp length. The raw data were obtained from the company in FASTQ format.</p>
</sec>
<sec>
<title><italic>De novo</italic> transcriptome assembly of guar leaf</title>
<p>The raw reads of the leaf transcriptome of each guar variety were processed for quality control with FastQC version 0.11.4 (Andrews, <xref ref-type="bibr" rid="B3">2010</xref>). The adaptor sequences and low-quality reads with ambiguous bases &#x0201C;N&#x0201D; were removed to obtain the clean reads. The clean reads from both varieties were then pooled by read orientation. The pooled clean reads were uploaded to the Transcriptomes User-Friendly Analysis (TRUFA) web server, which provides cluster computing for <italic>de novo</italic> transcriptome assembly (Kornobis et al., <xref ref-type="bibr" rid="B28">2015</xref>). The Trinity program (Grabherr et al., <xref ref-type="bibr" rid="B22">2011</xref>) was employed to assemble the clean reads into unigene contigs. For the <italic>de novo</italic> transcriptome assembly, the <italic>k-mer</italic> size was set to 25 and default values were used for the other parameters. The assembled transcripts were clustered with the CD-HIT version 4.5.4 tool (Li and Godzik, <xref ref-type="bibr" rid="B34">2006</xref>) at a sequence identity threshold of 0.95 to remove redundant transcripts. The quality of the transcriptome assembly was checked by assessing the presence of 248 ultra-conserved core eukaryotic genes (CEGs) in the assembly using the Core Eukaryotic Genes Mapping Approach (CEGMA) computational method (Parra et al., <xref ref-type="bibr" rid="B44">2007</xref>, <xref ref-type="bibr" rid="B45">2009</xref>).</p>
</sec>
<sec>
<title>Functional annotation of guar leaf transcriptome</title>
<p>Functional annotation was performed by comparing the sequences of the clustered assembly with public databases. The sequence similarity search of the unigenes was carried out with the BLASTX tool (Altschul et al., <xref ref-type="bibr" rid="B2">1997</xref>). Homologs of the assembled unigenes were searched in the NCBI non-redundant protein (Nr), UniProt Reference Clusters (UniRef; Suzek et al., <xref ref-type="bibr" rid="B59">2015</xref>) and Pfam (Finn et al., <xref ref-type="bibr" rid="B19">2014</xref>) databases using default parameters. The BLAST&#x0002B; (Camacho et al., <xref ref-type="bibr" rid="B10">2009</xref>) results against the Nr database were imported into the Blast2GO suite (Conesa et al., <xref ref-type="bibr" rid="B13">2005</xref>) for mapping and retrieving Gene Ontology (GO) and unique enzyme code (EC) annotations of the assembled unigenes. The retrieved GO terms were allocated to the query sequences, and the genes present in the transcriptome were classified into the cellular component, molecular function and biological process categories. The WEGO tool (Ye et al., <xref ref-type="bibr" rid="B81">2006</xref>) was used for functional classification and graphical representation of GO terms at the macro level. The assembled unigenes were further annotated against the Kyoto Encyclopedia of Genes and Genomes (KEGG) metabolic pathways database (Kanehisa and Goto, <xref ref-type="bibr" rid="B25">2000</xref>). The comparison of the assembled unigenes with the most closely related species was carried out with the TRAPID online tool (Van Bel et al., <xref ref-type="bibr" rid="B64">2013</xref>) with a similarity search <italic>E</italic>-value of 10e-5.</p>
</sec>
<sec>
<title>Mining of simple sequence repeats (SSRs) of guar transcriptome</title>
<p>SSRs were mined by searching for six classes of repeat motifs (mono-, di-, tri-, tetra-, penta-, and hexanucleotides) using the Perl script MIcroSAtellite (MISA) (Thiel et al., <xref ref-type="bibr" rid="B60">2003</xref>). The following default definitions (unit size/minimum number of repeats) were set in MISA for microsatellites: (1/10) (2/6) (3/5) (4/5) (5/5) (6/5). All motifs consisting of continuous, uninterrupted repeats were classified as perfect, and motifs containing two or more classes of repeats were classified as compound microsatellites. The maximum number of bases interrupting two SSRs in a compound microsatellite was set to 100.</p>
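<p>The repeat thresholds listed above can be illustrated with a minimal sketch. It is not the MISA script itself: the function names and the example sequence are hypothetical, only perfect SSRs are detected, and the merging of neighbouring SSRs into compound microsatellites is deliberately left out.</p>
<preformat><![CDATA[
```python
# Unit size -> minimum number of repeats, as listed in the text:
# (1/10) (2/6) (3/5) (4/5) (5/5) (6/5)
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_degenerate(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. 'AA', 'ATAT')."""
    size = len(motif)
    return any(
        size % k == 0 and motif == motif[:k] * (size // k)
        for k in range(1, size)
    )


def find_perfect_ssrs(seq):
    """Return sorted (start, end, motif, repeats) tuples; 0-based start, exclusive end."""
    seq = seq.upper()
    hits = []
    for unit, min_rep in sorted(MIN_REPEATS.items()):
        i = 0
        while i + unit <= len(seq):
            motif = seq[i:i + unit]
            # Skip ambiguous bases and motifs already covered by a shorter unit size.
            if set(motif) - set("ACGT") or is_degenerate(motif):
                i += 1
                continue
            # Count how many times the motif repeats back to back from position i.
            repeats = 1
            while seq[i + repeats * unit:i + (repeats + 1) * unit] == motif:
                repeats += 1
            if repeats >= min_rep:
                hits.append((i, i + repeats * unit, motif, repeats))
                i += repeats * unit  # jump past the reported SSR
            else:
                i += 1
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical toy sequence containing (AT)6, (A)10 and (GAA)5.
    example = "GGC" + "AT" * 6 + "CCG" + "A" * 10 + "T" + "GAA" * 5 + "C"
    for hit in find_perfect_ssrs(example):
        print(hit)
```
]]></preformat>
<p>In this sketch a motif such as AA or ATAT is skipped because it is already covered by a shorter unit size, so each repeat is reported once under its shortest motif.</p>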
</sec>
<sec>
<title>Validation of SSR markers</title>
<p>Twenty SSR markers representing all the motif types (except mononucleotide repeats) were selected randomly for wet laboratory validation. The primers were designed with the Primer3 tool (Koressaar and Remm, <xref ref-type="bibr" rid="B27">2007</xref>; Untergasser et al., <xref ref-type="bibr" rid="B63">2012</xref>). DNA was extracted from the healthy leaves of field-grown guar plants of each variety by the CTAB method (Doyle and Doyle, <xref ref-type="bibr" rid="B16">1990</xref>) with slight modifications. The quality of the extracted DNA was assessed by gel electrophoresis on a 0.8% agarose gel. The isolated DNA was quantified by measuring the absorbance at 260 nm in a UV-visible Varian spectrophotometer, model Cary 100, and diluted with TE buffer to &#x0007E;100 ng/&#x003BC;l. Polymerase chain reaction (PCR) was carried out in a Mastercycler gradient programmable thermal cycler (Eppendorf). PCR-amplified products were electrophoresed on 8% PAGE gels and visualized under white light by silver staining. A 100 bp DNA ladder was used as a size marker to determine the approximate size of the fragments. The gel was documented in a gel documentation unit (Bio-Rad).</p>
</sec>
<sec>
<title><italic>In silico</italic> analysis of SSR polymorphism</title>
<p>The reads of each variety were mapped to the assembly using Bowtie2 version 2.2.6 (Langmead and Salzberg, <xref ref-type="bibr" rid="B32">2012</xref>) to obtain sorted alignments in BAM format (the binary version of SAM files). <italic>In silico</italic> identification of SSR polymorphism was carried out using the Integrative Genomics Viewer (IGV 2.3) software (Robinson et al., <xref ref-type="bibr" rid="B52">2011</xref>; Thorvaldsd&#x000F3;ttir et al., <xref ref-type="bibr" rid="B61">2013</xref>). The sorted alignments of both varieties were displayed against the assembly in IGV 2.3 and inspected manually to identify SSR differences between the guar varieties M-83 and RGC-1066.</p>
</sec>
<sec>
<title>Detection of single nucleotide polymorphisms (SNPs)</title>
<p>The reads of each guar variety were aligned against the assembled unigenes with Bowtie2 version 2.2.6 (Langmead and Salzberg, <xref ref-type="bibr" rid="B32">2012</xref>) to obtain sorted alignments (BAM files) for each variety. SNPs were detected with the SAMtools 1.3 (Li et al., <xref ref-type="bibr" rid="B33">2009</xref>) variant calling programs in the Integrated SNP Mining and Utilization (ISMU) pipeline (Azam et al., <xref ref-type="bibr" rid="B5">2014</xref>). The <italic>de novo</italic> assembly was used as the reference for SNP calling. A position was called a putative SNP if either variety carried an allele different from the reference. The putative SNPs were further filtered to retain only homozygous allele types with a minimum read depth of 5 in each variety.</p>
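<p>The filtering rule described above can be illustrated with a small sketch. It is not the SAMtools or ISMU implementation: the <monospace>Call</monospace> record, the <monospace>keep_snp</monospace> function and the example genotypes are hypothetical names and values introduced only for illustration, and the only parameter taken from the text is the minimum read depth of 5.</p>
<preformat><![CDATA[
```python
from dataclasses import dataclass

MIN_DEPTH = 5  # minimum read depth required in each variety (from the text)


@dataclass(frozen=True)
class Call:
    """Hypothetical per-variety genotype call at one position of the assembly."""
    alleles: tuple  # called alleles, e.g. ("A", "A") or ("A", "G")
    depth: int      # number of reads covering the position in this variety


def is_homozygous(call):
    """True if all called alleles at the site are identical."""
    return len(set(call.alleles)) == 1


def keep_snp(ref_base, calls):
    """Apply the described filters; `calls` maps variety name -> Call."""
    # Putative SNP: at least one variety carries an allele that differs
    # from the reference (here, the de novo assembly).
    differs = any(
        allele != ref_base
        for call in calls.values()
        for allele in call.alleles
    )
    # Filters: homozygous call and sufficient depth in every variety.
    all_homozygous = all(is_homozygous(call) for call in calls.values())
    deep_enough = all(call.depth >= MIN_DEPTH for call in calls.values())
    return differs and all_homozygous and deep_enough


if __name__ == "__main__":
    site = {"M-83": Call(("A", "A"), 12), "RGC-1066": Call(("G", "G"), 9)}
    print(keep_snp("A", site))  # both varieties homozygous, depth at least 5
```
]]></preformat>
<p>Under these assumptions, a site is discarded when either variety is heterozygous or is covered by fewer than five reads, even if the other variety clearly differs from the reference.</p>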
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>RNA-seq and <italic>de novo</italic> transcriptome assembly of guar leaf</title>
<p>The Illumina HiSeq sequencing platform generated 28,688,024 and 33,018,878 raw paired-end reads for the guar varieties M-83 and RGC-1066, respectively. The sequence reads have been submitted to the NCBI-SRA database (Temporary Submission ID: SUB1380346). The mean read quality (Phred score) and % Q &#x0003E; 30 of these reads were &#x0007E;35 and 90%, respectively. The average read length was 100 bp for each variety (Supplementary Table <xref ref-type="supplementary-material" rid="SM1">S1</xref>). The cleaning and read orientation based pooling of the reads of both varieties resulted in a total of 42,777,004 (R1) and 59,940,380 (R2) clean reads with an average length of 88 bp. The <italic>de novo</italic> assembly of all the clean reads with the Trinity program (Grabherr et al., <xref ref-type="bibr" rid="B22">2011</xref>) generated 79,355 contigs. Clustering of the assembled sequences with the CD-HIT version 4.5.4 tool (Li and Godzik, <xref ref-type="bibr" rid="B34">2006</xref>) gave 62,146 unigenes with an average length of 679 bp and an N50 of 1035 bp (Table <xref ref-type="table" rid="T1">1</xref>). The shortest and longest unigenes were 201 and 29,056 bp, respectively. A total of 37,352 unigenes were shorter than 500 bp, whereas 24,794 unigenes were 500 bp or longer. A total of 11,593 unigenes were 1000 bp or longer and 237 unigenes were 5000 bp or longer (Figure <xref ref-type="fig" rid="F1">1A</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Statistics of <italic>de novo</italic> assembly of guar leaf transcriptome</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Characteristic</bold></th>
<th valign="top" align="center"><bold>Details</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Total number of contigs</td>
<td valign="top" align="center">62,146</td>
</tr>
<tr>
<td valign="top" align="left">Min length</td>
<td valign="top" align="center">201</td>
</tr>
<tr>
<td valign="top" align="left">Max length</td>
<td valign="top" align="center">29,056</td>
</tr>
<tr>
<td valign="top" align="left">Average length</td>
<td valign="top" align="center">679.36</td>
</tr>
<tr>
<td valign="top" align="left">Standard deviation</td>
<td valign="top" align="center">792.86</td>
</tr>
<tr>
<td valign="top" align="left">Median length</td>
<td valign="top" align="center">394.0</td>
</tr>
<tr>
<td valign="top" align="left">Total bases in contigs</td>
<td valign="top" align="center">42,219,607</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x0003C;500 pb</td>
<td valign="top" align="center">37,352</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x02265; 500 pb</td>
<td valign="top" align="center">24,794</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x02265; 1000 pb</td>
<td valign="top" align="center">11,593</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x02265; 2000 pb</td>
<td valign="top" align="center">3292</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x02265; 5000 pb</td>
<td valign="top" align="center">237</td>
</tr>
<tr>
<td valign="top" align="left">Number of contigs &#x02265; 10000 pb</td>
<td valign="top" align="center">38</td>
</tr>
<tr>
<td valign="top" align="left">N50</td>
<td valign="top" align="center">1035.0</td>
</tr>
<tr>
<td valign="top" align="left">Contigs in N50</td>
<td valign="top" align="center">11,028</td>
</tr>
<tr>
<td valign="top" align="left">GC content</td>
<td valign="top" align="center">43.68%</td>
</tr>
</tbody>
</table>
</table-wrap>
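<p>For readers unfamiliar with the N50 statistic reported in Table 1, the following minimal sketch shows one common way to compute it: contig lengths are sorted from longest to shortest and accumulated until half of the total assembly length is reached, and the length of the contig at which this happens is the N50. The function name <monospace>n50</monospace> and the toy length lists are illustrative assumptions, not the code used in this study.</p>
<preformat><![CDATA[
```python
def n50(lengths):
    """Return (N50, number of contigs needed to reach half the total length)."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # The first contig that pushes the running sum to at least half of
        # the total assembly length defines the N50.
        if running >= half_total:
            return length, count
    raise ValueError("no contig lengths given")


if __name__ == "__main__":
    # Hypothetical toy assembly: total length 1000, half is 500.
    print(n50([100, 200, 300, 400]))
```
]]></preformat>
<p>The second returned value counts the contigs needed to reach half of the total length; whether it matches the &#x0201C;Contigs in N50&#x0201D; row of Table 1 depends on the definition used by the assembly statistics tool.</p>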
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Functional annotation of guar leaf transcriptome. (A)</bold> Sequence distribution, <bold>(B)</bold> Blast hit distribution, <bold>(C)</bold> <italic>E</italic>-value distribution of BLAST hits for each unique sequence against the Nr database, <bold>(D)</bold> Similarity distributions of the top BLAST hits for each sequence against the Nr database, <bold>(E)</bold> Distribution of Blast2GO three step processes including BLASTX, mapping and annotation, and <bold>(F)</bold> Enzyme code distributions.</p></caption>
<graphic xlink:href="fpls-08-00091-g0001.tif"/>
</fig>
<p>The clean reads were mapped back to the assembled unigenes to assess the quality of the assembly. The overall alignment rate was 71%. Among the mapped reads, 74% mapped uniquely to the unigenes, while 11% mapped to multiple locations on the unigenes. In addition, analysis of the presence of CEGs revealed that the assembly contained 87.50% complete and 97.18% partial CEGs relative to the reference set of 248 CEGs (Table <xref ref-type="table" rid="T2">2</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Statistics of CEGMA results<xref ref-type="table-fn" rid="TN1"><sup>&#x00023;</sup></xref> of guar leaf transcriptome assembly</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center"><bold>Prots</bold></th>
<th valign="top" align="center"><bold>%Completeness</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
<th valign="top" align="center"><bold>Average</bold></th>
<th valign="top" align="center"><bold>%Ortho</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Complete</td>
<td valign="top" align="center">217</td>
<td valign="top" align="center">87.50</td>
<td valign="top" align="center">490</td>
<td valign="top" align="center">2.26</td>
<td valign="top" align="center">70.51</td>
</tr>
<tr>
<td valign="top" align="left">Group 1</td>
<td valign="top" align="center">55</td>
<td valign="top" align="center">83.33</td>
<td valign="top" align="center">131</td>
<td valign="top" align="center">2.38</td>
<td valign="top" align="center">78.18</td>
</tr>
<tr>
<td valign="top" align="left">Group 2</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">80.36</td>
<td valign="top" align="center">101</td>
<td valign="top" align="center">2.24</td>
<td valign="top" align="center">73.33</td>
</tr>
<tr>
<td valign="top" align="left">Group 3</td>
<td valign="top" align="center">55</td>
<td valign="top" align="center">90.16</td>
<td valign="top" align="center">129</td>
<td valign="top" align="center">2.35</td>
<td valign="top" align="center">67.27</td>
</tr>
<tr>
<td valign="top" align="left">Group 4</td>
<td valign="top" align="center">62</td>
<td valign="top" align="center">95.38</td>
<td valign="top" align="center">129</td>
<td valign="top" align="center">2.08</td>
<td valign="top" align="center">64.52</td>
</tr>
<tr>
<td valign="top" align="left">Partial</td>
<td valign="top" align="center">241</td>
<td valign="top" align="center">97.18</td>
<td valign="top" align="center">661</td>
<td valign="top" align="center">2.74</td>
<td valign="top" align="center">80.50</td>
</tr>
<tr>
<td valign="top" align="left">Group 1</td>
<td valign="top" align="center">64</td>
<td valign="top" align="center">96.97</td>
<td valign="top" align="center">176</td>
<td valign="top" align="center">2.75</td>
<td valign="top" align="center">82.81</td>
</tr>
<tr>
<td valign="top" align="left">Group 2</td>
<td valign="top" align="center">54</td>
<td valign="top" align="center">96.43</td>
<td valign="top" align="center">158</td>
<td valign="top" align="center">2.93</td>
<td valign="top" align="center">85.19</td>
</tr>
<tr>
<td valign="top" align="left">Group 3</td>
<td valign="top" align="center">58</td>
<td valign="top" align="center">95.08</td>
<td valign="top" align="center">164</td>
<td valign="top" align="center">2.83</td>
<td valign="top" align="center">72.41</td>
</tr>
<tr>
<td valign="top" align="left">Group 4</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">100.00</td>
<td valign="top" align="center">163</td>
<td valign="top" align="center">2.51</td>
<td valign="top" align="center">81.54</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1">
<label>&#x00023;</label>
<p><italic>These results are based on the set of genes selected by Genis Parra</italic>.</p></fn>
<p><italic>(Prots represents the number of the 248 ultra-conserved CEGs present in the assembly, % Completeness represents the percentage of the 248 ultra-conserved CEGs present, Total represents the total number of CEGs present including putative orthologs, Average represents the average number of orthologs per CEG and % Ortho represents the percentage of detected CEGs that have more than 1 ortholog)</italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Functional annotation of guar leaf transcriptome</title>
<p>The annotation of assembled leaf unigenes was done using BLASTX against the Nr, Uniref90, Pfam and Nt databases (Data sheet <xref ref-type="supplementary-material" rid="SM8">1</xref>), with an <italic>E</italic>-value cut off of 1e<sup>&#x02212;6</sup> (Figure <xref ref-type="fig" rid="F1">1B</xref>). The total numbers of hits obtained in Uniref90 and Nr databases were 44,992 and 45,972, respectively. Among the 62,146 unigenes, 44,268 (71.23%) had at least one significant match in blast hit results with an <italic>E</italic> &#x0003C; 1e<sup>&#x02212;6</sup>. Most of these unigenes were found to be protein coding genes. The <italic>E</italic>-value distribution analysis based on Nr database annotation results revealed that 72.29 and 56.65% of the matched sequences had strong homology with the <italic>E</italic>-values &#x0003C; 1e<sup>&#x02212;30</sup> and &#x0003C;1e<sup>&#x02212;45</sup>, respectively. Only 27.70% of the matched sequences showed high similarity with an <italic>E</italic>-value from 1e<sup>&#x02212;30</sup> to 1e<sup>&#x02212;6</sup> (Figure <xref ref-type="fig" rid="F1">1C</xref>). The similarity distribution analysis of the BLAST hits indicated that the sequences having a similarity higher than 80% were 66.34% whereas the sequences with a similarity ranging from 35 to 80% were only 33.65% (Figures <xref ref-type="fig" rid="F1">1D,E</xref>). The species distribution analysis revealed that the sequences homologous to guar unigenes were found in several plant species (Figure <xref ref-type="fig" rid="F2">2</xref>). The maximum similarity of 41.91% was found with <italic>Glycine max</italic>, followed by <italic>Phaseolus vulgaris</italic> (14.85%), <italic>Cicer arietinum</italic> (13.30%), <italic>Sphingomonas melonis</italic> (9.89%) and <italic>Medicago truncatula</italic> (6.34%). The comparison of assembled unigenes with closely related sequenced species was carried out by TRAPID analysis. 
Of the 62,146 assembled unigenes, 39,123 (63%), 34,744 (55.9%) and 35,263 (56.7%) showed similarity to <italic>G. max, M. truncatula</italic> and <italic>Lotus japonicus</italic>, respectively. The detailed results of the comparison with these three species, including the meta annotation, gene family and functional annotation information, are presented in Supplementary Table <xref ref-type="supplementary-material" rid="SM2">S2</xref>.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Species distribution results of guar leaf transcriptome. (A)</bold> By accounting all BLASTX hits and <bold>(B)</bold> Top hit species distribution based on BLASTX alignments.</p></caption>
<graphic xlink:href="fpls-08-00091-g0002.tif"/>
</fig>
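<p>The <italic>E</italic>-value filtering described above can be illustrated with a minimal Python sketch. It assumes that the BLASTX results are available in the standard 12-column tabular format, in which the query identifier is the first column and the <italic>E</italic>-value is the eleventh; the function name and the toy records are hypothetical and are not part of the original analysis pipeline.</p>
<preformat preformat-type="code"><![CDATA[
```python
import csv
import io

E_VALUE_CUTOFF = 1e-6  # cutoff stated in the text

def count_annotated(blast_tsv, total_unigenes, cutoff=E_VALUE_CUTOFF):
    """Count queries with at least one hit below the E-value cutoff.

    blast_tsv: iterable of tab-separated lines (12-column tabular BLAST
    output is assumed). Returns (n_annotated, percentage_of_total).
    """
    annotated = set()
    for row in csv.reader(blast_tsv, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query_id, evalue = row[0], float(row[10])
        if evalue < cutoff:
            annotated.add(query_id)
    return len(annotated), 100.0 * len(annotated) / total_unigenes

# Hypothetical toy records: uni1 has two hits, uni2 only a weak hit,
# uni3 one significant hit, and a fourth unigene has no hit at all.
TOY_BLAST = (
    "uni1\tP1\t90.0\t100\t10\t0\t1\t300\t1\t100\t1e-40\t200\n"
    "uni1\tP2\t70.0\t100\t30\t0\t1\t300\t1\t100\t1e-10\t80\n"
    "uni2\tP3\t40.0\t50\t30\t0\t1\t150\t1\t50\t0.5\t25\n"
    "uni3\tP4\t60.0\t80\t32\t0\t1\t240\t1\t80\t1e-7\t55\n"
)

n_hits, pct = count_annotated(io.StringIO(TOY_BLAST), total_unigenes=4)
print(n_hits, pct)

# Percentage reported in the text: 44,268 of 62,146 unigenes
print(round(100 * 44268 / 62146, 2))
```
]]></preformat>
<p>Collecting query identifiers in a set ensures that a unigene with several significant hits is counted only once.</p>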
<p>Based on sequence homology, 62,146 Trinity-assembled guar leaf unigenes were assigned GO terms. A total of 175,882 annotations were found on the basis of BLAST&#x0002B; results (Figure <xref ref-type="fig" rid="F3">3A</xref>). These GO terms were distributed into 46 functional groups, which were further classified into three categories, namely, cellular component, molecular function and biological process (Figure <xref ref-type="fig" rid="F4">4</xref>). The top GO terms were &#x0201C;metabolic process&#x0201D; (23,214), &#x0201C;cellular process&#x0201D; (21,230), &#x0201C;single-organism process&#x0201D; (17,550) and &#x0201C;biological regulation&#x0201D; (7295) in the biological process category. In the molecular function category, &#x0201C;catalytic activity&#x0201D; (18,275), &#x0201C;binding&#x0201D; (16,528) and &#x0201C;transporter activity&#x0201D; (2164) were major GO terms. In the cellular component category, &#x0201C;cell&#x0201D; (15,743), &#x0201C;membrane&#x0201D; (13,110), &#x0201C;organelle&#x0201D; (10,345) and &#x0201C;macromolecular complex&#x0201D; (4985) were mainly enriched. Only a few unigenes were classified in terms of &#x0201C;cell killing,&#x0201D; &#x0201C;behavior,&#x0201D; &#x0201C;protein tag,&#x0201D; &#x0201C;translation regulator activity,&#x0201D; &#x0201C;nutrient reservoir activity,&#x0201D; and &#x0201C;extracellular matrix.&#x0201D; Similar results were obtained by using WEGO tool (Figure <xref ref-type="fig" rid="F3">3B</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>GO-level distributions in guar leaf transcriptome. (A)</bold> P, F and C represent the biological process, molecular function and cellular component, respectively. Total Annotations &#x0003D; 175,882, Mean Level &#x0003D; 7.011, and <bold>(B)</bold> Classification of guar leaf transcripts into functional categories according to GO-terms on the basis of WEGO tool.</p></caption>
<graphic xlink:href="fpls-08-00091-g0003.tif"/>
</fig>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Classification of guar leaf transcripts into functional categories according to GO-terms</bold>.</p></caption>
<graphic xlink:href="fpls-08-00091-g0004.tif"/>
</fig>
<p>By searching against the available database, a total of 11,308 guar unigenes were annotated with various enzyme codes (Data sheet <xref ref-type="supplementary-material" rid="SM9">2</xref>). The annotated enzyme codes were grouped into six classes: Oxidoreductases (17.97%), Transferases (41.04%), Hydrolases (26.51%), Lyases (4.61%), Isomerases (3.61%), and Ligases (6.26%) as shown in Figure <xref ref-type="fig" rid="F1">1F</xref>.</p>
<p>Systematic high-level gene function analysis against KEGG database resulted in assigning biochemical pathways to 11,971 guar leaf unigenes. These unigenes were associated with 145 KEGG maps and 1759 enzyme codes (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">S3</xref>). The annotated unigenes were categorized into five major pathways in KEGG database&#x02014;&#x0201C;metabolism&#x0201D; (11,421), &#x0201C;genetic information processing&#x0201D; (132), &#x0201C;environmental information processing&#x0201D; (207), &#x0201C;organismal systems&#x0201D; (208), and &#x0201C;human diseases&#x0201D; (3). The &#x0201C;metabolism&#x0201D; was the most highly represented category which led to in-depth analysis of this group (Figure <xref ref-type="fig" rid="F5">5</xref>). The top five enriched pathways were &#x0201C;carbohydrate metabolism&#x0201D; (2933), &#x0201C;amino acid metabolism&#x0201D; (1754), &#x0201C;lipid metabolism&#x0201D; (1297), &#x0201C;nucleotide metabolism&#x0201D; (1094) and &#x0201C;energy metabolism&#x0201D; (1070).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Annotation of guar leaf transcriptome in KEGG database. (A)</bold> Distribution of unigenes into KEGG biological categories. <bold>(B)</bold> Classification of unigenes in KEGG &#x0201C;metabolism&#x0201D; category.</p></caption>
<graphic xlink:href="fpls-08-00091-g0005.tif"/>
</fig>
</sec>
<sec>
<title>Identification of differentially expressed genes</title>
<p>The two guar varieties, M-83 and RGC-1066, showed &#x0007E;80% similar gene expression in the leaf transcriptome. A total of 175 unigenes were found to be overexpressed by at least 30-fold in variety M-83. These unigenes were further annotated against the KEGG database, and 36 KEGG maps with 49 ECs were found. A total of 158 unigenes with 20-fold overexpression were found in variety RGC-1066, and only two KEGG maps with five ECs were annotated (Supplementary Table <xref ref-type="supplementary-material" rid="SM7">S7</xref>).</p>
</sec>
<sec>
<title>Identification of simple sequence repeats (SSRs)</title>
<p>Of the 62,146 unigenes assembled in the guar leaf transcriptome, 4970 were found to contain 5773 SSRs (Data sheets <xref ref-type="supplementary-material" rid="SM10">3</xref>, <xref ref-type="supplementary-material" rid="SM11">4</xref>). More than one SSR was present in 593 unigenes. On average, one SSR was found per 7.31 kb of unigene sequence. The SSRs comprised 2624 (45.45%) mononucleotide, 1179 (20.42%) dinucleotide, 1856 (32.14%) trinucleotide, 97 (1.68%) tetranucleotide, 7 (0.12%) pentanucleotide, and 10 (0.17%) hexanucleotide motifs (Figure <xref ref-type="fig" rid="F6">6</xref>). Most of the SSRs were not repeated more than 10 times. Only a small number of SSRs with more than 20 repeats were observed (Table <xref ref-type="table" rid="T3">3</xref>). For most dinucleotide SSRs, the repeat numbers varied from 6 to 11, with an average value of 9.92, while the repeat numbers of most of the pentanucleotide and hexanucleotide types were &#x0003C;6. When the mononucleotide SSRs were excluded, trinucleotide repeats were the most abundant type (1856).</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Simple sequence repeat (SSR) mining results in guar leaf transcriptome</bold>.</p></caption>
<graphic xlink:href="fpls-08-00091-g0006.tif"/>
</fig>
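<p>The principle of SSR mining can be sketched with a short regular-expression search in Python. This is only an illustration and not the tool used in this study; the minimum repeat numbers in <monospace>MIN_REPEATS</monospace> and the toy sequence are assumptions chosen for the example.</p>
<preformat preformat-type="code"><![CDATA[
```python
import re

# Minimum number of repeats per motif length (illustrative assumption,
# not necessarily the thresholds used in the study).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return a sorted list of (start, end, motif, n_repeats) for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len in sorted(min_repeats):
        n_min = min_repeats[unit_len]
        # A motif of unit_len bases followed by at least n_min - 1 copies
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, n_min - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA", "AGAG")
            if any(motif == motif[:k] * (unit_len // k)
                   for k in range(1, unit_len) if unit_len % k == 0):
                continue
            n_repeats = (match.end() - match.start()) // unit_len
            hits.append((match.start(), match.end(), motif, n_repeats))
    return sorted(hits)

# Hypothetical toy sequence containing (A)12, (AG)7 and (CTT)5
toy_seq = "GGT" + "A" * 12 + "CCT" + "AG" * 7 + "TTC" + "CTT" * 5 + "GA"
for hit in find_ssrs(toy_seq):
    print(hit)
```
]]></preformat>
<p>Positions are zero-based with an exclusive end. Only perfect repeats are reported in this sketch; compound and interrupted SSRs would require additional logic.</p>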
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Profiles of different SSR types in guar leaf transcriptome</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Repeat type</bold></th>
<th valign="top" align="center" colspan="3" style="border-bottom: thin solid #000000;"><bold>Repeat numbers</bold></th>
<th valign="top" align="center"><bold>Total</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>&#x02264; 6</bold></th>
<th valign="top" align="center"><bold>7 to 10</bold></th>
<th valign="top" align="center"><bold>&#x0003E;10</bold></th>
<th/>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Mononucleotide</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">2624</td>
<td valign="top" align="center">2624</td>
</tr>
<tr>
<td valign="top" align="left">Dinucleotide</td>
<td valign="top" align="center">559</td>
<td valign="top" align="center">578</td>
<td valign="top" align="center">42</td>
<td valign="top" align="center">1179</td>
</tr>
<tr>
<td valign="top" align="left">Trinucleotide</td>
<td valign="top" align="center">1636</td>
<td valign="top" align="center">218</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1856</td>
</tr>
<tr>
<td valign="top" align="left">Tetranucleotide</td>
<td valign="top" align="center">93</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">97</td>
</tr>
<tr>
<td valign="top" align="left">Pentanucleotide</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td valign="top" align="left">Hexanucleotide</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">10</td>
</tr>
</tbody>
</table>
</table-wrap>
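<p>The summary figures for the SSRs can be reproduced from Table 3 with the following short Python sketch. The total unigene length is approximated here as the number of unigenes multiplied by the reported average unigene length (679 bp); this approximation and the variable names are assumptions made for the illustration.</p>
<preformat preformat-type="code"><![CDATA[
```python
# Counts from Table 3: (<= 6 repeats, 7 to 10 repeats, > 10 repeats)
SSR_TABLE = {
    "mono": (0, 0, 2624),
    "di": (559, 578, 42),
    "tri": (1636, 218, 2),
    "tetra": (93, 3, 1),
    "penta": (6, 1, 0),
    "hexa": (9, 1, 0),
}
N_UNIGENES = 62146
MEAN_LENGTH_BP = 679  # average unigene length reported for the assembly

totals = {motif: sum(counts) for motif, counts in SSR_TABLE.items()}
total_ssrs = sum(totals.values())

# Approximate total assembly length in kb (assumption: count x mean length)
total_kb = N_UNIGENES * MEAN_LENGTH_BP / 1000
kb_per_ssr = total_kb / total_ssrs

# Share of di- and trinucleotide SSRs once mononucleotides are excluded
non_mono = total_ssrs - totals["mono"]
di_tri_share = 100 * (totals["di"] + totals["tri"]) / non_mono

print(total_ssrs)                # total number of SSRs
print(round(kb_per_ssr, 2))      # kb of unigene sequence per SSR
print(round(di_tri_share, 1))    # % di- plus trinucleotide among non-mono SSRs
```
]]></preformat>
<p>With these inputs the sketch gives 5773 SSRs and about 7.31 kb per SSR, in agreement with the values stated in the text.</p>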
<p>A total of 20 SSR markers representing all the repeat motifs (except mononucleotide repeats) in the <italic>de novo</italic> transcriptome assembly were selected for wet laboratory validation. The flanking primers were designed for SSR-containing sequences using the online tool Primer3. Five primer pairs each for dinucleotide, trinucleotide and tetranucleotide repeats, three primer pairs for pentanucleotide repeats and two primer pairs for hexanucleotide repeats were designed and synthesized. The details of the transcriptome sequence ID, motif type and SSR length are given in Supplementary Table <xref ref-type="supplementary-material" rid="SM4">S4</xref>. The details of the primers synthesized are shown in Supplementary Table <xref ref-type="supplementary-material" rid="SM5">S5</xref>. Out of the 20 primer pairs, 13 (GT-2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, and 18) resulted in PCR amplification in both guar varieties. Three primer pairs (GT-16, 17, and 19) showed amplification only in the variety RGC-1066, whereas the SSR primer pair GT-15 resulted in amplification only in the variety M-83. The SSR primer pair GT-17 showed amplification at a higher size than the theoretical amplicon size. No polymorphism was detected with the tested SSR primers. Some of the tested markers showed more than one band, which might be due to the presence of multiple sites complementary to the primers in the genomic DNA. The results of amplification of six primer pairs are shown in Figure <xref ref-type="fig" rid="F7">7</xref>; the PCR amplification results of the other primers are not shown. Overall, 65% (13 of 20) of the tested SSR primer pairs resulted in amplification in both target guar varieties, M-83 and RGC-1066.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p><bold>Banding pattern of SSR primers&#x00027; amplification on genomic DNA of guar</bold>. M represents 100 bp marker and Lanes 1 and 2 represent guar varieties M-83 and RGC-1066, respectively.</p></caption>
<graphic xlink:href="fpls-08-00091-g0007.tif"/>
</fig>
</sec>
<sec>
<title><italic>In silico</italic> identification of SSR polymorphism</title>
<p>The reads of each guar variety were mapped against the assembled unigenes to obtain sorted alignments (BAM files). The overall alignment rates were 89.44 and 91.69% for guar varieties M-83 and RGC-1066, respectively. The sorted alignments were then visualized against the reference in IGV 2.3 software and inspected manually to identify the nucleotide differences surrounding the SSR regions in both varieties. As a result, a total of 145 SSRs were found to be polymorphic between the two guar varieties (Supplementary Table <xref ref-type="supplementary-material" rid="SM6">S6</xref>). Two instances of <italic>in silico</italic> polymorphic SSRs are shown in Supplementary Figure <xref ref-type="supplementary-material" rid="SM13">S1</xref>.</p>
</sec>
<sec>
<title>Detection of single nucleotide polymorphisms (SNPs)</title>
<p>A total of 53,402 putative SNPs (&#x0007E;1 SNP per transcript) were identified, of which 8416 had a read depth of &#x0003E;5. These results showed that about one SNP was present for every 5.01 kb of leaf transcriptome in guar. A total of 3594 high-confidence SNPs were obtained after filtering for homozygous SNPs (Data sheet <xref ref-type="supplementary-material" rid="SM12">5</xref>). The statistical analysis of SNP loci was done for each variety against the assembled transcripts. This resulted in 65.25% transitions and 34.75% transversions in guar variety M-83. In variety RGC-1066, 61.36% transitions and 38.64% transversions were found. The statistical information of SNPs in guar varieties M-83 and RGC-1066 against the reference is shown in Figure <xref ref-type="fig" rid="F8">8</xref>. In addition, 2930 and 3984 Insertion-Deletion (InDel) variants were found in the varieties M-83 and RGC-1066, respectively.</p>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p><bold>Statistical information of SNPs in guar varieties M-83 and RGC-1066 against <italic>de novo</italic> assembly</bold>.</p></caption>
<graphic xlink:href="fpls-08-00091-g0008.tif"/>
</fig>
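<p>The SNP statistics given above can be illustrated with a minimal Python sketch that classifies a substitution as a transition or a transversion and derives the transition/transversion ratio and the SNP density. The function names are hypothetical, and the total transcriptome length is approximated as the number of unigenes multiplied by the reported average unigene length (679 bp), which is an assumption of this example.</p>
<preformat preformat-type="code"><![CDATA[
```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def classify_substitution(ref, alt):
    """Return 'transition' or 'transversion' for a single-base substitution."""
    ref, alt = ref.upper(), alt.upper()
    bases = PURINES | PYRIMIDINES
    if ref == alt or ref not in bases or alt not in bases:
        raise ValueError("expected two different bases from A, C, G, T")
    # Transition: purine to purine or pyrimidine to pyrimidine
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"

def ts_tv_ratio(pct_transitions, pct_transversions):
    """Ratio of transitions to transversions from their percentages."""
    return pct_transitions / pct_transversions

def kb_per_snp(n_unigenes, mean_length_bp, n_snps):
    """Approximate kb of transcriptome per SNP (length = count x mean length)."""
    return n_unigenes * mean_length_bp / 1000 / n_snps

print(classify_substitution("A", "G"))        # purine to purine
print(classify_substitution("A", "T"))        # purine to pyrimidine
print(round(ts_tv_ratio(65.25, 34.75), 2))    # variety M-83
print(round(ts_tv_ratio(61.36, 38.64), 2))    # variety RGC-1066
print(round(kb_per_snp(62146, 679, 8416), 2)) # SNPs with read depth > 5
```
]]></preformat>
<p>Under these assumptions, the density obtained from the 8416 SNPs with a read depth of &#x0003E;5 is about 5.01 kb per SNP, which matches the value reported above.</p>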
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Guar (<italic>Cyamopsis</italic>) is an exclusively diploid (2n &#x0003D; 14) genus with haploid chromosome number 7. The genome sizes (4C DNA contents) of its three species, viz., <italic>C. tetragonoloba, C. serrata</italic>, and <italic>C. senegalensis</italic>, have been reported to be 10.05, 20.35, and 18.19 pg, respectively (Patil, <xref ref-type="bibr" rid="B48">2004</xref>). The nuclear genomes of legumes vary greatly in size, from 370 million base pairs (Mbp) in <italic>Lablab niger</italic> to more than 13,000 Mbp in <italic>Vicia faba</italic>. Most of the cultivated species are modest in genome size; mung bean, cowpea, common bean, chickpea, and clover all have haploid genomes smaller than 1000 Mbp (Young et al., <xref ref-type="bibr" rid="B83">2003</xref>). The genome of <italic>Cyamopsis</italic>, in comparison to the other legumes, is intermediate in size. Despite the intermediate size of the guar genome, very few studies have been done on the molecular genetics of this crop.</p>
<p>All genetic improvement programs in guar have been carried out till now using conventional breeding without the involvement of molecular markers. As a result, only a limited success has been achieved in obtaining improved guar varieties. Marker assisted breeding, especially with SSRs and SNPs, has given excellent results in several other crops (Rafalski, <xref ref-type="bibr" rid="B51">2002</xref>; Kesawat and Kumar, <xref ref-type="bibr" rid="B26">2009</xref>; Hiremath et al., <xref ref-type="bibr" rid="B23">2012</xref>). Such breeding programs have not been possible in guar due to the lack of sufficient number of SSRs (Kuravadi et al., <xref ref-type="bibr" rid="B31">2014</xref>; Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>) and the complete absence of SNPs. This has happened due to the limited availability of genetic resources in this crop. NGS technologies provide novel opportunities not only in functional genomics and gene discovery but also in developing huge genetic resources in non-model plants (Wang et al., <xref ref-type="bibr" rid="B72">2009</xref>). These technologies have been widely used for the development of molecular markers through transcriptome analysis in several plant species (Dutta et al., <xref ref-type="bibr" rid="B17">2011</xref>; Wang et al., <xref ref-type="bibr" rid="B73">2011</xref>, <xref ref-type="bibr" rid="B74">2014</xref>).</p>
<p>The present study was performed on two guar varieties, one gum producing variety having hairy leaves (RGC-1066) and the other vegetable variety having pubescent leaves (M-83). Approximately 60 million high-quality sequence reads from the leaf tissues of both guar varieties were assembled to generate 62,146 unigene contigs, which represented a large fraction of the guar transcriptome and helped in the identification of a comprehensive set of genic markers. The <italic>de novo</italic> assembly indicated good coverage as well as depth of the sequencing data. The CEGMA software was used to assess the completeness of the transcriptome assembly by evaluating the presence and completeness of the widely conserved set of 248 CEGs. These CEGs represent proteins mostly coded by housekeeping genes and can therefore be expected to be expressed (Parra et al., <xref ref-type="bibr" rid="B44">2007</xref>, <xref ref-type="bibr" rid="B45">2009</xref>; Nakasugi et al., <xref ref-type="bibr" rid="B40">2013</xref>). The CEGMA analysis revealed that the assembly had 87.50% complete and 97.18% partial CEGs. Similar results were obtained in the <italic>de novo</italic> transcriptome assembly of <italic>Nicotiana benthamiana</italic> (Nakasugi et al., <xref ref-type="bibr" rid="B40">2013</xref>). Hence, the <italic>de novo</italic> assembly obtained in this work was appropriate for functional annotation and identification of genic markers.</p>
<p>Guar being a non-model plant and without any prior genome information, sequence similarity search and comparison for the assembled unigenes of guar leaf transcriptome were carried out by BLASTX against several databases. The total numbers of hits obtained in Uniref90 and Nr databases were 44,992 and 45,972, respectively. Among the 62,146 unigenes, 71.23% had at least one significant match in blast hit results with an <italic>E</italic> &#x0003C; 1e<sup>&#x02212;6</sup> showing that most of the unigenes code for proteins. The unigenes that had no significant matches may be lacking a known conserved functional domain or are representing non-coding RNAs. Another explanation could be that these unigenes, despite containing a known protein domain, do not show sequence matches as they are very short (Wu et al., <xref ref-type="bibr" rid="B75">2015</xref>). Moreover, as very little genomic and transcriptomic information is available for guar, many guar lineage specific genes may not be present in the available databases. The part of sequences showing no hits might be of great interest for further research for alternative splice variants, novel gene products and differentially expressed genes. As per species distribution analysis a number of sequences homologous to guar leaf sequences are present in many plant species. Among these plant species <italic>G. max</italic> genes have the highest similarity (41.91%) with guar unigenes. Hence for the transcriptome analysis of guar, the genome of <italic>G. max</italic> may serve as a reference.</p>
<p>The GO database is an important resource as GO terms provide a set of dynamically controlled and structured vocabularies for describing the roles of genes in any organism (Ashburner et al., <xref ref-type="bibr" rid="B4">2000</xref>). Based on sequence homology, 62,146 guar leaf transcriptome unigenes were assigned GO terms and classified into three main categories, namely, cellular component, molecular function, and biological process. The annotation of guar unigenes with enzyme codes revealed that non-specific serine/threonine protein kinases, phosphoprotein phosphatases, and RNA helicases were most abundant. The above findings are consistent with the other plant leaf transcriptome studies (Wu et al., <xref ref-type="bibr" rid="B75">2015</xref>; Bose Mazumdar and Chattopadhyay, <xref ref-type="bibr" rid="B8">2016</xref>). The identification of several enzyme codes of guar in this work is likely to be helpful in understanding various metabolic activities of this industrially important crop.</p>
<p>The gene function analysis against the KEGG database revealed that 11,971 guar leaf unigenes were assigned to 145 KEGG pathways and 1759 enzyme codes. It was observed that more than one unigene was annotated with the same enzyme in our dataset. A similar pattern was also found in the <italic>P. amarus</italic> leaf transcriptome (Bose Mazumdar and Chattopadhyay, <xref ref-type="bibr" rid="B8">2016</xref>). Transcriptome profiling by RNA-Seq has enabled comparison of transcriptional variation in the two guar varieties. Both varieties showed &#x0007E;80% similar gene expression in the leaf transcriptome. A direct comparison of gene expression would require a meta-analysis (Bhargava et al., <xref ref-type="bibr" rid="B7">2013</xref>) to gain better insight into the functions of genes specifically and commonly involved in various leaf characteristics.</p>
<p>Our main goal in this study was to identify genic markers that can be readily used in breeding programs. Among various molecular markers, SSRs and SNPs are the most useful ones for genetics and plant breeding applications (Hiremath et al., <xref ref-type="bibr" rid="B23">2012</xref>). In the present study, two sets of molecular markers, SSRs and SNPs, were identified using the transcriptome dataset of guar leaves. Transcriptome based markers are advantageous as compared to the markers in non-transcribed regions due to their high amplification rates and cross-species transferability (Barbar&#x000E1; et al., <xref ref-type="bibr" rid="B6">2007</xref>). A total of 5773 potential SSRs were identified with an average of one SSR per 7.31 kb in the unigenes. This result was consistent with the previous EST-SSR report in guar with an occurrence (kb/SSR) of 7.9 (Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>), while Kuravadi et al. reported an occurrence of 4.1 using the same dataset (Kuravadi et al., <xref ref-type="bibr" rid="B31">2014</xref>). The occurrence of genic-SSRs was also comparable to 8.4 in pigeonpea, 3.4 in rice, 5.4 in wheat, and 7.4 in soybean (Cardle et al., <xref ref-type="bibr" rid="B11">2000</xref>; Peng and Lapitan, <xref ref-type="bibr" rid="B49">2005</xref>; Dutta et al., <xref ref-type="bibr" rid="B17">2011</xref>). The differences in genic-SSR abundance may be due to the size of the EST or unigene assembly dataset, and to different data mining tools and criteria (Varshney et al., <xref ref-type="bibr" rid="B66">2005a</xref>). The frequency distribution of SSR markers is in agreement with previous reports in guar (Kuravadi et al., <xref ref-type="bibr" rid="B31">2014</xref>; Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>). If the mononucleotide SSRs are excluded because of the frequent homopolymer errors found in sequencing data, a large proportion was covered by di- and trinucleotides (96%) while the rest amounted to &#x0003C;4%.
This is consistent with the EST-SSR distributions reported in many legumes (Wang et al., <xref ref-type="bibr" rid="B74">2014</xref>). A similar trend was observed in other plant species (Sonah et al., <xref ref-type="bibr" rid="B58">2011</xref>; Ahn et al., <xref ref-type="bibr" rid="B1">2013</xref>). The trinucleotide repeats, which are more frequently detected in coding regions, have been reported to be the most abundant (Yu et al., <xref ref-type="bibr" rid="B84">2011</xref>). The abundance of trinucleotide motifs may be attributed to the suppression of expansion or contraction of dinucleotide repeats in exons, which would otherwise cause deleterious frame-shift mutations in translated regions (Xin et al., <xref ref-type="bibr" rid="B76">2012</xref>). Trinucleotide repeats are generally more robust since they are reported to give fewer &#x0201C;stutter bands&#x0201D; than the dinucleotide repeats. They have also been reported to be highly polymorphic and stably inherited (Yang et al., <xref ref-type="bibr" rid="B78">2012</xref>). The 5773 potential SSRs identified from the <italic>de novo</italic> transcriptome sequencing data of guar leaf represent a significant addition to the limited set of genic-SSR markers available in guar.</p>
<p>The results of SSR marker validation showed that 13 of the 20 tested SSR primer pairs resulted in amplification in both target guar varieties, M-83 and RGC-1066. The lack of amplification of 7 SSR markers could be because of the flanking primers extending across a splice site with a large intron or because of chimeric cDNA contigs (Varshney et al., <xref ref-type="bibr" rid="B67">2006</xref>). Some of the tested markers showed more than one band, which might be due to the presence of multiple sites complementary to the primers in the genomic DNA. None of the tested markers showed distinct polymorphism. This may be due to a small product size difference or an actual lack of polymorphism, as earlier reported in pigeonpea (Dutta et al., <xref ref-type="bibr" rid="B17">2011</xref>). Overall, 65% of the tested SSRs were validated successfully by wet laboratory analysis. These results are consistent with those in barley, where 67&#x02013;70% of the primers showed amplification (Thiel et al., <xref ref-type="bibr" rid="B60">2003</xref>; Varshney et al., <xref ref-type="bibr" rid="B67">2006</xref>). The amplification success rate was higher than that reported in sugarcane (48%) and lower than that in flax (92%; Cordeiro et al., <xref ref-type="bibr" rid="B14">2001</xref>; Cloutier et al., <xref ref-type="bibr" rid="B12">2009</xref>). <italic>In silico</italic> polymorphism analysis of the SSR markers was done using IGV software (Thorvaldsd&#x000F3;ttir et al., <xref ref-type="bibr" rid="B61">2013</xref>). A total of 145 out of 5773 SSR markers were identified as <italic>in silico</italic> polymorphic in the guar varieties M-83 and RGC-1066. This result is in agreement with the reports in pigeonpea (Dutta et al., <xref ref-type="bibr" rid="B17">2011</xref>).
Taken together with the previous SSR polymorphism studies, it can be concluded that genetic diversity in the guar gene pool is very low (Kuravadi et al., <xref ref-type="bibr" rid="B31">2014</xref>; Kumar et al., <xref ref-type="bibr" rid="B29">2016</xref>).</p>
<p>A total of 53,402 putative SNPs (&#x0007E;1 SNP per transcript) were detected in the two guar varieties M-83 and RGC-1066. The putative SNPs were screened for a minimum depth of five reads with the same homozygous allele. The screening process might have reduced the sensitivity in detecting rare SNPs, but the probability of true SNP detection was increased due to the reduced chances of inclusion of false variants that arise from sequencing errors. The high-confidence differences comprised 3594 SNPs after screening for the SNP density. SNPs are genetic markers which are bi-allelic in nature, besides being highly abundant and less prone to mutations as compared to SSRs. They can contribute directly to a phenotype or can be associated with a phenotype as a result of linkage disequilibrium (Neff et al., <xref ref-type="bibr" rid="B42">1998</xref>). In plants, SNPs are particularly useful in the construction of high resolution genetic maps, the positional cloning of target loci, marker assisted breeding of important genes, genome-wide large-scale linkage disequilibrium association analysis, DNA fingerprinting, and species origin, relationship and evolutionary studies (Shahinnia and Sayed-Tabatabaei, <xref ref-type="bibr" rid="B55">2009</xref>). Most conventional molecular markers, such as restriction fragment length polymorphism (RFLP) and cleaved amplified polymorphic sequence (CAPS), are based on SNPs, i.e., nucleotide substitutions or insertions/deletions (Nasu et al., <xref ref-type="bibr" rid="B41">2002</xref>). The existence of a restriction site difference spanning the SNPs between the varieties/lines to be analyzed is essential for converting SNPs to CAPS markers. However, Michaels and Amasino (<xref ref-type="bibr" rid="B36">1998</xref>) and Neff et al.
(<xref ref-type="bibr" rid="B42">1998</xref>) demonstrated that single-base changes generating no restriction site differences could be employed for the development of PCR-based markers by the derived CAPS (dCAPS) method. Like the CAPS markers, the dCAPS markers are simple and relatively inexpensive to identify (Neff et al., <xref ref-type="bibr" rid="B42">1998</xref>).</p>
<p>Statistical analysis of SNP loci resulted in 65.25% transitions and 34.75% transversions in guar variety M-83. In variety RGC-1066, 61.36% transitions and 38.64% transversions were found. This finding is in agreement with red pepper transcriptome profiling (Lu et al., <xref ref-type="bibr" rid="B35">2012</xref>). The higher occurrence of transitions in comparison to transversions is in accordance with the known transition/transversion rate bias. Transitions (T&#x02194;C and A&#x02194;G) have been found to occur at higher frequencies than transversions or all other changes in almost all studied genomes (Gojobori et al., <xref ref-type="bibr" rid="B21">1982</xref>; Wakeley, <xref ref-type="bibr" rid="B70">1994</xref>, <xref ref-type="bibr" rid="B71">1996</xref>; Yang and Bielawski, <xref ref-type="bibr" rid="B79">2000</xref>). The detection of transition/transversion rate bias is important for understanding the patterns of DNA sequence evolution and for phylogeny reconstruction (Yang and Yoder, <xref ref-type="bibr" rid="B80">1999</xref>).</p>
<p>This study is the first report of transcriptome analysis and SNP detection in the guar crop. The large number of SSRs and SNPs identified in this study provides a wealth of potential markers in this crop. These results are expected to open new opportunities for population genetics, linkage mapping, comparative genomics and marker-assisted breeding in guar.</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusions</title>
<p>The transcriptome sequencing of leaf tissues from two guar varieties, namely, M-83 and RGC-1066, was done using Illumina HiSeq technology. Approximately 30 million paired-end reads of each variety were used to generate a <italic>de novo</italic> assembly of 62,146 unigenes with an average length of 679 bp. The assembled unigenes were functionally annotated against the non-redundant protein (Nr), Gene Ontology (GO), and KEGG databases. Genic marker identification resulted in a total of 5773 potential SSRs and 3594 high-quality SNPs. Twenty SSRs were tested by wet laboratory analysis, of which 13 amplified in both varieties, and 145 SSRs were found to be polymorphic by <italic>in silico</italic> polymorphism detection. Taken together, this study not only reports the first transcriptomic dataset and SNPs in guar, but also provides the largest genetic resource in this crop for marker-assisted breeding, functional genomics, and proteomics research in future.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>UT planned the experiments, did experimental work, analyzed the data, made conclusions, and wrote the paper. VP planned the experiments, interpreted the results and gave suggestions on the manuscript. GR planned the experiments, interpreted the results, and corrected the manuscript.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ack><p>We would like to thank Prof. Sudesh Kumar, RARI, Jaipur for providing the seed material and Prof. Kanwarpal Singh Dhugga, CIMMYT, Mexico and Prof. Gurmukh Singh Johal, Purdue University, USA for useful suggestions. We are grateful to Navneet K. Sekhon and Deepa Dewan, research scholars, for their useful suggestions in the primer synthesis and validation of SSR markers. Financial support for this work in the form of fellowship to UT by the Department of Biotechnology, Govt. of India, is gratefully acknowledged.</p>
</ack>
<sec sec-type="supplementary-material" id="s7">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://journal.frontiersin.org/article/10.3389/fpls.2017.00091/full#supplementary-material">http://journal.frontiersin.org/article/10.3389/fpls.2017.00091/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table2.DOCX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table3.DOC" id="SM3" mimetype="application/msword" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table4.DOCX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table5.DOCX" id="SM5" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table6.DOCX" id="SM6" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table7.DOC" id="SM7" mimetype="application/msword" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.CSV" id="SM8" mimetype="text/csv" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet2.XLSX" id="SM9" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet3.XLSX" id="SM10" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet4.XLSX" id="SM11" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet5.XLSX" id="SM12" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image1.tif" id="SM13" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure S1</label>
<caption><p><bold>Instances of <italic>in silico</italic> identified polymorphic SSR markers. (A)</bold> comp9618_c0_seq1 106-155 and <bold>(B)</bold> comp11342_c0_seq1 2,182-2,233.</p></caption></supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ahn</surname> <given-names>Y. K.</given-names></name> <name><surname>Tripathi</surname> <given-names>S.</given-names></name> <name><surname>Cho</surname> <given-names>Y. I.</given-names></name> <name><surname>Kim</surname> <given-names>J. H.</given-names></name> <name><surname>Lee</surname> <given-names>H. E.</given-names></name> <name><surname>Kim</surname> <given-names>D. S.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title><italic>De novo</italic> transcriptome assembly and novel microsatellite marker information in <italic>Capsicum annuum</italic> varieties Saengryeg 211 and Saengryeg 213</article-title>. <source>Bot. Stud.</source> <volume>54</volume>, <fpage>1</fpage>&#x02013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1186/1999-3110-54-58</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname> <given-names>S. F.</given-names></name> <name><surname>Madden</surname> <given-names>T. L.</given-names></name> <name><surname>Sch&#x000E4;ffer</surname> <given-names>A. A.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Miller</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>1997</year>). <article-title>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</article-title>. <source>Nucleic Acids Res.</source> <volume>25</volume>, <fpage>3389</fpage>&#x02013;<lpage>3402</lpage>. <pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id><pub-id pub-id-type="pmid">9254694</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Andrews</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <source>FastQC: A Quality Control Tool for High Throughput Sequence Data</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.bioinformatics.babraham.ac.uk/projects/fastqc/">http://www.bioinformatics.babraham.ac.uk/projects/fastqc/</ext-link></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashburner</surname> <given-names>M.</given-names></name> <name><surname>Ball</surname> <given-names>C. A.</given-names></name> <name><surname>Blake</surname> <given-names>J. A.</given-names></name> <name><surname>Botstein</surname> <given-names>D.</given-names></name> <name><surname>Butler</surname> <given-names>H.</given-names></name> <name><surname>Cherry</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2000</year>). <article-title>Gene Ontology: tool for the unification of biology</article-title>. <source>Nat. Genet.</source> <volume>25</volume>, <fpage>25</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1038/75556</pub-id><pub-id pub-id-type="pmid">10802651</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Azam</surname> <given-names>S.</given-names></name> <name><surname>Rathore</surname> <given-names>A.</given-names></name> <name><surname>Shah</surname> <given-names>T. M.</given-names></name> <name><surname>Telluri</surname> <given-names>M.</given-names></name> <name><surname>Amindala</surname> <given-names>B.</given-names></name> <name><surname>Ruperao</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data</article-title>. <source>PLoS ONE</source> <volume>9</volume>:<fpage>e101754</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0101754</pub-id><pub-id pub-id-type="pmid">25003610</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barbar&#x000E1;</surname> <given-names>T.</given-names></name> <name><surname>Palma-Silva</surname> <given-names>C.</given-names></name> <name><surname>Paggi</surname> <given-names>G. M.</given-names></name> <name><surname>Bered</surname> <given-names>F.</given-names></name> <name><surname>Fay</surname> <given-names>M. F.</given-names></name> <name><surname>Lexer</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Cross-species transfer of nuclear microsatellite markers: potential and limitations</article-title>. <source>Mol. Ecol.</source> <volume>16</volume>, <fpage>3759</fpage>&#x02013;<lpage>3767</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-294X.2007.03439.x</pub-id><pub-id pub-id-type="pmid">17850543</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhargava</surname> <given-names>A.</given-names></name> <name><surname>Clabaugh</surname> <given-names>I.</given-names></name> <name><surname>To</surname> <given-names>J. P.</given-names></name> <name><surname>Maxwell</surname> <given-names>B. B.</given-names></name> <name><surname>Chiang</surname> <given-names>Y.-H.</given-names></name> <name><surname>Schaller</surname> <given-names>G. E.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Identification of cytokinin-responsive genes using microarray meta-analysis and RNA-Seq in Arabidopsis</article-title>. <source>Plant Physiol.</source> <volume>162</volume>, <fpage>272</fpage>&#x02013;<lpage>294</lpage>. <pub-id pub-id-type="doi">10.1104/pp.113.217026</pub-id><pub-id pub-id-type="pmid">23524861</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bose Mazumdar</surname> <given-names>A.</given-names></name> <name><surname>Chattopadhyay</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Sequencing, <italic>de novo</italic> assembly, functional annotation and analysis of <italic>Phyllanthus amarus</italic> leaf transcriptome using the Illumina platform</article-title>. <source>Front. Plant Sci.</source> <volume>6</volume>:<fpage>1199</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2015.01199</pub-id><pub-id pub-id-type="pmid">26858723</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butt</surname> <given-names>M. S.</given-names></name> <name><surname>Shahzadi</surname> <given-names>N.</given-names></name> <name><surname>Sharif</surname> <given-names>M. K.</given-names></name> <name><surname>Nasir</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>Guar gum: a miracle therapy for hypercholesterolemia, hyperglycemia and obesity</article-title>. <source>Crit. Rev. Food Sci. Nutr.</source> <volume>47</volume>, <fpage>389</fpage>&#x02013;<lpage>396</lpage>. <pub-id pub-id-type="doi">10.1080/10408390600846267</pub-id><pub-id pub-id-type="pmid">17457723</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camacho</surname> <given-names>C.</given-names></name> <name><surname>Coulouris</surname> <given-names>G.</given-names></name> <name><surname>Avagyan</surname> <given-names>V.</given-names></name> <name><surname>Ma</surname> <given-names>N.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>J.</given-names></name> <name><surname>Bealer</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>BLAST&#x0002B;: architecture and applications</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>:<fpage>421</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id><pub-id pub-id-type="pmid">20003500</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cardle</surname> <given-names>L.</given-names></name> <name><surname>Ramsay</surname> <given-names>L.</given-names></name> <name><surname>Milbourne</surname> <given-names>D.</given-names></name> <name><surname>Macaulay</surname> <given-names>M.</given-names></name> <name><surname>Marshall</surname> <given-names>D.</given-names></name> <name><surname>Waugh</surname> <given-names>R.</given-names></name></person-group> (<year>2000</year>). <article-title>Computational and experimental characterization of physically clustered simple sequence repeats in plants</article-title>. <source>Genetics</source> <volume>156</volume>, <fpage>847</fpage>&#x02013;<lpage>854</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.genetics.org/content/156/2/847">http://www.genetics.org/content/156/2/847</ext-link> <pub-id pub-id-type="pmid">11014830</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cloutier</surname> <given-names>S.</given-names></name> <name><surname>Niu</surname> <given-names>Z.</given-names></name> <name><surname>Datla</surname> <given-names>R.</given-names></name> <name><surname>Duguid</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>Development and analysis of EST-SSRs for flax (<italic>Linum usitatissimum</italic> L.)</article-title>. <source>Theor. Appl. Genet.</source> <volume>119</volume>, <fpage>53</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1007/s00122-009-1016-3</pub-id><pub-id pub-id-type="pmid">19357828</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conesa</surname> <given-names>A.</given-names></name> <name><surname>G&#x000F6;tz</surname> <given-names>S.</given-names></name> <name><surname>Garc&#x000ED;a-G&#x000F3;mez</surname> <given-names>J. M.</given-names></name> <name><surname>Terol</surname> <given-names>J.</given-names></name> <name><surname>Tal&#x000F3;n</surname> <given-names>M.</given-names></name> <name><surname>Robles</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>3674</fpage>&#x02013;<lpage>3676</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti610</pub-id><pub-id pub-id-type="pmid">16081474</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cordeiro</surname> <given-names>G. M.</given-names></name> <name><surname>Casu</surname> <given-names>R.</given-names></name> <name><surname>McIntyre</surname> <given-names>C. L.</given-names></name> <name><surname>Manners</surname> <given-names>J. M.</given-names></name> <name><surname>Henry</surname> <given-names>R. J.</given-names></name></person-group> (<year>2001</year>). <article-title>Microsatellite markers from sugarcane (<italic>Saccharum</italic> spp.) ESTs cross transferable to erianthus and sorghum</article-title>. <source>Plant Sci.</source> <volume>160</volume>, <fpage>1115</fpage>&#x02013;<lpage>1123</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-9452(01)00365-X</pub-id><pub-id pub-id-type="pmid">11337068</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dhugga</surname> <given-names>K. S.</given-names></name> <name><surname>Barreiro</surname> <given-names>R.</given-names></name> <name><surname>Whitten</surname> <given-names>B.</given-names></name> <name><surname>Stecca</surname> <given-names>K.</given-names></name> <name><surname>Hazebroek</surname> <given-names>J.</given-names></name> <name><surname>Randhawa</surname> <given-names>G. S.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Guar seed beta-mannan synthase is a member of the cellulose synthase super gene family</article-title>. <source>Science</source> <volume>303</volume>, <fpage>363</fpage>&#x02013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1126/science.1090908</pub-id><pub-id pub-id-type="pmid">14726589</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Doyle</surname> <given-names>J. J.</given-names></name> <name><surname>Doyle</surname> <given-names>J. L.</given-names></name></person-group> (<year>1990</year>). <article-title>Isolation of DNA from small amounts of plant tissues</article-title>. <source>BRL Focus</source> <volume>12</volume>, <fpage>13</fpage>&#x02013;<lpage>15</lpage>.</citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dutta</surname> <given-names>S.</given-names></name> <name><surname>Kumawat</surname> <given-names>G.</given-names></name> <name><surname>Singh</surname> <given-names>B. P.</given-names></name> <name><surname>Gupta</surname> <given-names>D. K.</given-names></name> <name><surname>Singh</surname> <given-names>S.</given-names></name> <name><surname>Dogra</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [<italic>Cajanus cajan</italic> (L.) Millspaugh]</article-title>. <source>BMC Plant Biol.</source> <volume>11</volume>:<fpage>17</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2229-11-17</pub-id><pub-id pub-id-type="pmid">21251263</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dwivedi</surname> <given-names>N. K.</given-names></name> <name><surname>Bhandari</surname> <given-names>D. C.</given-names></name> <name><surname>Dubas</surname> <given-names>B. S.</given-names></name> <name><surname>Agrawal</surname> <given-names>R. C.</given-names></name> <name><surname>Mandal</surname> <given-names>S.</given-names></name> <name><surname>Rana</surname> <given-names>R. S.</given-names></name></person-group> (<year>1995</year>). <source>Catalogue on Cluster Bean (Cyamopsis tetragonoloba (L.) Taub) Germplasm Part III</source>. <publisher-loc>New Delhi</publisher-loc>: <publisher-name>NBPGR</publisher-name>.</citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finn</surname> <given-names>R. D.</given-names></name> <name><surname>Bateman</surname> <given-names>A.</given-names></name> <name><surname>Clements</surname> <given-names>J.</given-names></name> <name><surname>Coggill</surname> <given-names>P.</given-names></name> <name><surname>Eberhardt</surname> <given-names>R. Y.</given-names></name> <name><surname>Eddy</surname> <given-names>S. R.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Pfam: the protein families database</article-title>. <source>Nucleic Acids Res.</source> <volume>42</volume>, <fpage>D222</fpage>&#x02013;<lpage>D230</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkt1223</pub-id><pub-id pub-id-type="pmid">24288371</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giannini</surname> <given-names>E. G.</given-names></name> <name><surname>Mansi</surname> <given-names>C.</given-names></name> <name><surname>Dulbecco</surname> <given-names>P.</given-names></name> <name><surname>Savarino</surname> <given-names>V.</given-names></name></person-group> (<year>2006</year>). <article-title>Role of partially hydrolyzed guar gum in the treatment of irritable bowel syndrome</article-title>. <source>Nutrition</source> <volume>22</volume>, <fpage>334</fpage>&#x02013;<lpage>342</lpage>. <pub-id pub-id-type="doi">10.1016/j.nut.2005.10.003</pub-id><pub-id pub-id-type="pmid">16413751</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gojobori</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>W. H.</given-names></name> <name><surname>Graur</surname> <given-names>D.</given-names></name></person-group> (<year>1982</year>). <article-title>Patterns of nucleotide substitution in pseudogenes and functional genes</article-title>. <source>J. Mol. Evol.</source> <volume>18</volume>, <fpage>360</fpage>&#x02013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.1007/BF01733904</pub-id><pub-id pub-id-type="pmid">7120431</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grabherr</surname> <given-names>M. G.</given-names></name> <name><surname>Haas</surname> <given-names>B. J.</given-names></name> <name><surname>Yassour</surname> <given-names>M.</given-names></name> <name><surname>Levin</surname> <given-names>J. Z.</given-names></name> <name><surname>Thompson</surname> <given-names>D. A.</given-names></name> <name><surname>Amit</surname> <given-names>I.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Full-length transcriptome assembly from RNA-Seq data without a reference genome</article-title>. <source>Nat. Biotechnol.</source> <volume>29</volume>, <fpage>644</fpage>&#x02013;<lpage>652</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.1883</pub-id><pub-id pub-id-type="pmid">21572440</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hiremath</surname> <given-names>P. J.</given-names></name> <name><surname>Kumar</surname> <given-names>A.</given-names></name> <name><surname>Penmetsa</surname> <given-names>R. V.</given-names></name> <name><surname>Farmer</surname> <given-names>A.</given-names></name> <name><surname>Schlueter</surname> <given-names>J. A.</given-names></name> <name><surname>Chamarthi</surname> <given-names>S. K.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Large-scale development of cost-effective SNP marker assays for diversity assessment and genetic mapping in chickpea and comparative mapping in legumes</article-title>. <source>Plant Biotechnol. J.</source> <volume>10</volume>, <fpage>716</fpage>&#x02013;<lpage>732</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-7652.2012.00710.x</pub-id><pub-id pub-id-type="pmid">22703242</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jain</surname> <given-names>M.</given-names></name> <name><surname>Misra</surname> <given-names>G.</given-names></name> <name><surname>Patel</surname> <given-names>R. K.</given-names></name> <name><surname>Priya</surname> <given-names>P.</given-names></name> <name><surname>Jhanwar</surname> <given-names>S.</given-names></name> <name><surname>Khan</surname> <given-names>A. W.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>A draft genome sequence of the pulse crop chickpea (<italic>Cicer arietinum</italic> L.)</article-title>. <source>Plant J.</source> <volume>74</volume>, <fpage>715</fpage>&#x02013;<lpage>729</lpage>. <pub-id pub-id-type="doi">10.1111/tpj.12173</pub-id><pub-id pub-id-type="pmid">23489434</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>KEGG: kyoto encyclopedia of genes and genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>27</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1093/nar/28.1.27</pub-id><pub-id pub-id-type="pmid">10592173</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kesawat</surname> <given-names>M. S.</given-names></name> <name><surname>Kumar</surname> <given-names>B. D.</given-names></name></person-group> (<year>2009</year>). <article-title>Molecular markers: it&#x00027;s application in crop improvement</article-title>. <source>J. Crop Sci. Biotechnol.</source> <volume>12</volume>, <fpage>169</fpage>&#x02013;<lpage>181</lpage>. <pub-id pub-id-type="doi">10.1007/s12892-009-0124-6</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koressaar</surname> <given-names>T.</given-names></name> <name><surname>Remm</surname> <given-names>M.</given-names></name></person-group> (<year>2007</year>). <article-title>Enhancements and modifications of primer design program Primer3</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>1289</fpage>&#x02013;<lpage>1291</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm091</pub-id><pub-id pub-id-type="pmid">17379693</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kornobis</surname> <given-names>E.</given-names></name> <name><surname>Cabellos</surname> <given-names>L.</given-names></name> <name><surname>Aguilar</surname> <given-names>F.</given-names></name> <name><surname>Fr&#x000ED;as-L&#x000F3;pez</surname> <given-names>C.</given-names></name> <name><surname>Rozas</surname> <given-names>J.</given-names></name> <name><surname>Marco</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>TRUFA: a user-friendly web server for <italic>de novo</italic> RNA-seq analysis using cluster computing</article-title>. <source>Evol. Bioinformatics</source> <volume>11</volume>, <fpage>97</fpage>&#x02013;<lpage>104</lpage>. <pub-id pub-id-type="doi">10.4137/EBO.S23873</pub-id><pub-id pub-id-type="pmid">26056424</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Parekh</surname> <given-names>M. J.</given-names></name> <name><surname>Patel</surname> <given-names>C. B.</given-names></name> <name><surname>Zala</surname> <given-names>H. N.</given-names></name> <name><surname>Sharma</surname> <given-names>R.</given-names></name> <name><surname>Kulkarni</surname> <given-names>K. S.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Development and validation of EST-derived SSR markers and diversity analysis in cluster bean (<italic>Cyamopsis tetragonoloba</italic>)</article-title>. <source>J. Plant Biochem. Biotechnol.</source> <volume>25</volume>, <fpage>263</fpage>&#x02013;<lpage>269</lpage>. <pub-id pub-id-type="doi">10.1007/s13562-015-0337-3</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuravadi</surname> <given-names>N. A.</given-names></name> <name><surname>Tiwari</surname> <given-names>P. B.</given-names></name> <name><surname>Choudhary</surname> <given-names>M.</given-names></name> <name><surname>Randhawa</surname> <given-names>G. S.</given-names></name></person-group> (<year>2013</year>). <article-title>Genetic diversity study of cluster bean (<italic>Cyamopsis tetragonoloba</italic> (L.) Taub) landraces using RAPD and ISSR markers</article-title>. <source>Int. J. Adv. Biotechnol. Res.</source> <volume>4</volume>, <fpage>460</fpage>&#x02013;<lpage>471</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://bipublication.com/files/IJABR-V4I4-2013-05.pdf">http://bipublication.com/files/IJABR-V4I4-2013-05.pdf</ext-link></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuravadi</surname> <given-names>N. A.</given-names></name> <name><surname>Tiwari</surname> <given-names>P. B.</given-names></name> <name><surname>Tanwar</surname> <given-names>U. K.</given-names></name> <name><surname>Tripathi</surname> <given-names>S. K.</given-names></name> <name><surname>Dhugga</surname> <given-names>K. S.</given-names></name> <name><surname>Gill</surname> <given-names>K. S.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Identification and characterization of EST-SSR markers in cluster bean (<italic>Cyamopsis</italic> spp.)</article-title>. <source>Crop Sci.</source> <volume>54</volume>, <fpage>1097</fpage>&#x02013;<lpage>1102</lpage>. <pub-id pub-id-type="doi">10.2135/cropsci2013.08.0522</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langmead</surname> <given-names>B.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Fast gapped-read alignment with Bowtie 2</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>357</fpage>&#x02013;<lpage>359</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1923</pub-id><pub-id pub-id-type="pmid">22388286</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2078</fpage>&#x02013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id><pub-id pub-id-type="pmid">19505943</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>W.</given-names></name> <name><surname>Godzik</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences</article-title>. <source>Bioinformatics</source> <volume>22</volume>, <fpage>1658</fpage>&#x02013;<lpage>1659</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btl158</pub-id><pub-id pub-id-type="pmid">16731699</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lu</surname> <given-names>F. H.</given-names></name> <name><surname>Cho</surname> <given-names>M. C.</given-names></name> <name><surname>Park</surname> <given-names>Y. J.</given-names></name></person-group> (<year>2012</year>). <article-title>Transcriptome profiling and molecular marker discovery in red pepper, <italic>Capsicum annuum</italic> L. TF68</article-title>. <source>Mol. Biol. Rep.</source> <volume>39</volume>, <fpage>3327</fpage>&#x02013;<lpage>3335</lpage>. <pub-id pub-id-type="doi">10.1007/s11033-011-1102-x</pub-id><pub-id pub-id-type="pmid">21706160</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Michaels</surname> <given-names>S. D.</given-names></name> <name><surname>Amasino</surname> <given-names>R. M.</given-names></name></person-group> (<year>1998</year>). <article-title>A robust method for detecting single-nucleotide changes as polymorphic markers by PCR</article-title>. <source>Plant J.</source> <volume>14</volume>, <fpage>381</fpage>&#x02013;<lpage>385</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-313X.1998.00123.x</pub-id><pub-id pub-id-type="pmid">9628032</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morgante</surname> <given-names>M.</given-names></name> <name><surname>Hanafey</surname> <given-names>M.</given-names></name> <name><surname>Powell</surname> <given-names>W.</given-names></name></person-group> (<year>2002</year>). <article-title>Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes</article-title>. <source>Nat. Genet.</source> <volume>30</volume>, <fpage>194</fpage>&#x02013;<lpage>200</lpage>. <pub-id pub-id-type="doi">10.1038/ng822</pub-id><pub-id pub-id-type="pmid">11799393</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mortazavi</surname> <given-names>A.</given-names></name> <name><surname>Williams</surname> <given-names>B. A.</given-names></name> <name><surname>McCue</surname> <given-names>K.</given-names></name> <name><surname>Schaeffer</surname> <given-names>L.</given-names></name> <name><surname>Wold</surname> <given-names>B.</given-names></name></person-group> (<year>2008</year>). <article-title>Mapping and quantifying mammalian transcriptomes by RNA-Seq</article-title>. <source>Nat. Methods</source> <volume>5</volume>, <fpage>621</fpage>&#x02013;<lpage>628</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1226</pub-id><pub-id pub-id-type="pmid">18516045</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nagalakshmi</surname> <given-names>U.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Waern</surname> <given-names>K.</given-names></name> <name><surname>Shou</surname> <given-names>C.</given-names></name> <name><surname>Raha</surname> <given-names>D.</given-names></name> <name><surname>Gerstein</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>The transcriptional landscape of the yeast genome defined by RNA sequencing</article-title>. <source>Science</source> <volume>320</volume>, <fpage>1344</fpage>&#x02013;<lpage>1349</lpage>. <pub-id pub-id-type="doi">10.1126/science.1158441</pub-id><pub-id pub-id-type="pmid">18451266</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakasugi</surname> <given-names>K.</given-names></name> <name><surname>Crowhurst</surname> <given-names>R. N.</given-names></name> <name><surname>Bally</surname> <given-names>J.</given-names></name> <name><surname>Wood</surname> <given-names>C. C.</given-names></name> <name><surname>Hellens</surname> <given-names>R. P.</given-names></name> <name><surname>Waterhouse</surname> <given-names>P. M.</given-names></name></person-group> (<year>2013</year>). <article-title><italic>De novo</italic> transcriptome sequence assembly and analysis of RNA silencing genes of <italic>Nicotiana benthamiana</italic></article-title>. <source>PLoS ONE</source> <volume>8</volume>:<fpage>e59534</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0059534</pub-id><pub-id pub-id-type="pmid">23555698</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nasu</surname> <given-names>S.</given-names></name> <name><surname>Suzuki</surname> <given-names>J.</given-names></name> <name><surname>Ohta</surname> <given-names>R.</given-names></name> <name><surname>Hasegawa</surname> <given-names>K.</given-names></name> <name><surname>Yui</surname> <given-names>R.</given-names></name> <name><surname>Kitazawa</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2002</year>). <article-title>Search for and analysis of single nucleotide polymorphisms (SNPs) in rice (<italic>Oryza sativa, Oryza rufipogon</italic>) and establishment of SNP markers</article-title>. <source>DNA Res.</source> <volume>9</volume>, <fpage>163</fpage>&#x02013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1093/dnares/9.5.163</pub-id><pub-id pub-id-type="pmid">12465716</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neff</surname> <given-names>M. M.</given-names></name> <name><surname>Neff</surname> <given-names>J. D.</given-names></name> <name><surname>Chory</surname> <given-names>J.</given-names></name> <name><surname>Pepper</surname> <given-names>A. E.</given-names></name></person-group> (<year>1998</year>). <article-title>dCAPS, a simple technique for the genetic analysis of single nucleotide polymorphisms: experimental applications in <italic>Arabidopsis thaliana</italic> genetics</article-title>. <source>Plant J.</source> <volume>14</volume>, <fpage>387</fpage>&#x02013;<lpage>392</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-313X.1998.00124.x</pub-id><pub-id pub-id-type="pmid">9628033</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parchman</surname> <given-names>T. L.</given-names></name> <name><surname>Geist</surname> <given-names>K. S.</given-names></name> <name><surname>Grahnen</surname> <given-names>J. A.</given-names></name> <name><surname>Benkman</surname> <given-names>C. W.</given-names></name> <name><surname>Buerkle</surname> <given-names>C. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>180</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-11-180</pub-id><pub-id pub-id-type="pmid">20233449</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parra</surname> <given-names>G.</given-names></name> <name><surname>Bradnam</surname> <given-names>K.</given-names></name> <name><surname>Korf</surname> <given-names>I.</given-names></name></person-group> (<year>2007</year>). <article-title>CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>1061</fpage>&#x02013;<lpage>1067</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm071</pub-id><pub-id pub-id-type="pmid">17332020</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parra</surname> <given-names>G.</given-names></name> <name><surname>Bradnam</surname> <given-names>K.</given-names></name> <name><surname>Ning</surname> <given-names>Z.</given-names></name> <name><surname>Keane</surname> <given-names>T.</given-names></name> <name><surname>Korf</surname> <given-names>I.</given-names></name></person-group> (<year>2009</year>). <article-title>Assessing the gene space in draft genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>37</volume>, <fpage>289</fpage>&#x02013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn916</pub-id><pub-id pub-id-type="pmid">19042974</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pathak</surname> <given-names>R.</given-names></name> <name><surname>Singh</surname> <given-names>S. K.</given-names></name> <name><surname>Singh</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Assessment of genetic diversity in clusterbean using nuclear rDNA and RAPD markers</article-title>. <source>J. Food Legumes</source> <volume>24</volume>, <fpage>180</fpage>&#x02013;<lpage>183</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.indianjournals.com/ijor.aspx?target=ijor:jfl&#x00026;volume=24&#x00026;issue=3&#x00026;article=003">http://www.indianjournals.com/ijor.aspx?target=ijor:jfl&#x00026;volume=24&#x00026;issue=3&#x00026;article=003</ext-link></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pathak</surname> <given-names>R.</given-names></name> <name><surname>Singh</surname> <given-names>S. K.</given-names></name> <name><surname>Singh</surname> <given-names>M.</given-names></name> <name><surname>Henry</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>Molecular assessment of genetic diversity in cluster bean (<italic>Cyamopsis tetragonoloba</italic>) genotypes</article-title>. <source>J. Genet.</source> <volume>89</volume>, <fpage>243</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1007/s12041-010-0033-y</pub-id><pub-id pub-id-type="pmid">20861578</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Patil</surname> <given-names>C. G.</given-names></name></person-group> (<year>2004</year>). <article-title>Nuclear DNA amount variation in <italic>Cyamopsis</italic> DC (Fabaceae)</article-title>. <source>Cytologia</source> <volume>69</volume>, <fpage>59</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1508/cytologia.69.59</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>J. H.</given-names></name> <name><surname>Lapitan</surname> <given-names>N. L.</given-names></name></person-group> (<year>2005</year>). <article-title>Characterization of EST-derived microsatellites in the wheat genome and development of eSSR markers</article-title>. <source>Funct. Integr. Genomics</source> <volume>5</volume>, <fpage>80</fpage>&#x02013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1007/s10142-004-0128-8</pub-id><pub-id pub-id-type="pmid">15650880</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Punia</surname> <given-names>A.</given-names></name> <name><surname>Yadav</surname> <given-names>R.</given-names></name> <name><surname>Arora</surname> <given-names>P.</given-names></name> <name><surname>Chaudhury</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Molecular and morphophysiological characterization of superior cluster bean (<italic>Cymopsis tetragonoloba</italic>) varieties</article-title>. <source>J. Crop Sci. Biotechnol.</source> <volume>12</volume>, <fpage>143</fpage>&#x02013;<lpage>148</lpage>. <pub-id pub-id-type="doi">10.1007/s12892-009-0106-8</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rafalski</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). <article-title>Applications of single nucleotide polymorphisms in crop genetics</article-title>. <source>Curr. Opin. Plant Biol.</source> <volume>5</volume>, <fpage>94</fpage>&#x02013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.1016/S1369-5266(02)00240-6</pub-id><pub-id pub-id-type="pmid">11856602</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>J. T.</given-names></name> <name><surname>Thorvaldsd&#x000F3;ttir</surname> <given-names>H.</given-names></name> <name><surname>Winckler</surname> <given-names>W.</given-names></name> <name><surname>Guttman</surname> <given-names>M.</given-names></name> <name><surname>Lander</surname> <given-names>E. S.</given-names></name> <name><surname>Getz</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Integrative genomics viewer</article-title>. <source>Nat. Biotechnol.</source> <volume>29</volume>, <fpage>24</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.1754</pub-id><pub-id pub-id-type="pmid">21221095</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sato</surname> <given-names>S.</given-names></name> <name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Kaneko</surname> <given-names>T.</given-names></name> <name><surname>Asamizu</surname> <given-names>E.</given-names></name> <name><surname>Kato</surname> <given-names>T.</given-names></name> <name><surname>Nakao</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Genome structure of the legume, <italic>Lotus japonicus</italic></article-title>. <source>DNA Res.</source> <volume>15</volume>, <fpage>227</fpage>&#x02013;<lpage>239</lpage>. <pub-id pub-id-type="doi">10.1093/dnares/dsn008</pub-id><pub-id pub-id-type="pmid">18511435</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmutz</surname> <given-names>J.</given-names></name> <name><surname>Cannon</surname> <given-names>S. B.</given-names></name> <name><surname>Schlueter</surname> <given-names>J.</given-names></name> <name><surname>Ma</surname> <given-names>J.</given-names></name> <name><surname>Mitros</surname> <given-names>T.</given-names></name> <name><surname>Nelson</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Genome sequence of the palaeopolyploid soybean</article-title>. <source>Nature</source> <volume>463</volume>, <fpage>178</fpage>&#x02013;<lpage>183</lpage>. <pub-id pub-id-type="doi">10.1038/nature08670</pub-id><pub-id pub-id-type="pmid">20075913</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shahinnia</surname> <given-names>F.</given-names></name> <name><surname>Sayed-Tabatabaei</surname> <given-names>B. E.</given-names></name></person-group> (<year>2009</year>). <article-title>Conversion of barley SNPs into PCR-based markers using dCAPS method</article-title>. <source>Genet. Mol. Biol.</source> <volume>32</volume>, <fpage>564</fpage>&#x02013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.1590/S1415-47572009005000047</pub-id><pub-id pub-id-type="pmid">21637520</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharma</surname> <given-names>P.</given-names></name> <name><surname>Kumar</surname> <given-names>V.</given-names></name> <name><surname>Raman</surname> <given-names>K. V.</given-names></name> <name><surname>Tiwari</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>). <article-title>A set of SCAR markers in cluster bean (<italic>Cyamopsis tetragonoloba</italic> L. Taub) genotypes</article-title>. <source>Adv. Biosci. Biotechnol.</source> <volume>5</volume>, <fpage>131</fpage>&#x02013;<lpage>141</lpage>. <pub-id pub-id-type="doi">10.4236/abb.2014.52017</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slavin</surname> <given-names>J. L.</given-names></name> <name><surname>Greenberg</surname> <given-names>N. A.</given-names></name></person-group> (<year>2003</year>). <article-title>Partially hydrolyzed guar gum: clinical nutrition uses</article-title>. <source>Nutrition</source> <volume>19</volume>, <fpage>549</fpage>&#x02013;<lpage>552</lpage>. <pub-id pub-id-type="doi">10.1016/S0899-9007(02)01032-8</pub-id><pub-id pub-id-type="pmid">12781858</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sonah</surname> <given-names>H.</given-names></name> <name><surname>Deshmukh</surname> <given-names>R. K.</given-names></name> <name><surname>Sharma</surname> <given-names>A.</given-names></name> <name><surname>Singh</surname> <given-names>V. P.</given-names></name> <name><surname>Gupta</surname> <given-names>D. K.</given-names></name> <name><surname>Gacche</surname> <given-names>R. N.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in <italic>Brachypodium</italic></article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e21298</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0021298</pub-id><pub-id pub-id-type="pmid">21713003</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Suzek</surname> <given-names>B. E.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Huang</surname> <given-names>H.</given-names></name> <name><surname>McGarvey</surname> <given-names>P. B.</given-names></name> <name><surname>Wu</surname> <given-names>C. H.</given-names></name> <collab>UniProt Consortium</collab></person-group> (<year>2015</year>). <article-title>UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>926</fpage>&#x02013;<lpage>932</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu739</pub-id><pub-id pub-id-type="pmid">25398609</pub-id></citation>
</ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thiel</surname> <given-names>T.</given-names></name> <name><surname>Michalek</surname> <given-names>W.</given-names></name> <name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Graner</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (<italic>Hordeum vulgare</italic> L.)</article-title>. <source>Theor. Appl. Genet.</source> <volume>106</volume>, <fpage>411</fpage>&#x02013;<lpage>422</lpage>. <pub-id pub-id-type="doi">10.1007/s00122-002-1031-0</pub-id><pub-id pub-id-type="pmid">12589540</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thorvaldsd&#x000F3;ttir</surname> <given-names>H.</given-names></name> <name><surname>Robinson</surname> <given-names>J. T.</given-names></name> <name><surname>Mesirov</surname> <given-names>J. P.</given-names></name></person-group> (<year>2013</year>). <article-title>Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration</article-title>. <source>Brief. Bioinformatics</source> <volume>14</volume>, <fpage>178</fpage>&#x02013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbs017</pub-id><pub-id pub-id-type="pmid">22517427</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Undersander</surname> <given-names>D. J.</given-names></name> <name><surname>Putnam</surname> <given-names>D. H.</given-names></name> <name><surname>Kaminski</surname> <given-names>A. R.</given-names></name> <name><surname>Kelling</surname> <given-names>K. A.</given-names></name> <name><surname>Doll</surname> <given-names>J. D.</given-names></name> <name><surname>Oplinger</surname> <given-names>E. S.</given-names></name> <etal/></person-group>. (<year>1991</year>). <article-title>Guar</article-title>, in <source>Alternative Field Crops Manual</source> (University of Wisconsin; University of Minnesota). Available online at: <ext-link ext-link-type="uri" xlink:href="https://hort.purdue.edu/newcrop/afcm/guar.html">https://hort.purdue.edu/newcrop/afcm/guar.html</ext-link></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Untergasser</surname> <given-names>A.</given-names></name> <name><surname>Cutcutache</surname> <given-names>I.</given-names></name> <name><surname>Koressaar</surname> <given-names>T.</given-names></name> <name><surname>Ye</surname> <given-names>J.</given-names></name> <name><surname>Faircloth</surname> <given-names>B. C.</given-names></name> <name><surname>Remm</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Primer3-new capabilities and interfaces</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume>:<fpage>e115</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gks596</pub-id><pub-id pub-id-type="pmid">22730293</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Bel</surname> <given-names>M.</given-names></name> <name><surname>Proost</surname> <given-names>S.</given-names></name> <name><surname>Van Neste</surname> <given-names>C.</given-names></name> <name><surname>Deforce</surname> <given-names>D.</given-names></name> <name><surname>Van de Peer</surname> <given-names>Y.</given-names></name> <name><surname>Vandepoele</surname> <given-names>K.</given-names></name></person-group> (<year>2013</year>). <article-title>TRAPID: an efficient online tool for the functional and comparative analysis of <italic>de novo</italic> RNA-Seq transcriptomes</article-title>. <source>Genome Biol.</source> <volume>14</volume>:<fpage>R134</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2013-14-12-r134</pub-id><pub-id pub-id-type="pmid">24330842</pub-id></citation>
</ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Bharti</surname> <given-names>A. K.</given-names></name> <name><surname>Saxena</surname> <given-names>R. K.</given-names></name> <name><surname>Schlueter</surname> <given-names>J. A.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Draft genome sequence of pigeonpea (<italic>Cajanus cajan</italic>), an orphan legume crop of resource-poor farmers</article-title>. <source>Nat. Biotechnol.</source> <volume>30</volume>, <fpage>83</fpage>&#x02013;<lpage>89</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2022</pub-id><pub-id pub-id-type="pmid">22057054</pub-id></citation>
</ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Graner</surname> <given-names>A.</given-names></name> <name><surname>Sorrells</surname> <given-names>M. E.</given-names></name></person-group> (<year>2005a</year>). <article-title>Genic microsatellite markers in plants: features and applications</article-title>. <source>Trends Biotechnol.</source> <volume>23</volume>, <fpage>48</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibtech.2004.11.005</pub-id><pub-id pub-id-type="pmid">15629858</pub-id></citation>
</ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Grosse</surname> <given-names>I.</given-names></name> <name><surname>H&#x000E4;hnel</surname> <given-names>U.</given-names></name> <name><surname>Siefken</surname> <given-names>R.</given-names></name> <name><surname>Prasad</surname> <given-names>M.</given-names></name> <name><surname>Stein</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Genetic mapping and BAC assignment of EST-derived SSR markers shows non-uniform distribution of genes in the barley genome</article-title>. <source>Theor. Appl. Genet.</source> <volume>113</volume>, <fpage>239</fpage>&#x02013;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1007/s00122-006-0289-z</pub-id><pub-id pub-id-type="pmid">16791690</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Sigmund</surname> <given-names>R.</given-names></name> <name><surname>Barner</surname> <given-names>A.</given-names></name> <name><surname>Korzun</surname> <given-names>V.</given-names></name> <name><surname>Stein</surname> <given-names>N.</given-names></name> <name><surname>Sorrells</surname> <given-names>M. E.</given-names></name> <etal/></person-group>. (<year>2005b</year>). <article-title>Interspecific transferability and comparative mapping of barley EST-SSR markers in wheat, rye and rice</article-title>. <source>Plant Sci.</source> <volume>168</volume>, <fpage>195</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1016/j.plantsci.2004.08.001</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Varshney</surname> <given-names>R. K.</given-names></name> <name><surname>Song</surname> <given-names>C.</given-names></name> <name><surname>Saxena</surname> <given-names>R. K.</given-names></name> <name><surname>Azam</surname> <given-names>S.</given-names></name> <name><surname>Yu</surname> <given-names>S.</given-names></name> <name><surname>Sharpe</surname> <given-names>A. G.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Draft genome sequence of chickpea (<italic>Cicer arietinum</italic>) provides a resource for trait improvement</article-title>. <source>Nat. Biotechnol.</source> <volume>31</volume>, <fpage>240</fpage>&#x02013;<lpage>246</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2491</pub-id><pub-id pub-id-type="pmid">23354103</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wakeley</surname> <given-names>J.</given-names></name></person-group> (<year>1994</year>). <article-title>Substitution-rate variation among sites and the estimation of transition bias</article-title>. <source>Mol. Biol. Evol.</source> <volume>11</volume>, <fpage>436</fpage>&#x02013;<lpage>442</lpage>. <pub-id pub-id-type="pmid">8015437</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wakeley</surname> <given-names>J.</given-names></name></person-group> (<year>1996</year>). <article-title>The excess of transitions among nucleotide substitutions: new methods of estimating transition bias underscore its significance</article-title>. <source>Trends Ecol. Evol.</source> <volume>11</volume>, <fpage>158</fpage>&#x02013;<lpage>162</lpage>. <pub-id pub-id-type="doi">10.1016/0169-5347(96)10009-4</pub-id><pub-id pub-id-type="pmid">21237791</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Gerstein</surname> <given-names>M.</given-names></name> <name><surname>Snyder</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>RNA-Seq: a revolutionary tool for transcriptomics</article-title>. <source>Nat. Rev. Genet.</source> <volume>10</volume>, <fpage>57</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2484</pub-id><pub-id pub-id-type="pmid">19015660</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Luo</surname> <given-names>Z.</given-names></name> <name><surname>Huang</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Fang</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Characterization and development of EST-derived SSR markers in cultivated sweetpotato (<italic>Ipomoea batatas</italic>)</article-title>. <source>BMC Plant Biol.</source> <volume>11</volume>:<fpage>139</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2229-11-139</pub-id><pub-id pub-id-type="pmid">22011271</pub-id></citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Yu</surname> <given-names>G.</given-names></name> <name><surname>Shi</surname> <given-names>B.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Qiang</surname> <given-names>H.</given-names></name> <name><surname>Gao</surname> <given-names>H.</given-names></name></person-group> (<year>2014</year>). <article-title>Development and characterization of simple sequence repeat (SSR) markers based on RNA-sequencing of <italic>Medicago sativa</italic> and <italic>in silico</italic> mapping onto the <italic>M. truncatula</italic> genome</article-title>. <source>PLoS ONE</source> <volume>9</volume>:<fpage>e92029</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0092029</pub-id><pub-id pub-id-type="pmid">24642969</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>L.</given-names></name> <name><surname>Yin</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>J.</given-names></name> <name><surname>Yu</surname> <given-names>L.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Sequencing, <italic>de novo</italic> assembly and comparative analysis of <italic>Raphanus sativus</italic> transcriptome</article-title>. <source>Front. Plant Sci.</source> <volume>6</volume>:<fpage>198</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2015.00198</pub-id><pub-id pub-id-type="pmid">26029219</pub-id></citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xin</surname> <given-names>D.</given-names></name> <name><surname>Sun</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Jiang</surname> <given-names>H.</given-names></name> <name><surname>Hu</surname> <given-names>G.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Identification and characterization of SSRs from soybean (<italic>Glycine max</italic>) ESTs</article-title>. <source>Mol. Biol. Rep.</source> <volume>39</volume>, <fpage>9047</fpage>&#x02013;<lpage>9057</lpage>. <pub-id pub-id-type="doi">10.1007/s11033-012-1776-8</pub-id><pub-id pub-id-type="pmid">22744420</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="other"><person-group person-group-type="author"><name><surname>Yadav</surname> <given-names>H.</given-names></name> <name><surname>Prasad</surname> <given-names>A. K.</given-names></name> <name><surname>Goswami</surname> <given-names>P.</given-names></name> <name><surname>Pednekar</surname> <given-names>S.</given-names></name> <name><surname>Haque</surname> <given-names>E.</given-names></name> <name><surname>Shah</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <source>Guar Industry Outlook 2015</source>. Report made for: National Commodity &#x00026; Derivatives Exchange Limited. NIAM, Jaipur.</citation>
</ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>T.</given-names></name> <name><surname>Bao</surname> <given-names>S. Y.</given-names></name> <name><surname>Ford</surname> <given-names>R.</given-names></name> <name><surname>Jia</surname> <given-names>T. J.</given-names></name> <name><surname>Guan</surname> <given-names>J. P.</given-names></name> <name><surname>He</surname> <given-names>Y. H.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>High-throughput novel microsatellite marker of faba bean via next generation sequencing</article-title>. <source>BMC Genomics</source> <volume>13</volume>:<fpage>602</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-13-602</pub-id><pub-id pub-id-type="pmid">23137291</pub-id></citation>
</ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Bielawski</surname> <given-names>J. P.</given-names></name></person-group> (<year>2000</year>). <article-title>Statistical methods for detecting molecular adaptation</article-title>. <source>Trends Ecol. Evol.</source> <volume>15</volume>, <fpage>496</fpage>&#x02013;<lpage>503</lpage>. <pub-id pub-id-type="doi">10.1016/S0169-5347(00)01994-7</pub-id><pub-id pub-id-type="pmid">11114436</pub-id></citation>
</ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Z.</given-names></name> <name><surname>Yoder</surname> <given-names>A. D.</given-names></name></person-group> (<year>1999</year>). <article-title>Estimation of the transition/transversion rate bias and species sampling</article-title>. <source>J. Mol. Evol.</source> <volume>48</volume>, <fpage>274</fpage>&#x02013;<lpage>283</lpage>. <pub-id pub-id-type="doi">10.1007/PL00006470</pub-id><pub-id pub-id-type="pmid">10093216</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ye</surname> <given-names>J.</given-names></name> <name><surname>Fang</surname> <given-names>L.</given-names></name> <name><surname>Zheng</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>WEGO: a web tool for plotting GO annotations</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume>(<supplement>Suppl. 2</supplement>), <fpage>W293</fpage>&#x02013;<lpage>W297</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkl031</pub-id><pub-id pub-id-type="pmid">16845012</pub-id></citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>N. D.</given-names></name> <name><surname>Debell&#x000E9;</surname> <given-names>F.</given-names></name> <name><surname>Oldroyd</surname> <given-names>G. E.</given-names></name> <name><surname>Geurts</surname> <given-names>R.</given-names></name> <name><surname>Cannon</surname> <given-names>S. B.</given-names></name> <name><surname>Udvardi</surname> <given-names>M. K.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>The <italic>Medicago</italic> genome provides insight into the evolution of rhizobial symbioses</article-title>. <source>Nature</source> <volume>480</volume>, <fpage>520</fpage>&#x02013;<lpage>524</lpage>. <pub-id pub-id-type="doi">10.1038/nature10625</pub-id><pub-id pub-id-type="pmid">22089132</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>N. D.</given-names></name> <name><surname>Mudge</surname> <given-names>J.</given-names></name> <name><surname>Ellis</surname> <given-names>T. H.</given-names></name></person-group> (<year>2003</year>). <article-title>Legume genomes: more than peas in a pod</article-title>. <source>Curr. Opin. Plant Biol.</source> <volume>6</volume>, <fpage>199</fpage>&#x02013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1016/S1369-5266(03)00006-2</pub-id><pub-id pub-id-type="pmid">12667879</pub-id></citation>
</ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>J. N.</given-names></name> <name><surname>Won</surname> <given-names>C.</given-names></name> <name><surname>Jun</surname> <given-names>J.</given-names></name> <name><surname>Lim</surname> <given-names>Y. W.</given-names></name> <name><surname>Kwak</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Fast and cost-effective mining of microsatellite markers using NGS technology: an example of a Korean water deer <italic>Hydropotes inermis argyropus</italic></article-title>. <source>PLoS ONE</source> <volume>6</volume>:<fpage>e26933</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0026933</pub-id><pub-id pub-id-type="pmid">22069476</pub-id></citation>
</ref>
</ref-list>
</back>
</article>