<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2017.02029</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Inference of Transcription Regulatory Network in Low Phytic Acid Soybean Seeds</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Redekar</surname> <given-names>Neelam</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn004"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/340447/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Pilot</surname> <given-names>Guillaume</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/11124/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Raboy</surname> <given-names>Victor</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/58055/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Li</surname> <given-names>Song</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/45869/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Saghai Maroof</surname> <given-names>M. A.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/458642/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Crop and Soil Environmental Sciences, Virginia Tech</institution>, <addr-line>Blacksburg, VA</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Plant Pathology, Physiology, and Weed Science, Virginia Tech</institution>, <addr-line>Blacksburg, VA</addr-line>, <country>United States</country></aff>
<aff id="aff3"><sup>3</sup><institution>National Small Grains Germplasm Research Center, Agricultural Research Service (USDA)</institution>, <addr-line>Aberdeen, ID</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Diego Mauricio Ria&#x000F1;o-Pach&#x000F3;n, Institute of Chemistry, University of S&#x000E3;o Paulo, Brazil</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jedrzej Jakub Szymanski, Weizmann Institute of Science, Israel; Sudip Kundu, University of Calcutta, India; Maria Katherine Mejia Guerra, Cornell University, United States</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Song Li <email>songli&#x00040;vt.edu</email></p></fn>
<fn fn-type="corresp" id="fn002"><p>M. A. Saghai Maroof <email>smaroof&#x00040;vt.edu</email></p></fn>
<fn fn-type="other" id="fn003"><p>This article was submitted to Plant Systems and Synthetic Biology, a section of the journal Frontiers in Plant Science</p></fn>
<fn fn-type="present-address" id="fn004"><p>&#x02020;Present Address: Neelam Redekar, Department of Crop and Soil Science, Oregon State University, Corvallis, OR, United States</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>30</day>
<month>11</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>8</volume>
<elocation-id>2029</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>07</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>11</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Redekar, Pilot, Raboy, Li and Saghai Maroof.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Redekar, Pilot, Raboy, Li and Saghai Maroof</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>A dominant loss of function mutation in <italic>myo</italic>-inositol phosphate synthase (<italic>MIPS</italic>) gene and recessive loss of function mutations in two multidrug resistant protein type-ABC transporter genes not only reduce the seed phytic acid levels in soybean, but also affect the pathways associated with seed development, ultimately resulting in low emergence. To understand the regulatory mechanisms and identify key genes that intervene in the seed development process in low phytic acid crops, we performed computational inference of gene regulatory networks in low and normal phytic acid soybeans using a time course transcriptomic data and multiple network inference algorithms. We identified a set of putative candidate transcription factors and their regulatory interactions with genes that have functions in myo-inositol biosynthesis, auxin-ABA signaling, and seed dormancy. We evaluated the performance of our unsupervised network inference method by comparing the predicted regulatory network with published regulatory interactions in <italic>Arabidopsis</italic>. Some contrasting regulatory interactions were observed in low phytic acid mutants compared to non-mutant lines. These findings provide important hypotheses on expression regulation of <italic>myo</italic>-inositol metabolism and phytohormone signaling in developing low phytic acid soybeans. The computational pipeline used for unsupervised network learning in this study is provided as open source software and is freely available at <ext-link ext-link-type="uri" xlink:href="https://lilabatvt.github.io/LPANetwork/">https://lilabatvt.github.io/LPANetwork/</ext-link>.</p></abstract>
<kwd-group>
<kwd>phytic acid</kwd>
<kwd>soybean seed development</kwd>
<kwd><italic>myo</italic>-inositol metabolism</kwd>
<kwd>unsupervised machine learning</kwd>
<kwd>gene regulatory network</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="63"/>
<page-count count="14"/>
<word-count count="9715"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Seed development is a complex metabolic process, which involves both synthesis and breakdown of macromolecules for growth and maintenance of the embryo (Weber et al., <xref ref-type="bibr" rid="B59">2005</xref>; Le et al., <xref ref-type="bibr" rid="B25">2007</xref>). During seed development, glucose-6-phosphaste is converted to myo-inositol, an intracellular signaling molecule, which is phosphorylated several times to form phytic acid (Raboy, <xref ref-type="bibr" rid="B42">1997</xref>). Seeds with reduced phytic acid content are commercially more valuable because consumption of low phytic acid seeds by monogastric animals alleviates mineral deficiency and reduces phosphorus pollution from animal waste (Raboy, <xref ref-type="bibr" rid="B43">2007</xref>). Mutations that block the phytic acid biosynthesis pathway have been shown to alter the seed metabolite levels in soybean, rice, maize, and other plant species (Wilcox et al., <xref ref-type="bibr" rid="B60">2000</xref>; Shi et al., <xref ref-type="bibr" rid="B54">2003</xref>, <xref ref-type="bibr" rid="B53">2005</xref>; Stevenson-Paulik et al., <xref ref-type="bibr" rid="B57">2005</xref>; Raboy, <xref ref-type="bibr" rid="B43">2007</xref>; Glover, <xref ref-type="bibr" rid="B11">2011</xref>; Jervis et al., <xref ref-type="bibr" rid="B19">2015</xref>). For example, a mutation in <italic>myo-</italic>inositol phosphate synthase (<italic>MIPS</italic>) gene results in reduced phytic acid, stachyose, raffinose, and elevated sucrose, and low seed emergence in soybean (Hitz et al., <xref ref-type="bibr" rid="B16">2002</xref>; Saghai Maroof and Buss, <xref ref-type="bibr" rid="B46">2008</xref>). Other non-biosynthetic pathway genes such as multi-drug resistance protein (<italic>MRP</italic>) genes encoding ATP-binding cassette transporters that are believed to be involved in the transport of phytic acid to storage vacuoles, are also known to regulate phytic acid levels and affect seed emergence (Shi et al., <xref ref-type="bibr" rid="B52">2007</xref>; Nagy et al., <xref ref-type="bibr" rid="B39">2009</xref>; Saghai Maroof et al., <xref ref-type="bibr" rid="B47">2009</xref>; Xu et al., <xref ref-type="bibr" rid="B62">2009</xref>; Jervis et al., <xref ref-type="bibr" rid="B19">2015</xref>).</p>
<p>Transcriptome analysis is a valuable tool for the characterization of the regulatory networks that mediate this complex interaction. The expressions of genes involved in the metabolic activities in seeds are tightly regulated by the synergistic action of many transcription factors and other regulatory genes (Weber et al., <xref ref-type="bibr" rid="B59">2005</xref>; Le et al., <xref ref-type="bibr" rid="B25">2007</xref>). Two independent studies, one with barley low phytic acid (<italic>lpa</italic>) mutant, and another with soybean <italic>mips1/mrp-l/mrp-n</italic> (&#x02018;<italic>3mlpa</italic>&#x02019;) triple mutant have reported the effect of <italic>lpa</italic> mutations on the transcriptomic profiles of developing seeds (Bowen et al., <xref ref-type="bibr" rid="B3">2007</xref>; Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>). The differential expression of transcription factor genes such as WRKY and CAMTA (Calmodulin-binding Transcription Activator), was linked to phytic acid biosynthesis pathway, suggesting a complex regulatory mechanism (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>). Association of WRKY transcription factors and Ca<sup>2&#x0002B;</sup> binding activity with inositol metabolism was also confirmed by another independent study with maize low phytic acid breeding line Qi319 (Zhang et al., <xref ref-type="bibr" rid="B63">2016</xref>). Zhang et al. (<xref ref-type="bibr" rid="B63">2016</xref>) also identified ABC transporter gene candidates associated with low phytic acid phenotype in maize using co-regulatory network. In this article, we focus on the discovery of transcription regulatory networks to further investigate the inositol metabolism in soybean.</p>
<p>One type of the widely applied method of network inference is the use of Pearson Correlation Coefficient or related methods for data analysis (Langfelder and Horvath, <xref ref-type="bibr" rid="B23">2008</xref>; Bassel et al., <xref ref-type="bibr" rid="B2">2011</xref>; Li et al., <xref ref-type="bibr" rid="B27">2012</xref>). Although such correlation analyses can cluster genes with similar functions, the resultant networks do not predict the direction of gene regulation. Many other approaches, such as mutual information (Faith et al., <xref ref-type="bibr" rid="B9">2007</xref>), partial correlation (Faith et al., <xref ref-type="bibr" rid="B9">2007</xref>), random forest (Huynh-Thu et al., <xref ref-type="bibr" rid="B17">2010</xref>), and least angle regression (LARS) (Haury et al., <xref ref-type="bibr" rid="B14">2012</xref>), have been developed to perform inference of directed gene regulatory networks. For well-characterized model organisms such as <italic>Arabidopsis</italic>, known interactions from ChIP-chips or ChIP-seq experiments can be used as prior knowledge in supervised machine learning approaches (Maetschke et al., <xref ref-type="bibr" rid="B34">2014</xref>; Ni et al., <xref ref-type="bibr" rid="B40">2016</xref>). However, for biological systems where little prior information is available, such as in soybeans, unsupervised methods have to be used for network inference. In particular, three methods including co-expression analysis (Bassel et al., <xref ref-type="bibr" rid="B2">2011</xref>), decision trees (Zhu et al., <xref ref-type="bibr" rid="B64">2013</xref>) and mutual information (Gonzalez-Morales et al., <xref ref-type="bibr" rid="B12">2016</xref>) have been successfully applied to identify functional networks in <italic>Arabidopsis</italic> and soybean seeds. With numerous inference methods available, it has been found that congregating the prediction results from multiple methods (so-called &#x0201C;community-based method&#x0201D;) improves the prediction accuracy as compared to any individual method (Marbach et al., <xref ref-type="bibr" rid="B35">2012</xref>).</p>
<p>In this study, we performed computational inference of gene regulatory networks in low phytic acid mutants and the corresponding non-mutant soybean seeds from time course transcriptomic data. In addition to previously published RNA-seq data (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>), we generated new RNA-seq data of developing seeds using a pair of soybean isogenic lines, one carrying the <italic>mips1</italic> mutation (&#x02018;<italic>1mlpa</italic>&#x02019;) and the other the corresponding wild type allele. We implemented a computational pipeline for unsupervised gene regulatory network inference using five different methods: ARACNE (Margolin et al., <xref ref-type="bibr" rid="B36">2006</xref>), Random Forest (Huynh-Thu et al., <xref ref-type="bibr" rid="B17">2010</xref>), LARS (Haury et al., <xref ref-type="bibr" rid="B14">2012</xref>), partial correlation (Schafer and Strimmer, <xref ref-type="bibr" rid="B49">2005b</xref>), and context likelihood relatedness (CLR) (Faith et al., <xref ref-type="bibr" rid="B9">2007</xref>). To improve computational efficiency and interpretability of the inferred network, we adopted the widely used module network approach by which genes were grouped into co-expression modules and inference of gene regulation was performed between transcription factors and gene modules (Segal et al., <xref ref-type="bibr" rid="B50">2003</xref>). We found that many gene modules included genes with meaningful biological functions and some gene modules showed genotype-specific expression patterns. We identified several transcription factors that were differentially expressed between developmental stages and some of the inferred regulatory interactions were specifically found in mutants or non-mutants. Genes involved in phytic acid metabolism and related metabolic processes were found in multiple modules and were predicted to be regulated by different transcription factors. For validation, the predicted interactions were compared with known regulatory interactions observed in the model plant species, <italic>Arabidopsis thaliana</italic>. These findings provide important hypotheses on expression regulation of <italic>myo</italic>-inositol metabolism, and phytohormone signaling in developing <italic>lpa</italic> soybeans. The computational method for inferring regulatory networks is freely available at <ext-link ext-link-type="uri" xlink:href="https://lilabatvt.github.io/LPANetwork/">https://lilabatvt.github.io/LPANetwork/</ext-link>. This method can be used to perform network inference using time series data from soybean or any other crop species.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Genetic materials</title>
<p>Four soybean experimental lines designated as: (i) <italic>3mlpa</italic>, (ii) 3MWT, (iii) <italic>1mlpa</italic>, and (iv) 1MWT were used in this study (Figure <xref ref-type="fig" rid="F1">1</xref>). The <italic>lpa</italic> mutant line, &#x02018;<italic>3mlpa</italic>&#x02019;, carrying three mutations <italic>mips1/mrp-l/mrp-n</italic>, and its non-mutant sibling line with normal phytic acid, &#x02018;3MWT&#x02019;, were derived from crossing of &#x02018;CX-1834&#x02019; (<italic>lpa</italic> line with two <italic>mpr-l/mrp-n</italic> mutations on soybean chromosomes 19 and 3, respectively) with &#x02018;V99-5089&#x02019; (<italic>lpa</italic> line with single <italic>mips1</italic> mutation) (Saghai Maroof et al., <xref ref-type="bibr" rid="B47">2009</xref>). The low phytic acid causing mutations in the parental lines have been mapped to genes Glyma.11G238800 (MIPS1), Glyma.19g169000 (MRP-L), and Glyma.03g167800 (MRP-N) (Saghai Maroof et al., <xref ref-type="bibr" rid="B47">2009</xref>). Another <italic>lpa</italic> line, &#x02018;<italic>1mlpa</italic>&#x02019;, carrying a single <italic>mips1</italic> mutation on soybean chromosome 11, and its isogenic sibling line with normal phytic acid, &#x02018;1MWT&#x02019;, were derived from crossing of &#x02018;Essex&#x02019; (a normal phytic acid line with no mutations) with V99-5089 (Saghai Maroof and Buss, <xref ref-type="bibr" rid="B46">2008</xref>; Glover, <xref ref-type="bibr" rid="B11">2011</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Experimental design and computational pipeline. Samples used in this study include both published (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>) and newly generated data. Differential expression analysis and clustering analysis were used to produce the initial candidate genes and gene modules. Machine learning algorithms were used to construct gene regulatory networks.</p></caption>
<graphic xlink:href="fpls-08-02029-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Plant growth and tissue sampling</title>
<p>Four seeds from each of experimental lines&#x02014;<italic>3mlpa</italic>, 3MWT, <italic>1mlpa</italic>, and 1MWT, were planted in each of 12 pots containing Metro-Mix<sup>&#x000AE;;</sup> 360 (Sun Gro) media topped with GardenPro ULTRA<sub>LITE</sub> soil (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>). These plants (48/line) were grown in growth chambers with 14 /10 h photoperiod, 24&#x000B0;C/16&#x000B0;C temperature, 300&#x02013;400 &#x003BC;E light and 50&#x02013;60% relative humidity. Developing seeds were sampled in triplicates for each experimental line based on seed lengths corresponding to 2&#x02013;4 mm (S1), 4&#x02013;6 mm (S2), 6&#x02013;8 mm (S3), 8&#x02013;10 mm (S4), and 10&#x02013;12 mm (S5), respectively. Samples were flash frozen using liquid nitrogen and stored at &#x02212;70&#x000B0;C. High-quality total RNA (RIN 9&#x02013;10) was extracted from frozen samples using RNeasy Plant Mini Kit, with on-column DNase digestion (QIAGEN). Total of 60 mRNA libraries were prepared from total RNA samples and sequenced as 100SE using HiSeq2000 at the Genome Quebec Innovation Center, Canada.</p>
</sec>
<sec>
<title>Sequence data processing and differential gene expression</title>
<p>Reads were aligned to the latest soybean reference genome (&#x02018;Williams 82&#x02019; Wm82.a2.v1, downloaded from Phytozome<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> with STAR (version 2.4.2) and number of reads mapped to each gene was counted using featureCount (version 1.4.6). Differential gene expression was analyzed using DESeq2 (version 1.8.2) in R (version 3.2.4). Four genotypes, <italic>3mlpa</italic>, 3MWT, <italic>1mlpa</italic>, and 1MWT, were analyzed in this data. For each pair of mutant and corresponding non-mutant, stage-wise comparison was performed to identify differentially expressed genes for each stage (Supplementary Figure <xref ref-type="supplementary-material" rid="SM9">1A</xref>) (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>). For each genotype, between-stage comparisons were performed to identify differentially expressed genes between adjacent developmental stages (Supplementary Figure <xref ref-type="supplementary-material" rid="SM9">1B</xref>). These analyses were performed using DESeq2 with default parameters. Between-stage comparisons and stage-wise comparisons address different type of biological question. Between-stage comparisons find genes that change between stages, but do not directly identify genes that are affected by mutations. Stage-wise comparison, on the other hand, directly finds genes that change between mutant and non-mutant lines, but does not find genes with interactions. Differentially expressed genes were the genes with FDR adjusted <italic>p</italic> &#x0003C; 0.01 and log<sub>2</sub> fold change &#x0003E;1. Differentially expressed genes and their log<sub>2</sub> fold changes are provided as Supplementary Tables <xref ref-type="supplementary-material" rid="SM1">1</xref>&#x02013;<xref ref-type="supplementary-material" rid="SM3">3</xref>. RNA-Seq data used in this study have been deposited into the NCBI Gene Expression Omnibus (GEO) repository under accession number <ext-link ext-link-type="NCBI:geo" xlink:href="GSE101692">GSE101692</ext-link>.</p>
</sec>
<sec>
<title>Inference of gene regulatory networks</title>
<sec>
<title>Expression clustering, gene ontology, and gene function analysis</title>
<p>Gene expression levels for each gene were normalized using DESeq2 and summarized as FPKM (Fragments Per Kilo-base pair per Million reads) values. The gene expression levels (FPKM values) were averaged across replicates and only differentially expressed genes were used in the clustering analysis. K-means clustering (Sherlock, <xref ref-type="bibr" rid="B51">2000</xref>) was performed using R packages, and the number of clusters (K) was determined using the minimum Bayesian Information Criteria (BIC) method (Ramsey et al., <xref ref-type="bibr" rid="B44">2008</xref>). In brief, K was set to be an integer number from 20 to 100 with an incremental step size of 5. For each <italic>K</italic>-value, k-means clustering was performed and BIC statistics were computed. The minimum BIC was achieved with <italic>K</italic> &#x0003D; 60 (Supplementary Figure <xref ref-type="supplementary-material" rid="SM10">2</xref>). Gene Ontology (GO) annotation of all soybean genes was downloaded from Soybase<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> GO enrichment analyses were performed for each gene module. Significantly enriched GO categories were selected using Fisher&#x00027;s exact test with FDR &#x0003C; 0.05 (Supplementary Table <xref ref-type="supplementary-material" rid="SM4">4</xref>). Transcription factor annotation was downloaded from plant TFDB (Jin et al., <xref ref-type="bibr" rid="B20">2015</xref>, <xref ref-type="bibr" rid="B21">2017</xref>). Metabolic pathway genes were downloaded from the SoyCyc 7.0<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref> database from the Plant Metabolic Network<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref> website.</p>
</sec>
<sec>
<title>Network inference methods</title>
<p>To infer regulatory networks, we adopted the methods of module networks (Segal et al., <xref ref-type="bibr" rid="B50">2003</xref>). First, genes were grouped into modules using the k-means clustering method. Second, differentially expressed transcription factors were used as putative regulators for network inference. In our data, we found 60 clusters (gene modules) and 1245 transcription factors that were differentially expressed in at least one comparison. The mean expression profile for each of the 60 modules was computed and the expression levels of 1245 transcription factors were included to construct an expression matrix with 1305 rows (genes) and 20 columns (five developmental stages for four experimental lines). Five distinct network inference algorithms: ARACNE (Margolin et al., <xref ref-type="bibr" rid="B36">2006</xref>), Random Forest (Huynh-Thu et al., <xref ref-type="bibr" rid="B17">2010</xref>), LARS (Haury et al., <xref ref-type="bibr" rid="B14">2012</xref>), partial correlation (Schafer and Strimmer, <xref ref-type="bibr" rid="B49">2005b</xref>), and CLR (Faith et al., <xref ref-type="bibr" rid="B9">2007</xref>), were applied to this expression matrix to infer putative regulatory interactions between each transcription factor and gene modules. These methods were chosen because they represent a diverse set of computational methods for gene network inference. These methods were selected also because they were ranked as top performers in a recently published benchmark of network inference methods (DREAM challenge) (Marbach et al., <xref ref-type="bibr" rid="B35">2012</xref>). Details of each method, statistical analysis of network and network validation are provided as supplementary text.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Summary of differential gene expression analysis</title>
<p>Transcriptome sequencing data from five developing seed stages of four soybean lines (<italic>3mlpa</italic>, 3MWT, <italic>1mlpa</italic>, and 1MWT) were analyzed (Figure <xref ref-type="fig" rid="F1">1</xref>). Stage-wise comparisons were performed for mutants (<italic>3mlpa, 1mlpa</italic>) and their corresponding non-mutants (3MWT, 1MWT), to determine the number of genes affected by the mutation at each stage (Figure <xref ref-type="fig" rid="F2">2</xref>). For stage-wise comparisons, when <italic>1mlpa</italic> was compared with 1MWT, we found fewer than 250 differentially expressed genes in all time points (Supplementary Figure <xref ref-type="supplementary-material" rid="SM10">2</xref>). However, we found more than 4000 differentially expressed genes between <italic>3mlpa</italic> and 3MWT (Supplementary Figure <xref ref-type="supplementary-material" rid="SM10">2</xref>). It is expected to have higher number of differentially expressed genes for comparison between recombinant inbred lines (<italic>3mlpa</italic> vs. 3MWT) as opposed to that between near-isogenic lines (<italic>1mlpa</italic> vs. 1MWT). Few genes are differentially expressed in all five stages when comparing <italic>3mlpa</italic> vs. 3MWT (Figure <xref ref-type="fig" rid="F2">2B</xref>). These results suggest that genes affected by mutations are unique at each developmental stage. Between-stages comparisons were performed for each genotype separately (Figure <xref ref-type="fig" rid="F1">1</xref>, Supplementary Figure <xref ref-type="supplementary-material" rid="SM9">1</xref>). Results of the stage-wise and between-stages differential expression analyses are provided as Supplementary Tables <xref ref-type="supplementary-material" rid="SM1">1</xref>, <xref ref-type="supplementary-material" rid="SM2">2</xref>, respectively. For between-stages comparisons (Supplementary Figure <xref ref-type="supplementary-material" rid="SM9">1B</xref>), we found that hundreds of genes are differentially expressed when comparing adjacent developmental stages (Supplementary Figure <xref ref-type="supplementary-material" rid="SM10">2</xref>). However, there are few genes differentially expressed across all stages. For example, in <italic>3mlpa</italic> mutant, only 2 genes are differentially expressed between any two adjacent stages (Figure <xref ref-type="fig" rid="F2">2A</xref>). We found 1643 genes that are differentially expressed between developmental stages and found in all four genotypes in this study (Figure <xref ref-type="fig" rid="F2">2C</xref>), suggesting these genes are a core set of genes that change expression between developmental stages and are not affected by the genotypes. We determined the number of differentially expressed genes in each of the comparisons (Supplementary Figure <xref ref-type="supplementary-material" rid="SM10">2</xref>). Interestingly, the highest numbers of differentially expressed genes were found in two comparisons: between stage 1 and stage 2 for 1MWT and for 3MWT (3MWT S2 vs. S1 and 1MWT S2 vs. S1) suggesting that non-mutant plants have a high number of differentially expressed genes in the early stages of seed development than mutants.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Venn diagrams. S1, S2, S3, S4, and S5 represent five developmental stages. <bold>(A)</bold> Number of genes differentially expressed between consecutive stages in <italic>3mlpa</italic> seeds. <bold>(B)</bold> Number of genes differentially expressed for each stage when comparing <italic>3mlpa</italic> to 3MWT seeds. <bold>(C)</bold> Number of genes differentially expressed for different genotypes for between stage comparisons.</p></caption>
<graphic xlink:href="fpls-08-02029-g0002.tif"/>
</fig>
<p>The two sets of low-phytic acid causing mutations (<italic>mips1</italic> and <italic>mrp-l/mrp-n</italic>) interrupt the phytic acid biosynthesis and transmembrane transporter activity, ultimately reducing the seed emergence potential in our mutant lines. We were, therefore, interested in studying the behavior of genes associated with phytic acid metabolism, abscisic acid (ABA) and auxin signaling and metabolism, and transmembrane transport. The log<sub>2</sub> fold change for significantly differentially expressed genes that belonged to this category is summarized in Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>.</p>
</sec>
<sec>
<title>Co-expression modules represent distinct functional categories</title>
<p>We identified 12998 genes that are differentially expressed in at least one of the comparisons and these genes were used for K-means clustering analysis to identify co-expressed gene modules (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F3">3</xref>). We found that the optimal number of clusters (modules) is 60 based on BIC (Supplementary Figure <xref ref-type="supplementary-material" rid="SM11">3</xref>). Average expression levels of all genes in each module were used to generate a heat map (Figure <xref ref-type="fig" rid="F3">3A</xref>) and GO enrichment analyses were performed for each of the modules (Figure <xref ref-type="fig" rid="F3">3B</xref>, Supplementary Table <xref ref-type="supplementary-material" rid="SM4">4</xref>). The expression modules can be approximately classified into three main patterns. In pattern 1, there are 29 modules highly expressed at the early stage of seed development as shown in upper half of the heat map (Figure <xref ref-type="fig" rid="F3">3A</xref>, cluster 24 to cluster 18). In pattern 2, there are 11 modules highly expressed in the later stage of seed development as shown by the lower portion of the heat map (Figure <xref ref-type="fig" rid="F3">3A</xref>, modules 9 to 47, except for module 14, which shows high expression at both first and last developmental stages). In pattern 3, 12 modules showed high expression in the middle of the developmental stages but low expression levels in the early and late developmental stages. Genotype-specific expression patterns were also found by clustering analysis. For example, modules 24, 57, and 15 are highly expressed in <italic>3mlpa</italic> at the first time point, whereas the expression levels are not high in the other three genotypes. The modules identified in near isogenic lines (<italic>1mlpa</italic> and 1MWT) showed highly similar expression patterns than those identified in recombinant inbred lines (<italic>3mlpa</italic> and 3MWT).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Clustering and gene function analysis. <bold>(A)</bold> Gene expression clusters. Color indicates normalized expression levels. Genes were clustered based on the K-means clustering algorithm. Hierarchical clustering was performed on the rows of the average expression levels for each cluster. Numbers on the right of the heatmap represent cluster id (CID, 1 to 60). <bold>(B)</bold> GO functional enrichment analysis. Rows of this heatmap is organized as the same order as the expression clusters. Each column represents a GO category. Color represents -log10 FDR from enrichment analysis. Some enriched categories were summarized and annotated under the heat map. The complete set of enriched GO categories is provided as supplementary information.</p></caption>
<graphic xlink:href="fpls-08-02029-g0003.tif"/>
</fig>
<p>GO enrichment analyses showed that many gene modules are enriched with genes in specific functional categories (Supplementary Table <xref ref-type="supplementary-material" rid="SM4">4</xref>). For example, genes in modules 22, 49, 53, and 55 are highly expressed during the early stages of seed development and are enriched with genes with functions in hormone signaling and responses (Figure <xref ref-type="fig" rid="F3">3B</xref>, box b). Of these, several genes showed increased expression in <italic>3mlpa</italic> (S1) while decreased in <italic>1mlpa</italic> (S1) when compared with respective non-mutant lines at stage 1 (Table <xref ref-type="table" rid="T1">1</xref>). We also found that genes in module 42 are highly expressed at the last stage of seed development and this module is enriched with genes functioning in seed dormancy, seed germination and lipid storage (Figure <xref ref-type="fig" rid="F3">3B</xref>). Genes in module 36 are highly expressed in the middle stages of seed development and are enriched with genes in starch and lipid biosynthesis (Figure <xref ref-type="fig" rid="F3">3B</xref>, box d). The functional enrichment of genes in these modules indicates that our clustering analysis can find genes representing biological functions that are known to be active at different stages of seed development. We also found that some modules have genotype-specific expression patterns. For example, module 7 is highly expressed only in 3MWT, which is enriched with genes functioning in photosynthesis, translations, and transcription (Figure <xref ref-type="fig" rid="F3">3B</xref>, boxes a,c). This result shows some gene modules have genotype-specific expression pattern and are enriched with specific functional genes.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Differentially expressed genes from co-expression modules in early stages of seed development.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Cluster ID</bold></th>
<th valign="top" align="left"><bold>GO Term</bold></th>
<th valign="top" align="left"><bold>Gene Name</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Log2 fold change</bold></th>
<th valign="top" align="left"><bold>Arabidopsis homolog</bold></th>
<th valign="top" align="left"><bold>Gene Symbol</bold></th>
<th valign="top" align="left"><bold>Description</bold></th>
</tr>
<tr>
<th/>
<th/>
<th/>
<th valign="top" align="left"><bold>3mlpa vs. 3MWT-S1</bold></th>
<th valign="top" align="left"><bold>1mlpa vs. 1MWT-S1</bold></th>
<th/>
<th/>
<th/>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">22</td>
<td valign="top" align="left">GO:0048838: release of seed from dormancy</td>
<td valign="top" align="left">Glyma.01G153300</td>
<td valign="top" align="left">3.13805</td>
<td valign="top" align="left">&#x02212;2.84401</td>
<td valign="top" align="left">AT4G19230</td>
<td valign="top" align="left"><italic>CYP707A1</italic></td>
<td valign="top" align="left">Cytochrome P450, family 707, subfamily A, polypeptide 1</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GO:0009738: abscisic acid-activated signaling pathway</td>
<td valign="top" align="left">Glyma.12G073000</td>
<td valign="top" align="left">2.67451</td>
<td valign="top" align="left">&#x02212;2.55671</td>
<td valign="top" align="left">AT3G45640</td>
<td valign="top" align="left"><italic>MPK3</italic></td>
<td valign="top" align="left">Mitogen-activated protein kinase 3</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.U021800</td>
<td valign="top" align="left">2.5946</td>
<td valign="top" align="left">&#x02212;2.50453</td>
<td valign="top" align="left">AT3G45640</td>
<td valign="top" align="left"><italic>MPK3</italic></td>
<td valign="top" align="left">Mitogen-activated protein kinase 3</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.07G023300</td>
<td valign="top" align="left">2.96832</td>
<td valign="top" align="left">&#x02212;2.24850</td>
<td valign="top" align="left">AT1G80840</td>
<td valign="top" align="left"><italic>WRKY40</italic></td>
<td valign="top" align="left">WRKY DNA-binding protein 40</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.10G082400</td>
<td valign="top" align="left">1.40473</td>
<td valign="top" align="left">&#x02212;1.55483</td>
<td valign="top" align="left">AT3G11820</td>
<td valign="top" align="left"><italic>SYP121, SYR1, PEN1</italic></td>
<td valign="top" align="left">Syntaxin of plants 121</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.08G271900</td>
<td valign="top" align="left">2.04156</td>
<td valign="top" align="left">&#x02212;1.54559</td>
<td valign="top" align="left">AT1G32640</td>
<td valign="top" align="left"><italic>RD22BP1, JAI1, JIN1, MYC2, ZBF1</italic></td>
<td valign="top" align="left">Basic helix-loop-helix (bHLH) DNA-binding family protein</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.03G138000</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x02212;1.46586</td>
<td valign="top" align="left">AT3G57530</td>
<td valign="top" align="left"><italic>CPK32, CDPK32</italic></td>
<td valign="top" align="left">Calcium-dependent protein kinase 32</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.05G194100</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x02212;1.41822</td>
<td valign="top" align="left">AT3G52430</td>
<td valign="top" align="left"><italic>PAD4</italic></td>
<td valign="top" align="left">Alpha/beta-Hydrolases superfamily protein</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.17G126400</td>
<td valign="top" align="left">1.18340</td>
<td valign="top" align="left">&#x02212;1.31950</td>
<td valign="top" align="left">AT3G48360</td>
<td valign="top" align="left"><italic>BT2</italic></td>
<td valign="top" align="left">BTB and TAZ domain protein 2</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">42</td>
<td valign="top" align="left">GO:0009845: seed germination</td>
<td valign="top" align="left">Glyma.17G086400</td>
<td valign="top" align="left">1.56707</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">AT3G01570</td>
<td valign="top" align="left"><italic>-</italic></td>
<td valign="top" align="left">Oleosin family protein</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">49</td>
<td valign="top" align="left">GO:0009738: abscisic acid-activated signaling pathway</td>
<td valign="top" align="left">Glyma.15G078600</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x02212;1.84111</td>
<td valign="top" align="left">AT1G28480</td>
<td valign="top" align="left"><italic>GRX480, roxy19</italic></td>
<td valign="top" align="left">Thioredoxin superfamily protein</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.14G066400</td>
<td valign="top" align="left">0</td>
<td valign="top" align="left">&#x02212;1.10638</td>
<td valign="top" align="left">AT3G11410</td>
<td valign="top" align="left"><italic>PP2CA, AHG3</italic></td>
<td valign="top" align="left">Protein phosphatase 2CA</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">53</td>
<td valign="top" align="left">GO:0009737: response to abscisic acid</td>
<td valign="top" align="left">Glyma.13G119500</td>
<td valign="top" align="left">1.60968</td>
<td valign="top" align="left">&#x02212;2.02755</td>
<td valign="top" align="left">AT3G17510</td>
<td valign="top" align="left"><italic>CIPK1, SnRK3.16</italic></td>
<td valign="top" align="left">CBL-interacting protein kinase 1</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Glyma.12G186800</td>
<td valign="top" align="left">2.75115</td>
<td valign="top" align="left">&#x02212;1.95635</td>
<td valign="top" align="left">AT2G27310</td>
<td/>
<td valign="top" align="left">F-box family protein</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GO:0009738: abscisic acid-activated signaling pathway</td>
<td valign="top" align="left">Glyma.12G186800</td>
<td valign="top" align="left">2.75115</td>
<td valign="top" align="left">&#x02212;1.95635</td>
<td valign="top" align="left">AT2G27310</td>
<td/>
<td valign="top" align="left">F-box family protein</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GO:0009733: response to auxin</td>
<td valign="top" align="left">Glyma.13G094900</td>
<td valign="top" align="left">4.14408</td>
<td valign="top" align="left">&#x02212;2.26796</td>
<td valign="top" align="left">AT5G57560</td>
<td valign="top" align="left"><italic>TCH4, XTH22</italic></td>
<td valign="top" align="left">Xyloglucan endotransglucosylase/hydrolase family protein</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Regulatory network interactions</title>
<p>Five different network inference algorithms were used to infer putative regulatory interactions between regulators and their targets (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F4">4</xref>). Fifty-four interactions between 54 transcription factors and 32 modules were predicted by all five algorithms (Supplementary Table <xref ref-type="supplementary-material" rid="SM5">5</xref>). Some modules were predicted to be regulated by more than one transcription factor and no transcription factor was predicted to regulate two modules. The identified interactions represent highly stringent predictions and are a very conservative estimation of all possible interactions since only 0.06% of all 74,700 possible interactions were found to be significant by all five computational methods. Four hundred six interactions between 348 transcription factors and 60 modules were supported by four or more methods (Supplementary Figure <xref ref-type="supplementary-material" rid="SM12">4</xref>, Supplementary Table <xref ref-type="supplementary-material" rid="SM6">6</xref>), representing a larger number for predicted regulatory interactions. The network figure includes both directed and undirected edges (Figure <xref ref-type="fig" rid="F4">4</xref>). Each directed edge connects a transcription factor with its targeted gene module. Such edge represents predicted regulatory interactions. Each undirected edge connects a transcription factor to the module this factor belongs to. These undirected edges reflect the fact that each transcription factor is also co-expressed with other genes in the genome and can be assigned to a specific gene module. The regulatory interactions (directed edges) are further classified based on the differential expression pattern of the regulatory TFs. We found 10 TF-module interactions (black arrows) in which the TFs are differentially expressed between stage comparisons (Supplementary Figure <xref ref-type="supplementary-material" rid="SM9">1B</xref>) in both mutants (<italic>3mlpa</italic> and <italic>1mlpa</italic>) and both non-mutants (3MWT and 1MWT). These interactions are not affected by the mutations or genetic backgrounds. We found 10 interactions (Figure <xref ref-type="fig" rid="F4">4</xref>, green arrows) in which the TFs are differentially expressed between stage comparisons for non-mutants but not in mutants. These interactions are potentially lost due to the mutations. We also found nine interactions (blue arrows) that are not present in the non-mutants, but are present in either one of the mutants, suggesting that these interactions are gained in the mutants. Finally, we found 14 interactions (red arrows) that are not present in either the mutants or the non-mutants, but the TFs are differentially expressed when comparing <italic>3mlpa</italic> to 3MWT at one or more developmental stages. These interactions are altered in <italic>3mlpa</italic>/3MWT and do not change the trajectory of gene expression between stages but affect gene expression within specific stages. We found that many TFs are also connected to the target modules by undirected edges, indicating that these genes are co-expressed with their target genes. We also found several cases where a TF does not regulate its own module but is regulating other modules, suggesting our method can find non-linear interactions between TF and target modules. All the predicted regulatory interactions are provided as a (Supplementary Tables <xref ref-type="supplementary-material" rid="SM5">5</xref>, <xref ref-type="supplementary-material" rid="SM6">6</xref>).</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Gene regulatory networks in low phytic acid mutants and non-mutant seeds. Directed edges (with arrowheads) represent predicted regulatory interactions. Undirected edges (without arrowheads) connect each transcription factor and the co-expression module to which the transcription factor belongs. Black arrows: the TFs are differentially expressed in between stage comparisons in both mutants (<italic>3mlpa</italic> and <italic>1mlpa</italic>) and both non-mutants (3MWT and 1MWT). Green arrows: the TFs are differentially expressed in between stage comparisons for non-mutants but not in mutants. Blue arrows: the TFs are not differentially expressed in the non-mutants but are differentially expressed in either one of the mutants. Red arrows: the TFs are differentially expressed when comparing <italic>3mlpa</italic> to 3MWT at one or more developmental stages. Grey arrows: interactions that do not belong any of the above four categories.</p></caption>
<graphic xlink:href="fpls-08-02029-g0004.tif"/>
</fig>
</sec>
<sec>
<title>Validation of predicted regulatory networks</title>
<p>To validate the results from the computational inference, we compared the regulatory network as predicted by our methods with published regulatory interactions observed in the model plant species, <italic>Arabidopsis</italic>. We combined three recently published <italic>Arabidopsis</italic> genome-wide gene regulatory networks (Sparks et al., <xref ref-type="bibr" rid="B55">2016</xref>), including 2,914 regulatory interactions between 578 regulators and 717 targets. We mapped soybean genes to <italic>Arabidopsis</italic> genome and searched for predicted regulator-module interactions, which are also found in <italic>Arabidopsis</italic>. Among 54 interactions we predicted, five TF-module interactions and 13 TF-gene interactions from this dataset are also found in the <italic>Arabidopsis</italic> regulatory networks (Supplementary Table <xref ref-type="supplementary-material" rid="SM7">7</xref>), providing support for the predicted gene regulatory networks. This small overlapping is expected because the current <italic>Arabidopsis</italic> interaction network (2914 interactions) is only a tiny fraction of true interactions that happen <italic>in-vivo</italic> (see discussion).</p>
<p>Transcription factors regulate their target through binding of sequence specific motifs in the promoter regions of the target genes. To further validate the predicted regulatory networks, we performed promoter motif search. Among many programs that are available for motif discovery, the MEME suite contains the most comprehensive sets of programs that allows users to perform motif discovery, motif search and motif comparison. We used the MEME program to identify motifs in the promoter regions of genes in each gene module. We tested the enrichment of these newly discovered motifs and identified 101 motifs that are enriched in 36 modules, with each module having one or more enriched motifs. In our predicted interaction networks, there are 54 transcription factors that regulate 32 modules. We found that 21 out of these 32 modules contain one or more enriched motifs (Supplementary Table <xref ref-type="supplementary-material" rid="SM8">8</xref>). These motifs are putative binding sites of the 54 transcription factors that are regulating genes in these modules.</p>
<p>To further validate the results of the motif search, we compared our newly discovered motifs to a database of motifs generated by direct sequencing of binding sites of over 400 <italic>Arabidopsis</italic> TFs. The pattern of these motifs is represented by position specific weight matrices (PSWM) (Bailey et al., <xref ref-type="bibr" rid="B1">2009</xref>). This comparison aims at testing whether any of the enriched motifs are similar to the binding motifs of the genes in the same gene family in <italic>Arabidopsis</italic>. For example (Table <xref ref-type="table" rid="T2">2</xref>), we found that a bZIP transcription factor (Glyma.19G244800, whose most similar <italic>Arabidopsis</italic> gene is AT5G28770) is regulating module 57 in our predicted network. Our analysis found a GCCACGT motif enriched in the promoter regions of module 57 (<italic>p</italic> &#x0003C; 1.21e-3). This motif is highly similar to the binding motif (GCCACGT) of an <italic>Arabidopsis</italic> bZIP transcription factor (<italic>p</italic> &#x0003C; 1.82e-10). Four such examples are shown in Table <xref ref-type="table" rid="T2">2</xref>. Among the 21 modules with enriched motifs in our predicted regulatory networks, we found that 17 motifs are highly similar to the corresponding <italic>Arabidopsis</italic> motifs in the same TF gene family (Supplementary Table <xref ref-type="supplementary-material" rid="SM8">8</xref>), providing strong support to the validity of our predictions.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Motifs enrichment analysis.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>TF name</bold></th>
<th valign="top" align="left"><bold>TF family</bold></th>
<th valign="top" align="left"><bold>Target module name</bold></th>
<th valign="top" align="left"><bold>ATH TF name</bold></th>
<th valign="top" align="left"><bold>Motif enrich adj <italic>p</italic>-value</bold></th>
<th valign="top" align="left"><bold>MEME motif</bold></th>
<th valign="top" align="left"><bold>DAP-seq motif</bold></th>
<th valign="top" align="left"><bold>Motif similarity <italic>p</italic>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Glyma.19G244800</td>
<td valign="top" align="left">bZIP</td>
<td valign="top" align="left">57</td>
<td valign="top" align="left">AT5G28770</td>
<td valign="top" align="left">1.21E-03</td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0001.tif"/></td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0002.tif"/></td>
<td valign="top" align="left">1.82E-10</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Glyma.03G042700</td>
<td valign="top" align="left">WRKY</td>
<td valign="top" align="left">55</td>
<td valign="top" align="left">AT2G38470</td>
<td valign="top" align="left">1.14E-03</td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0003.tif"/></td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0004.tif"/></td>
<td valign="top" align="left">4.14E-05</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Glyma.05G110700</td>
<td valign="top" align="left">bHLH</td>
<td valign="top" align="left">35</td>
<td valign="top" align="left">AT4G37850</td>
<td valign="top" align="left">6.20E-03</td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0005.tif"/></td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0006.tif"/></td>
<td valign="top" align="left">8.09E-05</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">Glyma.17G101000</td>
<td valign="top" align="left">Dof</td>
<td valign="top" align="left">3</td>
<td valign="top" align="left">AT3G47500</td>
<td valign="top" align="left">3.79E-04</td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0007.tif"/></td>
<td valign="top" align="left"><inline-graphic xlink:href="fpls-08-02029-i0008.tif"/></td>
<td valign="top" align="left">8.87E-05</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>&#x0201C;<italic>ATH TF name&#x0201D; is Arabidopsis transcription factor gene homologous to the soybean transcription factor gene. &#x0201C;Motif enrich adj p-value&#x0201D; is the enrichment p-value for each of the motif found in the promoter regions of predicted gene modules. &#x0201C;DAP-seq motif&#x0201D; is the most similar motif from DAP-seq database. &#x0201C;Motif similarity p-value&#x0201D; represents the significance of similarity between the motifs found in promoters in the gene modules and the DAP-seq motifs. All significantly enriched motifs and their corresponding DAP-seq motifs are provided in Supplementary Table <xref ref-type="supplementary-material" rid="SM8">8</xref></italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Regulatory network changes and genes in phytic acid metabolic pathways</title>
<p>To understand the connections between transcription regulation and metabolic pathways that were altered in the <italic>3mlpa</italic> and <italic>1mlpa</italic> mutant lines, we downloaded the metabolic pathway annotation from the SoyCyc 7.0 database. We mapped genes in myo-inositol metabolism, stachyose metabolism and sucrose metabolism to different gene modules, because these metabolites are altered as a result of <italic>lpa</italic> mutations (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>). We found that there were 64 genes involved in these metabolic pathways, which mapped to 34 gene modules (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>). Twelve of the 64 genes were involved in stachyose (or raffinose family oligosaccharides (RFOs)) biosynthesis and eight were associated with inferred gene regulatory networks (Figure <xref ref-type="fig" rid="F5">5</xref>). In module 29, which is up-regulated in the first stage of seed development, two genes (Glyma.02G303300 and Glyma.14G010500, both encoding raffinose synthase / seed imbibition protein 1) were found to be regulated by a bZIP transcription factor (Glyma.02G131700). This bZIP transcription factor is homologous to <italic>ABF1</italic> (ABA response element-binding factor 1) in <italic>Arabidopsis</italic>, providing a potential connection between ABA response and stachyose biosynthesis (Figure <xref ref-type="fig" rid="F5">5</xref>). Some of the enzymes involved in <italic>myo</italic>-inositol biosynthesis are regulated similarly in mutant and non-mutant lines. For example, inositol-polyphosphate 5-phosphatase (Glyma.20G170500) and inositol-phosphate phosphatase (Glyma.09G011100) are found in module 16. Genes in this module are up-regulated during mid-stages of seed development. Module 16 is predicted to be regulated by two transcription factors, a bHLH transcription factor (Glyma.13G040100) and a C2H2 transcription factor (Glyma.13G327500). The bHLH transcription factor is homologous to SPCH, which regulates stomatal lineage specification during embryo development (Danzer et al., <xref ref-type="bibr" rid="B7">2015</xref>). Some regulatory interactions are only found in non-mutants. For example, genes from module 40 are predicted to be regulated by Glyma.17G085600, a MYB-related transcription factor, and this module contains two genes related to <italic>myo</italic>-inositol biosynthesis (Glyma.07G107000 encoding inositol-polyphosphate 5-phosphatase and Glyma.05G180600 encoding inositol-1-phosphate synthase). This module is highly expressed in the early stage of seed development and the MYB-related transcription factor is similar to <italic>RSM1</italic> in Arabidopsis, which has been found to be related to auxin signaling in early morphogenesis (Hamaguchi et al., <xref ref-type="bibr" rid="B13">2008</xref>). Several target genes associated with regulation of phytic acid biosynthesis pathway matched with those discovered in co-expression network of developing maize kernel with low phytic acid content (Zhang et al., <xref ref-type="bibr" rid="B63">2016</xref>). These predictions provide genotype-specific, testable hypotheses that may connect gene expression patterns with putative regulatory TFs and hormone regulations during different seed developmental stages.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Schematic diagram of regulation of inositol pathway in low phytic acid soybean mutants. Black arrows represent the flow of <italic>myo</italic>-inositol in multiple pathways in non-mutant plants. Red solid arrows with <italic>mips1</italic> label represent mutation in the rate-limiting first step of inositol pathway, catalyzed by myo-inositol phosphate synthase. Red dashed double arrows represent mutation in MRP-type ABC transporters (<italic>mrp-l/mrp-n</italic>) that block the last step in the inositol pathway, which is the movement of phytic acid to storage vacuoles. The <italic>myo</italic>-inositol pathway is blocked in single mutant (<italic>mips1</italic> or <italic>1mlpa</italic>) at the first step, and in triple mutant (<italic>mips1/mrp-l/mrp-n</italic>) at both first and last steps. Blue triangles represent predicted positive regulation in non-mutants. Red triangles represent predicted gene regulations in both single and triple mutants. For example, a bZIP transcription factor (Glyma.02G131700) is homologous to the well-known ABF1, and is involved in ABA signaling. This transcription factor is predicted to positively regulate raffinose synthase in non-mutant genotypes. A DOF transcription factor (Glyma.17G101000) is predicted to regulate inositol phosphatase in mutants. This enzyme is involved in breakdown of inositol pathway intermediates to form <italic>myo</italic>-inositol. A MYB transcription factor (Glyma.13G309200) is predicted to regulate <italic>myo</italic>-inositol transporter in mutants.</p></caption>
<graphic xlink:href="fpls-08-02029-g0005.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this study, we performed computational inference of gene regulatory networks using data from developing soybean seeds from two mutants (<italic>3mlpa</italic> and <italic>1mlpa</italic>), with <italic>lpa</italic>-causing mutations, and the respective non-mutant siblings (3MWT and 1MWT). We identified co-expression gene modules with distinct and genotype specific expression patterns. These gene modules are also enriched with genes with various functional categories that are related to different stages of seed development. We identified transcription factors and their putative targets using supervised machine learning methods. Some of these transcription factors are differentially expressed only in non-mutants or only in mutants, suggesting that their regulations are lost or gained due to mutations. Many genes that encode enzymes in the metabolic processes of phytic acid, myo-inositol, sucrose, and stachyose and related oligosaccharides are found in these gene modules. Overall, our analysis provides a framework to connect transcription factors with genes in biological processes such as phytic acid metabolism, auxin-abscisic acid signaling and seed dormancy.</p>
<sec>
<title>Importance of regulatory network inference</title>
<p>The predicted interactions provide a testable hypothesis for experimental validation using transgenic plants or ChIP-seq experiments. The results from this analysis can also be used to guide interpretation of other genomic mapping experiments such as genome wide association studies or quantitative trait loci analyses and to provide guidance for refining candidate gene lists. Soybean has a complex genome, which encodes over 4,500 putative transcription factors and over 46,000 coding genes. Although transcriptome data have become widely accessible for research in soybean and other crop species, it is still challenging to identify mechanistic connections between the observed transcriptome data with underlying regulatory networks. Differential gene expression analyses can be used to identify candidate genes that change under certain conditions or in specific mutants. However, in most situations, one still faces the problem of interpreting a long list of differentially expressed genes.</p>
<p>Our approach provides one alternative solution to this problem using well-developed machine learning methods to infer regulatory interactions. Our method implements the &#x0201C;community approach,&#x0201D; which has been shown to provide better performance than any individual method alone. The five computational methods are based on fundamentally different statistical and mathematical formulations thus complement each other and provide a list of highly confident prediction results. Our approach successfully reduces the total number of candidate genes from over 10,000 genes that change in at least one comparison to 54 transcription factors, providing a much shorter list of key genes that can be focused on for validation experiments. However, combining five methods also limited the total number of predicted interactions (54 predicted interactions), because each method predicts some interactions that are not predicted by other methods.</p>
<p>We validated the predicted interactions using <italic>Arabidopsis</italic> gene regulatory networks. <italic>Arabidopsis</italic> is the model plant species, which provides most gene regulatory network information among all plant species. Although soybean and <italic>Arabidopsis</italic> diverged over 120 million years ago, key genes in metabolic pathways and signaling networks are conserved in both species (Jung et al., <xref ref-type="bibr" rid="B22">2000</xref>; Hegeman et al., <xref ref-type="bibr" rid="B15">2001</xref>; Le et al., <xref ref-type="bibr" rid="B24">2010</xref>; Xu et al., <xref ref-type="bibr" rid="B61">2011</xref>; Leite et al., <xref ref-type="bibr" rid="B26">2014</xref>; Gerrard Wheeler et al., <xref ref-type="bibr" rid="B10">2016</xref>), which is expected for a physiological process as conserved as seed development. Therefore, one can expect some regulatory interactions to be conserved between these two species. In fact, 13 interactions predicted by our method are also found in the <italic>Arabidopsis</italic> gene regulatory networks (Supplementary Table <xref ref-type="supplementary-material" rid="SM7">7</xref>). The <italic>Arabidopsis</italic> genome contains approximately 2200 TFs and more than 27000 genes (Jin et al., <xref ref-type="bibr" rid="B20">2015</xref>, <xref ref-type="bibr" rid="B21">2017</xref>; Cheng et al., <xref ref-type="bibr" rid="B6">2016</xref>). There are more than 59 million possible interactions between these TF and genes. Although the number of biologically active interactions is probably &#x0003C; 1% of all possible interactions, the total number of true interactions is still far more than the 2914 interactions that were used in this comparison. Therefore, it is not surprising that we found a small overlap between the predicted interactions and those identified in <italic>Arabidopsis</italic>. As more interactions will be identified in both plant species, we would expect such overlap to increase. In our analysis, 17 motifs from 21 enriched modules (Supplementary Table <xref ref-type="supplementary-material" rid="SM8">8</xref>) similar to the motifs identified in <italic>Arabidopsis</italic> orthologous genes, indicating that the interactions identified in this study are likely to be conserved between the two species. The actual number of conserved interactions is likely to be underestimated, because the regulatory network from <italic>Arabidopsis</italic> is far from complete. Nevertheless, our results provide an important first step toward characterizing gene regulatory networks in soybeans and other crop species.</p>
<p>One would expect more transcription factors being active during the seed development process. To observe a larger number of predicted interactions, we also provide results that are predicted by four out of five methods (Supplementary Table <xref ref-type="supplementary-material" rid="SM6">6</xref>). This can be further extended to include predictions from fewer methods. Although aggregating multiple methods has been shown to outperform individual methods, some predictions by a specific method can represent interactions that cannot be detected by other approaches. If a specific target gene or specific function is of interest, one can also use our method to generate a ranked list of all predictions for the target of interest and use predicted regulators as candidate genes.</p>
</sec>
<sec>
<title>Regulation of <italic>myo</italic>-inositol metabolism in <italic>3mlpa</italic> mutant line</title>
<p><italic>Myo-</italic>inositol is an essential signaling molecule with multifunctional properties including gene regulation, chromatin modeling, mRNA transport, signal transduction, cell death, pathogen resistance, vesicle trafficking, plasma membrane, and cell wall formation (Martin, <xref ref-type="bibr" rid="B37">1998</xref>; Stevenson et al., <xref ref-type="bibr" rid="B56">2000</xref>; Chen and Xiong, <xref ref-type="bibr" rid="B5">2010</xref>). It is synthesized in a two-step pathway (Figure <xref ref-type="fig" rid="F5">5</xref>), where glucose-6-phosphate is first converted to inositol monophosphate (IMP), a rate-limiting step catalyzed by MIPS1 enzyme, followed by dephosphorylation of IMP to form <italic>myo</italic>-inositol (Loewus and Murthy, <xref ref-type="bibr" rid="B32">2000</xref>). Upon synthesis, <italic>myo</italic>-inositol is utilized in and recycled from multiple metabolic pathways such as biosynthesis of phytic acid and RFOs (such as stachyose and raffinose), inositol and phosphatidylinositol intermediates, auxin-inositol conjugates and glucuronic acid.</p>
<p>The <italic>mips1</italic> mutation in <italic>Arabidopsis</italic> is associated with reduction in cellular <italic>myo</italic>-inositol levels and defects in early embryogenesis (Meng et al., <xref ref-type="bibr" rid="B38">2009</xref>; Chaouch and Noctor, <xref ref-type="bibr" rid="B4">2010</xref>; Chen and Xiong, <xref ref-type="bibr" rid="B5">2010</xref>; Donahue et al., <xref ref-type="bibr" rid="B8">2010</xref>). The <italic>mips1</italic> mutation in soybean (as in parent V99-5089, <italic>1mlpa</italic> of this study) is associated with decreased levels of phytic acid and RFOs such as raffinose and stachyose, increased levels of sucrose and low emergence. The soybean <italic>mips1</italic> mutants also displayed normal RFO phenotype upon application of exogenous <italic>myo-</italic>inositol (Hitz et al., <xref ref-type="bibr" rid="B16">2002</xref>). It is likely that, similar to <italic>Arabidopsis</italic>, the <italic>mips1</italic> mutation in soybean reduces <italic>myo</italic>-inositol levels, preventing RFO biosynthesis in parent V99-5089 (or in <italic>1mlp</italic>a), as <italic>myo</italic>-inositol is one of the starting substrates in this pathway. Since <italic>myo</italic>-inositol is not consumed by synthesis of RFO (it is a necessary intermediate, but recycled along the pathway), it is possible that the concentration of <italic>myo</italic>-inositol is reduced to such a low level such that synthesis of RFOs is greatly reduced. Sucrose is consumed in stachyose synthesis; it is therefore possible that inhibition of this pathway by the absence of <italic>myo</italic>-inositol is causing the increase in sucrose levels in V99-5089, due to accumulation of unused sucrose substrate.</p>
<p>The <italic>mrp-l/mrp-n</italic> mutations resulted in reduced seed phytic acid and low emergence, but did not alter the RFOs composition. In addition, the <italic>myo-</italic>inositol content in <italic>mrp-l/mrp-n</italic> mutant increases during the seed development phase prior to maturation (Israel et al., <xref ref-type="bibr" rid="B18">2011</xref>). This suggests that in the presence of <italic>mrp-l/mrp-n</italic> mutations, the <italic>mips1-</italic>associated decrease in RFOs composition is restored, despite the reduced myo-inositol production due to the <italic>mips1</italic> mutation. This supports the prevalent hypothesis that the lack of transporters in the <italic>mrp-l/mrp-n</italic> mutant may trigger hydrolysis of cytoplasmic phytic acid to form inositol intermediates and <italic>myo</italic>-inositol, hence elevating the <italic>myo</italic>-inositol levels and preventing phytic acid synthesis, diverting the metabolic pathways to <italic>myo</italic>-inositol production (Figure <xref ref-type="fig" rid="F5">5</xref>). Another possibility is that lack of intracellular <italic>myo</italic>-inositol or phytic acid triggers feedback regulation that up-regulates transporters which can import inositol from the outside of the seed, hence elevating <italic>myo</italic>-inositol levels. This is in agreement with increase in the expression of inositol transporter genes during early stages of seed development in <italic>3mlpa</italic> mutant (Redekar et al., <xref ref-type="bibr" rid="B45">2015</xref>).</p>
<p>In the present study, the inositol transporter gene (Glyma.09G011400) is also up-regulated in <italic>1mlpa</italic>, a single <italic>mips1</italic> mutant. This common up-regulation of inositol transporter in <italic>mips1</italic> and in <italic>mips1/mrp-l/mrp-n</italic> mutants suggests that these mutations are triggering the same signaling pathway (Figure <xref ref-type="fig" rid="F5">5</xref>). This inositol transporter gene belongs to module 5 and is predicted to be regulated by a MYB transcription factor (Glyma.13G309200), which represents a promising target for experimental validation (Figure <xref ref-type="fig" rid="F5">5</xref>). These target genes are in agreement with those identified in developing maize kernel with low phytic acid content (Zhang et al., <xref ref-type="bibr" rid="B63">2016</xref>).</p>
<p>We identified over 30 genes associated with <italic>myo</italic>-inositol metabolism in this network analysis (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>). Some of these genes belong to the modules that are only regulated in mutants. For example, an inositol-polyphosphate 5-phosphatase (Glyma.01G200500) is predicted to be regulated by a DOF transcription factor (Glyma.17G101000). This enzyme catalyzes an intermediate step in converting phytic acid to <italic>myo</italic>-inositol. The DOF transcription factor is differentially expressed when comparing <italic>3mlpa</italic> with 3MWT at stages 1, 3, and 5 and is not differentially expressed in any other comparisons. This observation is consistent with our hypothesis that in <italic>3mlpa</italic> mutant, phytic acid is recycled to produce <italic>myo</italic>-inositol, which participates in RFO synthetic pathways.</p>
</sec>
<sec>
<title>Regulation of auxin-ABA signaling and seed dormancy-related genes in <italic>lpa</italic> mutants</title>
<p>A main goal of this work was to elucidate the molecular basis of negative downstream impacts of <italic>lpa</italic> mutations. Here we found one possible component of this: that in soybean these mutations greatly impact phytohormone pathways. In addition to <italic>myo</italic>-inositol, the importance of phytohormones such as auxin and ABA, in seed development is well-documented (Locascio et al., <xref ref-type="bibr" rid="B31">2014</xref>). Auxin is a key hormone through all phases of seed development including embryogenesis, organ differentiation, endosperm formation, and seed maturation, whereas ABA is involved in onset and maintaining seed dormancy, and is active in the seed maturation and desiccation phases (Locascio et al., <xref ref-type="bibr" rid="B31">2014</xref>). Cross signaling between auxin and ABA are involved in regulating seed dormancy and hence germination (Liu et al., <xref ref-type="bibr" rid="B30">2013</xref>). Hormone signal transduction is controlled by regulating their biosynthesis, accumulation and distribution in different sections and stages of developing seeds by factors such as <italic>myo</italic>-inositol. The <italic>Arabidopsis mips1</italic> mutants have demonstrated defects in cotyledon development and reduced expression of basipetal auxin efflux carriers such as <italic>PIN1</italic> and <italic>PIN2</italic> (Chen and Xiong, <xref ref-type="bibr" rid="B5">2010</xref>; Luo et al., <xref ref-type="bibr" rid="B33">2011</xref>). The factors regulating cellular <italic>myo</italic>-inositol levels might therefore also regulate cross signaling of auxin and ABA. However, it is still unclear to what extent auxin-ABA cross signaling is disturbed in <italic>lpa</italic>-mutants.</p>
<p>In this study, nearly the entire spectrum of the auxin signaling pathway was identified and was differentially regulated in <italic>1mlpa</italic> and <italic>3mlpa</italic> mutants as compared to the corresponding wild types. We identified 188 auxin-related genes differentially expressed in one or more comparisons tested in this study (Supplementary Table <xref ref-type="supplementary-material" rid="SM3">3</xref>). These included genes involved in auxin metabolism, transport, signal transduction, and transcription regulation. We also observed 36 genes from the ABA signaling pathway differentially expressed in one or more comparisons. The <italic>1mlpa</italic> vs. 1MWT comparison identified down regulation of two ABA catabolism genes in <italic>1mlpa</italic> (Glyma.01G153300 and Glyma.09G218600, which remove active ABA), whereas, in <italic>3mlpa</italic> vs. 3MWT comparison, these two genes are up-regulated.</p>
<p>In summary, we identified potential candidate genes that may play a role in regulating inositol metabolism, auxin-ABA signaling and seed maturation-dormancy in low phytic acid soybean during seed development. Although follow up experiments are required to validate these findings, the comprehensive regulatory network and the computational analysis pipeline of this study has set the necessary groundwork for future hypothesis driven investigations.</p>
</sec>
</sec>
<sec id="s5">
<title>Author contributions</title>
<p>NR and MASM designed the experiment. NR performed the sequencing experiments. NR and SL analyzed the data. SL developed the machine-learning pipeline. NR, SL, GP, VR and MASM wrote the paper.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ack><p>We would like to thank Virginia Tech&#x00027;s Advanced Research Computing servers and Translational Plant Sciences&#x00027; MAGYK server for their support.</p>
</ack>
<sec sec-type="supplementary-material" id="s6">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpls.2017.02029/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpls.2017.02029/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table1.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table2.XLSX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table3.XLSX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table4.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table5.XLSX" id="SM5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table6.XLSX" id="SM6" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table7.XLSX" id="SM7" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table8.XLSX" id="SM8" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image1.TIFF" id="SM9" mimetype="image/tiff" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image2.TIFF" id="SM10" mimetype="image/tiff" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image3.TIFF" id="SM11" mimetype="image/tiff" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image4.TIFF" id="SM12" mimetype="image/tiff" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.PDF" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bailey</surname> <given-names>T. L.</given-names></name> <name><surname>Boden</surname> <given-names>M.</given-names></name> <name><surname>Buske</surname> <given-names>F. A.</given-names></name> <name><surname>Frith</surname> <given-names>M.</given-names></name> <name><surname>Grant</surname> <given-names>C. E.</given-names></name> <name><surname>Clementi</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>MEME SUITE: tools for motif discovery and searching</article-title>. <source>Nucleic Acids Res.</source> <volume>37</volume>, <fpage>W202</fpage>&#x02013;<lpage>W208</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkp335</pub-id><pub-id pub-id-type="pmid">19458158</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bassel</surname> <given-names>G. W.</given-names></name> <name><surname>Lan</surname> <given-names>H.</given-names></name> <name><surname>Glaab</surname> <given-names>E.</given-names></name> <name><surname>Gibbs</surname> <given-names>D. J.</given-names></name> <name><surname>Gerjets</surname> <given-names>T.</given-names></name> <name><surname>Krasnogor</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Genome-wide network model capturing seed germination reveals coordinated regulation of plant cellular phase transitions</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>108</volume>, <fpage>9709</fpage>&#x02013;<lpage>9714</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1100958108</pub-id><pub-id pub-id-type="pmid">21593420</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowen</surname> <given-names>D. E.</given-names></name> <name><surname>Souza</surname> <given-names>E. J.</given-names></name> <name><surname>Guttieri</surname> <given-names>M. J.</given-names></name> <name><surname>Raboy</surname> <given-names>V.</given-names></name> <name><surname>Fu</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>A low phytic acid barley mutation alters seed gene expression</article-title>. <source>Crop Sci.</source> <volume>47</volume>, <fpage>S-149</fpage>. <pub-id pub-id-type="doi">10.2135/cropsci2006.07.0456tpg</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chaouch</surname> <given-names>S.</given-names></name> <name><surname>Noctor</surname> <given-names>G.</given-names></name></person-group> (<year>2010</year>). <article-title>Myo-inositol abolishes salicylic acid-dependent cell death and pathogen defence responses triggered by peroxisomal hydrogen peroxide</article-title>. <source>New Phytol.</source> <volume>188</volume>, <fpage>711</fpage>&#x02013;<lpage>718</lpage>. <pub-id pub-id-type="doi">10.1111/j.1469-8137.2010.03453.x</pub-id><pub-id pub-id-type="pmid">20807338</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>H.</given-names></name> <name><surname>Xiong</surname> <given-names>L.</given-names></name></person-group> (<year>2010</year>). <article-title>myo-Inositol-1-phosphate synthase is required for polar auxin transport and organ development</article-title>. <source>J. Biol. Chem.</source> <volume>285</volume>, <fpage>24238</fpage>&#x02013;<lpage>24247</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M110.123661</pub-id><pub-id pub-id-type="pmid">20516080</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="other"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>C.-Y.</given-names></name> <name><surname>Krishnakumar</surname> <given-names>V.</given-names></name> <name><surname>Chan</surname> <given-names>A.</given-names></name> <name><surname>Schobel</surname> <given-names>S.</given-names></name> <name><surname>Town</surname> <given-names>C. D.</given-names></name></person-group> (<year>2016</year>). <article-title>Araport11: a complete reannotation of the <italic>Arabidopsis thaliana</italic> reference genome</article-title>. <source>bioRxiv</source>. <pub-id pub-id-type="pmid">27862469</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Danzer</surname> <given-names>J.</given-names></name> <name><surname>Mellott</surname> <given-names>E.</given-names></name> <name><surname>Bui</surname> <given-names>A. Q.</given-names></name> <name><surname>Le</surname> <given-names>B. H.</given-names></name> <name><surname>Martin</surname> <given-names>P.</given-names></name> <name><surname>Hashimoto</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Down-regulating the expression of 53 soybean transcription factor genes uncovers a role for speechless in initiating stomatal cell lineages during embryo development</article-title>. <source>Plant Physiol.</source> <volume>168</volume>, <fpage>1025</fpage>&#x02013;<lpage>1035</lpage>. <pub-id pub-id-type="doi">10.1104/pp.15.00432</pub-id><pub-id pub-id-type="pmid">25963149</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Donahue</surname> <given-names>J. L.</given-names></name> <name><surname>Alford</surname> <given-names>S. R.</given-names></name> <name><surname>Torabinejad</surname> <given-names>J.</given-names></name> <name><surname>Kerwin</surname> <given-names>R. E.</given-names></name> <name><surname>Nourbakhsh</surname> <given-names>A.</given-names></name> <name><surname>Ray</surname> <given-names>W. K.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The <italic>Arabidopsis thaliana</italic> Myo-inositol 1-phosphate synthase1 gene is required for Myo-inositol synthesis and suppression of cell death</article-title>. <source>Plant Cell</source> <volume>22</volume>, <fpage>888</fpage>&#x02013;<lpage>903</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.109.071779</pub-id><pub-id pub-id-type="pmid">20215587</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Faith</surname> <given-names>J. J.</given-names></name> <name><surname>Hayete</surname> <given-names>B.</given-names></name> <name><surname>Thaden</surname> <given-names>J. T.</given-names></name> <name><surname>Mogno</surname> <given-names>I.</given-names></name> <name><surname>Wierzbowski</surname> <given-names>J.</given-names></name> <name><surname>Cottarel</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Large-scale mapping and validation of <italic>Escherichia coli</italic> transcriptional regulation from a compendium of expression profiles</article-title>. <source>PLoS Biol.</source> <volume>5</volume>:<fpage>e8</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0050008</pub-id><pub-id pub-id-type="pmid">17214507</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gerrard Wheeler</surname> <given-names>M. C.</given-names></name> <name><surname>Arias</surname> <given-names>C. L.</given-names></name> <name><surname>Righini</surname> <given-names>S.</given-names></name> <name><surname>Badia</surname> <given-names>M. B.</given-names></name> <name><surname>Andreo</surname> <given-names>C. S.</given-names></name> <name><surname>Drincovich</surname> <given-names>M. F.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Differential contribution of malic enzymes during soybean and castor seeds maturation</article-title>. <source>PLoS ONE</source> <volume>11</volume>:<fpage>e0158040</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0158040</pub-id><pub-id pub-id-type="pmid">27347875</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Glover</surname> <given-names>N.</given-names></name></person-group> (<year>2011</year>). <source>The Genetic Basis of Phytate, Oligosaccharide Content, and Emergence in Soybean</source>. Doctor of Philosophy, <publisher-name>Virginia Tech</publisher-name>.</citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Morales</surname> <given-names>S. I.</given-names></name> <name><surname>Chavez-Montes</surname> <given-names>R. A.</given-names></name> <name><surname>Hayano-Kanashiro</surname> <given-names>C.</given-names></name> <name><surname>Alejo-Jacuinde</surname> <given-names>G.</given-names></name> <name><surname>Rico-Cambron</surname> <given-names>T. Y.</given-names></name> <name><surname>de Folter</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Regulatory network analysis reveals novel regulators of seed desiccation tolerance in <italic>Arabidopsis thaliana</italic></article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>113</volume>, <fpage>E5232</fpage>&#x02013;<lpage>E5241</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1610985113</pub-id><pub-id pub-id-type="pmid">27551092</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamaguchi</surname> <given-names>A.</given-names></name> <name><surname>Yamashino</surname> <given-names>T.</given-names></name> <name><surname>Koizumi</surname> <given-names>N.</given-names></name> <name><surname>Kiba</surname> <given-names>T.</given-names></name> <name><surname>Kojima</surname> <given-names>M.</given-names></name> <name><surname>Sakakibara</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>A small subfamily of Arabidopsis RADIALIS-LIKE SANT/MYB genes: a link to HOOKLESS1-mediated signal transduction during early morphogenesis</article-title>. <source>Biosci. Biotechnol. Biochem.</source> <volume>72</volume>, <fpage>2687</fpage>&#x02013;<lpage>2696</lpage>. <pub-id pub-id-type="doi">10.1271/bbb.80348</pub-id><pub-id pub-id-type="pmid">18838801</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haury</surname> <given-names>A. C.</given-names></name> <name><surname>Mordelet</surname> <given-names>F.</given-names></name> <name><surname>Vera-Licona</surname> <given-names>P.</given-names></name> <name><surname>Vert</surname> <given-names>J. P.</given-names></name></person-group> (<year>2012</year>). <article-title>TIGRESS: trustful inference of gene regulation using stability selection</article-title>. <source>BMC Syst. Biol.</source> <volume>6</volume>:<fpage>145</fpage>. <pub-id pub-id-type="doi">10.1186/1752-0509-6-145</pub-id><pub-id pub-id-type="pmid">23173819</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hegeman</surname> <given-names>C. E.</given-names></name> <name><surname>Good</surname> <given-names>L. L.</given-names></name> <name><surname>Grabau</surname> <given-names>E. A.</given-names></name></person-group> (<year>2001</year>). <article-title>Expression of D-myo-inositol-3-phosphate synthase in soybean. Implications for phytic acid biosynthesis</article-title>. <source>Plant Physiol.</source> <volume>125</volume>, <fpage>1941</fpage>&#x02013;<lpage>1948</lpage>. <pub-id pub-id-type="doi">10.1104/pp.125.4.1941</pub-id><pub-id pub-id-type="pmid">11299373</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hitz</surname> <given-names>W. D.</given-names></name> <name><surname>Carlson</surname> <given-names>T. J.</given-names></name> <name><surname>Kerr</surname> <given-names>P. S.</given-names></name> <name><surname>Sebastian</surname> <given-names>S. A.</given-names></name></person-group> (<year>2002</year>). <article-title>Biochemical and molecular characterization of a mutation that confers a decreased raffinosaccharide and phytic acid phenotype on soybean seeds</article-title>. <source>Plant Physiol.</source> <volume>128</volume>, <fpage>650</fpage>&#x02013;<lpage>660</lpage>. <pub-id pub-id-type="doi">10.1104/pp.010585</pub-id><pub-id pub-id-type="pmid">11842168</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huynh-Thu</surname> <given-names>V. A.</given-names></name> <name><surname>Irrthum</surname> <given-names>A.</given-names></name> <name><surname>Wehenkel</surname> <given-names>L.</given-names></name> <name><surname>Geurts</surname> <given-names>P.</given-names></name></person-group> (<year>2010</year>). <article-title>Inferring regulatory networks from expression data using tree-based methods</article-title>. <source>PLoS ONE</source> <volume>5</volume>:<fpage>12776</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0012776</pub-id><pub-id pub-id-type="pmid">20927193</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Israel</surname> <given-names>D. W.</given-names></name> <name><surname>Taliercio</surname> <given-names>E.</given-names></name> <name><surname>Kwanyuen</surname> <given-names>P.</given-names></name> <name><surname>Burton</surname> <given-names>J. W.</given-names></name> <name><surname>Dean</surname> <given-names>L.</given-names></name></person-group> (<year>2011</year>). <article-title>Inositol metabolism in developing seed of low and normal phytic acid soybean lines</article-title>. <source>Crop Sci.</source> <volume>51</volume>, <fpage>282</fpage>&#x02013;<lpage>289</lpage>. <pub-id pub-id-type="doi">10.2135/cropsci2010.03.0123</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jervis</surname> <given-names>J.</given-names></name> <name><surname>Kastl</surname> <given-names>C.</given-names></name> <name><surname>Hildreth</surname> <given-names>S. B.</given-names></name> <name><surname>Biyashev</surname> <given-names>R.</given-names></name> <name><surname>Grabau</surname> <given-names>E. A.</given-names></name> <name><surname>Saghai-Maroof</surname> <given-names>M. A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Metabolite profiling of soybean seed extracts from near-isogenic low and normal phytate lines using orthogonal separation strategies</article-title>. <source>J. Agric. Food Chem.</source> <volume>63</volume>, <fpage>9879</fpage>&#x02013;<lpage>9887</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jafc.5b04002</pub-id><pub-id pub-id-type="pmid">26487475</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>J.</given-names></name> <name><surname>He</surname> <given-names>K.</given-names></name> <name><surname>Tang</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Lv</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>An Arabidopsis transcriptional regulatory map reveals distinct functional and evolutionary features of novel transcription factors</article-title>. <source>Mol. Biol. Evol.</source> <volume>32</volume>, <fpage>1767</fpage>&#x02013;<lpage>1773</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msv058</pub-id><pub-id pub-id-type="pmid">25750178</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>J.</given-names></name> <name><surname>Tian</surname> <given-names>F.</given-names></name> <name><surname>Yang</surname> <given-names>D. C.</given-names></name> <name><surname>Meng</surname> <given-names>Y. Q.</given-names></name> <name><surname>Kong</surname> <given-names>L.</given-names></name> <name><surname>Luo</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume>, <fpage>D1040</fpage>&#x02013;<lpage>D1045</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw982</pub-id><pub-id pub-id-type="pmid">27924042</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>W.</given-names></name> <name><surname>Yu</surname> <given-names>O.</given-names></name> <name><surname>Lau</surname> <given-names>S. M.</given-names></name> <name><surname>O&#x00027;Keefe</surname> <given-names>D. P.</given-names></name> <name><surname>Odell</surname> <given-names>J.</given-names></name> <name><surname>Fader</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2000</year>). <article-title>Identification and expression of isoflavone synthase, the key enzyme for biosynthesis of isoflavones in legumes</article-title>. <source>Nat. Biotechnol.</source> <volume>18</volume>, <fpage>208</fpage>&#x02013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.1038/72671</pub-id><pub-id pub-id-type="pmid">10657130</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langfelder</surname> <given-names>P.</given-names></name> <name><surname>Horvath</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>WGCNA: an R package for weighted correlation network analysis</article-title>. <source>BMC Bioinformatics</source> <volume>9</volume>:<fpage>559</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-9-559</pub-id><pub-id pub-id-type="pmid">19114008</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le</surname> <given-names>B. H.</given-names></name> <name><surname>Cheng</surname> <given-names>C.</given-names></name> <name><surname>Bui</surname> <given-names>A. Q.</given-names></name> <name><surname>Wagmaister</surname> <given-names>J. A.</given-names></name> <name><surname>Henry</surname> <given-names>K. F.</given-names></name> <name><surname>Pelletier</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Global analysis of gene activity during Arabidopsis seed development and identification of seed-specific transcription factors</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>107</volume>, <fpage>8063</fpage>&#x02013;<lpage>8070</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1003530107</pub-id><pub-id pub-id-type="pmid">20385809</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le</surname> <given-names>B. H.</given-names></name> <name><surname>Wagmaister</surname> <given-names>J. A.</given-names></name> <name><surname>Kawashima</surname> <given-names>T.</given-names></name> <name><surname>Bui</surname> <given-names>A. Q.</given-names></name> <name><surname>Harada</surname> <given-names>J. J.</given-names></name> <name><surname>Goldberg</surname> <given-names>R. B.</given-names></name></person-group> (<year>2007</year>). <article-title>Using genomics to study legume seed development</article-title>. <source>Plant Physiol.</source> <volume>144</volume>, <fpage>562</fpage>&#x02013;<lpage>574</lpage>. <pub-id pub-id-type="doi">10.1104/pp.107.100362</pub-id><pub-id pub-id-type="pmid">17556519</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leite</surname> <given-names>J. P.</given-names></name> <name><surname>Barbosa</surname> <given-names>E. G.</given-names></name> <name><surname>Marin</surname> <given-names>S. R.</given-names></name> <name><surname>Marinho</surname> <given-names>J. P.</given-names></name> <name><surname>Carvalho</surname> <given-names>J. F.</given-names></name> <name><surname>Pagliarini</surname> <given-names>R. F.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Overexpression of the activated form of the AtAREB1 gene (AtAREB1DeltaQT) improves soybean responses to water deficit</article-title>. <source>Genet. Mol. Res.</source> <volume>13</volume>, <fpage>6272</fpage>&#x02013;<lpage>6286</lpage>. <pub-id pub-id-type="doi">10.4238/2014.August.15.10</pub-id><pub-id pub-id-type="pmid">25158254</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Pandey</surname> <given-names>S.</given-names></name> <name><surname>Gookin</surname> <given-names>T. E.</given-names></name> <name><surname>Zhao</surname> <given-names>Z.</given-names></name> <name><surname>Wilson</surname> <given-names>L.</given-names></name> <name><surname>Assmann</surname> <given-names>S. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Gene-sharing networks reveal organizing principles of transcriptomes in Arabidopsis and other multicellular organisms</article-title>. <source>Plant Cell</source> <volume>24</volume>, <fpage>1362</fpage>&#x02013;<lpage>1378</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.111.094748</pub-id><pub-id pub-id-type="pmid">22517316</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>W.</given-names></name> <name><surname>Mazarei</surname> <given-names>M.</given-names></name> <name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Fethe</surname> <given-names>M. H.</given-names></name> <name><surname>Rudis</surname> <given-names>M. R.</given-names></name> <name><surname>Lin</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Computational discovery of soybean promoter cis-regulatory elements for the construction of soybean cyst nematode-inducible synthetic promoters</article-title>. <source>Plant Biotechnol. J.</source> <volume>12</volume>, <fpage>1015</fpage>&#x02013;<lpage>1026</lpage>. <pub-id pub-id-type="doi">10.1111/pbi.12206</pub-id><pub-id pub-id-type="pmid">24893752</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Feng</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>Yang</surname> <given-names>H. Q.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Auxin controls seed dormancy through stimulation of abscisic acid signaling by inducing ARF-mediated ABI3 activation in Arabidopsis</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>110</volume>, <fpage>15485</fpage>&#x02013;<lpage>15490</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1304651110</pub-id><pub-id pub-id-type="pmid">23986496</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Locascio</surname> <given-names>A.</given-names></name> <name><surname>Roig-Villanova</surname> <given-names>I.</given-names></name> <name><surname>Bernardi</surname> <given-names>J.</given-names></name> <name><surname>Varotto</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Current perspectives on the hormonal control of seed development in Arabidopsis and maize: a focus on auxin</article-title>. <source>Front. Plant Sci.</source> <volume>5</volume>:<fpage>412</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2014.00412</pub-id><pub-id pub-id-type="pmid">25202316</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loewus</surname> <given-names>F. A.</given-names></name> <name><surname>Murthy</surname> <given-names>P. P. N.</given-names></name></person-group> (<year>2000</year>). <article-title>myo-Inositol metabolism in plants</article-title>. <source>Plant Sci.</source> <volume>150</volume>, <fpage>1</fpage>&#x02013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-9452(99)00150-8</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Luo</surname> <given-names>Y.</given-names></name> <name><surname>Qin</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>Y.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>D-myo-inositol-3-phosphate affects phosphatidylinositol-mediated endomembrane function in Arabidopsis and is essential for auxin-regulated embryogenesis</article-title>. <source>Plant Cell</source> <volume>23</volume>, <fpage>1352</fpage>&#x02013;<lpage>1372</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.111.083337</pub-id><pub-id pub-id-type="pmid">21505066</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maetschke</surname> <given-names>S. R.</given-names></name> <name><surname>Madhamshettiwar</surname> <given-names>P. B.</given-names></name> <name><surname>Davis</surname> <given-names>M. J.</given-names></name> <name><surname>Ragan</surname> <given-names>M. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Supervised, semi-supervised and unsupervised inference of gene regulatory networks</article-title>. <source>Brief. Bioinformatics</source> <volume>15</volume>, <fpage>195</fpage>&#x02013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbt034</pub-id><pub-id pub-id-type="pmid">23698722</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marbach</surname> <given-names>D.</given-names></name> <name><surname>Costello</surname> <given-names>J. C.</given-names></name> <name><surname>Kuffner</surname> <given-names>R.</given-names></name> <name><surname>Vega</surname> <given-names>N. M.</given-names></name> <name><surname>Prill</surname> <given-names>R. J.</given-names></name> <name><surname>Camacho</surname> <given-names>D. M.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Wisdom of crowds for robust gene network inference</article-title>. <source>Nat. Methods</source> <volume>9</volume>, <fpage>796</fpage>&#x02013;<lpage>804</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2016</pub-id><pub-id pub-id-type="pmid">22796662</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Margolin</surname> <given-names>A. A.</given-names></name> <name><surname>Nemenman</surname> <given-names>I.</given-names></name> <name><surname>Basso</surname> <given-names>K.</given-names></name> <name><surname>Wiggins</surname> <given-names>C.</given-names></name> <name><surname>Stolovitzky</surname> <given-names>G.</given-names></name> <name><surname>Dalla Favera</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context</article-title>. <source>BMC Bioinformatics</source> <volume>7</volume>(<supplement>Suppl. 1</supplement>):<issue>S7</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-7-S1-S7</pub-id><pub-id pub-id-type="pmid">16723010</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>T. F.</given-names></name></person-group> (<year>1998</year>). <article-title>Phosphoinositide lipids as signaling molecules: common themes for signal transduction, cytoskeletal regulation, and membrane trafficking</article-title>. <source>Annu. Rev. Cell Dev. Biol.</source> <volume>14</volume>, <fpage>231</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.cellbio.14.1.231</pub-id><pub-id pub-id-type="pmid">9891784</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>P. H.</given-names></name> <name><surname>Raynaud</surname> <given-names>C.</given-names></name> <name><surname>Tcherkez</surname> <given-names>G.</given-names></name> <name><surname>Blanchet</surname> <given-names>S.</given-names></name> <name><surname>Massoud</surname> <given-names>K.</given-names></name> <name><surname>Domenichini</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Crosstalks between Myo-inositol metabolism, programmed cell death and basal immunity in arabidopsis</article-title>. <source>PLoS ONE</source> <volume>4</volume>:<fpage>e7364</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0007364</pub-id><pub-id pub-id-type="pmid">19812700</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nagy</surname> <given-names>R.</given-names></name> <name><surname>Grob</surname> <given-names>H.</given-names></name> <name><surname>Weder</surname> <given-names>B.</given-names></name> <name><surname>Green</surname> <given-names>P.</given-names></name> <name><surname>Klein</surname> <given-names>M.</given-names></name> <name><surname>Frelet-Barrand</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The Arabidopsis ATP-binding cassette protein AtMRP5/AtABCC5 is a high affinity inositol hexakisphosphate transporter involved in guard cell signaling and phytate storage</article-title>. <source>J. Biol. Chem.</source> <volume>284</volume>, <fpage>33614</fpage>&#x02013;<lpage>33622</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M109.030247</pub-id><pub-id pub-id-type="pmid">19797057</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ni</surname> <given-names>Y.</given-names></name> <name><surname>Aghamirzaie</surname> <given-names>D.</given-names></name> <name><surname>Elmarakeby</surname> <given-names>H.</given-names></name> <name><surname>Collakova</surname> <given-names>E.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Grene</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>A machine learning approach to predict gene regulatory networks in seed development in arabidopsis</article-title>. <source>Front. Plant Sci.</source> <volume>7</volume>:<fpage>1936</fpage>. <pub-id pub-id-type="doi">10.3389/fpls.2016.01936</pub-id><pub-id pub-id-type="pmid">28066488</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Malley</surname> <given-names>R. C.</given-names></name> <name><surname>Huang</surname> <given-names>S. C.</given-names></name> <name><surname>Song</surname> <given-names>L.</given-names></name> <name><surname>Lewsey</surname> <given-names>M. G.</given-names></name> <name><surname>Bartlett</surname> <given-names>A.</given-names></name> <name><surname>Nery</surname> <given-names>J. R.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Cistrome and epicistrome features shape the regulatory DNA landscape</article-title>. <source>Cell</source> <volume>166</volume>, <fpage>1280</fpage>&#x02013;<lpage>1292</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2016.04.038</pub-id><pub-id pub-id-type="pmid">27610578</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Raboy</surname> <given-names>V.</given-names></name></person-group> (<year>1997</year>). <article-title>Accumulation and storage of phosphate and minerals</article-title>, in <source>Cellular and Molecular Biology of Plant Seed Development</source>, eds <person-group person-group-type="editor"><name><surname>Larkins</surname> <given-names>B. A.</given-names></name> <name><surname>Vasil</surname> <given-names>I. K.</given-names></name></person-group> (<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Kluwer Academic Publishers</publisher-name>), <fpage>441</fpage>&#x02013;<lpage>477</lpage></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raboy</surname> <given-names>V.</given-names></name></person-group> (<year>2007</year>). <article-title>The ABCs of low-phytate crops</article-title>. <source>Nat. Biotechnol.</source> <volume>25</volume>, <fpage>874</fpage>&#x02013;<lpage>875</lpage>. <pub-id pub-id-type="doi">10.1038/nbt0807-874</pub-id><pub-id pub-id-type="pmid">17687363</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramsey</surname> <given-names>S. A.</given-names></name> <name><surname>Klemm</surname> <given-names>S. L.</given-names></name> <name><surname>Zak</surname> <given-names>D. E.</given-names></name> <name><surname>Kennedy</surname> <given-names>K. A.</given-names></name> <name><surname>Thorsson</surname> <given-names>V.</given-names></name> <name><surname>Li</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Uncovering a macrophage transcriptional program by integrating evidence from motif scanning and expression dynamics</article-title>. <source>PLoS Comput. Biol.</source> <volume>4</volume>:<fpage>e1000021</fpage>. <pub-id pub-id-type="doi">10.1371/annotation/e14ad837-e5ff-4bd5-a5f2-f32e784d75a2</pub-id><pub-id pub-id-type="pmid">18369420</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Redekar</surname> <given-names>N.</given-names></name> <name><surname>Biyashev</surname> <given-names>R.</given-names></name> <name><surname>Jensen</surname> <given-names>R.</given-names></name> <name><surname>Helm</surname> <given-names>R.</given-names></name> <name><surname>Grabau</surname> <given-names>E.</given-names></name> <name><surname>Maroof</surname> <given-names>S. M. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Genome-wide transcriptome analysis of developing seeds from low and normal phytic acid soybean lines</article-title>. <source>BMC Genomics</source> <volume>16</volume>, <fpage>1074</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-015-2283-9</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Saghai Maroof</surname> <given-names>M. A.</given-names></name> <name><surname>Buss</surname> <given-names>G. R.</given-names></name></person-group> (<year>2008</year>). <source>Low Phytic Acid, Low Stachyose, High Sucrose Soybean Lines</source>. <publisher-name>Google Patents</publisher-name>.</citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saghai Maroof</surname> <given-names>M. A.</given-names></name> <name><surname>Glover</surname> <given-names>N. M.</given-names></name> <name><surname>Biyashev</surname> <given-names>R. M.</given-names></name> <name><surname>Buss</surname> <given-names>G. R.</given-names></name> <name><surname>Grabau</surname> <given-names>E. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Genetic basis of the low-phytate trait in the soybean line CX1834</article-title>. <source>Crop Sci.</source> <volume>49</volume>, <fpage>69</fpage>. <pub-id pub-id-type="doi">10.2135/cropsci2008.06.0362</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schafer</surname> <given-names>J.</given-names></name> <name><surname>Strimmer</surname> <given-names>K.</given-names></name></person-group> (<year>2005a</year>). <article-title>An empirical Bayes approach to inferring large-scale gene association networks</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>754</fpage>&#x02013;<lpage>764</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti062</pub-id><pub-id pub-id-type="pmid">15479708</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schafer</surname> <given-names>J.</given-names></name> <name><surname>Strimmer</surname> <given-names>K.</given-names></name></person-group> (<year>2005b</year>). <article-title>A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics</article-title>. <source>Stat. Appl. Genet. Mol. Biol.</source> <volume>4</volume>, Article32. <pub-id pub-id-type="doi">10.2202/1544-6115.1175</pub-id><pub-id pub-id-type="pmid">16646851</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Segal</surname> <given-names>E.</given-names></name> <name><surname>Shapira</surname> <given-names>M.</given-names></name> <name><surname>Regev</surname> <given-names>A.</given-names></name> <name><surname>Pe&#x00027;er</surname> <given-names>D.</given-names></name> <name><surname>Botstein</surname> <given-names>D.</given-names></name> <name><surname>Koller</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data</article-title>. <source>Nat. Genet.</source> <volume>34</volume>, <fpage>166</fpage>&#x02013;<lpage>176</lpage>. <pub-id pub-id-type="doi">10.1038/ng1165</pub-id><pub-id pub-id-type="pmid">12740579</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sherlock</surname> <given-names>G.</given-names></name></person-group> (<year>2000</year>). <article-title>Analysis of large-scale gene expression data</article-title>. <source>Curr. Opin. Immunol.</source> <volume>12</volume>, <fpage>201</fpage>&#x02013;<lpage>205</lpage>. <pub-id pub-id-type="doi">10.1016/S0952-7915(99)00074-6</pub-id><pub-id pub-id-type="pmid">10712947</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>J. R.</given-names></name> <name><surname>Wang</surname> <given-names>H. Y.</given-names></name> <name><surname>Schellin</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>B. L.</given-names></name> <name><surname>Faller</surname> <given-names>M.</given-names></name> <name><surname>Stoop</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Embryo-specific silencing of a transporter reduces phytic acid content of maize and soybean seeds</article-title>. <source>Nat. Biotechnol.</source> <volume>25</volume>, <fpage>930</fpage>&#x02013;<lpage>937</lpage>. <pub-id pub-id-type="doi">10.1038/nbt1322</pub-id><pub-id pub-id-type="pmid">17676037</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Hazebroek</surname> <given-names>J.</given-names></name> <name><surname>Ertl</surname> <given-names>D. S.</given-names></name> <name><surname>Harp</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>The maize low-phytic acid 3 encodes a myo-inositol kinase that plays a role in phytic acid biosynthesis in developing seeds</article-title>. <source>Plant J.</source> <volume>42</volume>, <fpage>708</fpage>&#x02013;<lpage>719</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2005.02412.x</pub-id><pub-id pub-id-type="pmid">15918884</pub-id></citation></ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Hazebroek</surname> <given-names>J.</given-names></name> <name><surname>Meeley</surname> <given-names>R. B.</given-names></name> <name><surname>Ertl</surname> <given-names>D. S.</given-names></name></person-group> (<year>2003</year>). <article-title>The maize low-phytic acid mutant lpa2 is caused by mutation in an inositol phosphate kinase gene</article-title>. <source>Plant Physiol.</source> <volume>131</volume>, <fpage>507</fpage>&#x02013;<lpage>515</lpage>. <pub-id pub-id-type="doi">10.1104/pp.014258</pub-id><pub-id pub-id-type="pmid">12586875</pub-id></citation></ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sparks</surname> <given-names>E.</given-names></name> <name><surname>Drapek</surname> <given-names>C.</given-names></name> <name><surname>Gaudinier</surname> <given-names>A.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Ansariola</surname> <given-names>M.</given-names></name> <name><surname>Shen</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Establishment of expression in the shortroot-scarecrow transcriptional cascade through opposing activities of both activators and repressors</article-title>. <source>Dev. Cell</source> <volume>39</volume>, <fpage>585</fpage>&#x02013;<lpage>596</lpage>. <pub-id pub-id-type="doi">10.1016/j.devcel.2016.09.031</pub-id><pub-id pub-id-type="pmid">27923776</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stevenson</surname> <given-names>J. M.</given-names></name> <name><surname>Perera</surname> <given-names>I. Y.</given-names></name> <name><surname>Heilmann</surname> <given-names>I.</given-names></name> <name><surname>Persson</surname> <given-names>S.</given-names></name> <name><surname>Boss</surname> <given-names>W. F.</given-names></name></person-group> (<year>2000</year>). <article-title>Inositol signaling and plant growth</article-title>. <source>Trends Plant Sci.</source> <volume>5</volume>, <fpage>252</fpage>&#x02013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1016/S1360-1385(00)01652-6</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stevenson-Paulik</surname> <given-names>J.</given-names></name> <name><surname>Bastidas</surname> <given-names>R. J.</given-names></name> <name><surname>Chiou</surname> <given-names>S.-T.</given-names></name> <name><surname>Frye</surname> <given-names>R. A.</given-names></name> <name><surname>York</surname> <given-names>J. D.</given-names></name></person-group> (<year>2005</year>). <article-title>Generation of phytate-free seeds in Arabidopsis through disruption of inositol polyphosphate kinases</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>102</volume>, <fpage>12612</fpage>&#x02013;<lpage>12617</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0504172102</pub-id><pub-id pub-id-type="pmid">16107538</pub-id></citation></ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taylor-Teeples</surname> <given-names>M.</given-names></name> <name><surname>Lin</surname> <given-names>L.</given-names></name> <name><surname>de Lucas</surname> <given-names>M.</given-names></name> <name><surname>Turco</surname> <given-names>G.</given-names></name> <name><surname>Toal</surname> <given-names>T. W.</given-names></name> <name><surname>Gaudinier</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>An Arabidopsis gene regulatory network for secondary cell wall synthesis</article-title>. <source>Nature</source> <volume>517</volume>, <fpage>571</fpage>&#x02013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.1038/nature14099</pub-id><pub-id pub-id-type="pmid">25533953</pub-id></citation></ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weber</surname> <given-names>H.</given-names></name> <name><surname>Borisjuk</surname> <given-names>L.</given-names></name> <name><surname>Wobus</surname> <given-names>U.</given-names></name></person-group> (<year>2005</year>). <article-title>Molecular physiology of legume seed development</article-title>. <source>Annu. Rev. Plant Biol.</source> <volume>56</volume>, <fpage>253</fpage>&#x02013;<lpage>279</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.arplant.56.032604.144201</pub-id><pub-id pub-id-type="pmid">15862096</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wilcox</surname> <given-names>J. R.</given-names></name> <name><surname>Premachandra</surname> <given-names>G. S.</given-names></name> <name><surname>Young</surname> <given-names>K. A.</given-names></name> <name><surname>Raboy</surname> <given-names>V.</given-names></name></person-group> (<year>2000</year>). <article-title>Isolation of high seed inorganic P, low-phytate soybean mutants</article-title>. <source>Crop Sci.</source> <volume>40</volume>:<fpage>1601</fpage>. <pub-id pub-id-type="doi">10.2135/cropsci2000.4061601x</pub-id></citation></ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>F.</given-names></name> <name><surname>Meng</surname> <given-names>T.</given-names></name> <name><surname>Li</surname> <given-names>P.</given-names></name> <name><surname>Yu</surname> <given-names>Y.</given-names></name> <name><surname>Cui</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>A soybean dual-specificity kinase, GmSARK, and its Arabidopsis homolog, AtSARK, regulate leaf senescence through synergistic actions of auxin and ethylene</article-title>. <source>Plant Physiol.</source> <volume>157</volume>, <fpage>2131</fpage>&#x02013;<lpage>2153</lpage>. <pub-id pub-id-type="doi">10.1104/pp.111.182899</pub-id><pub-id pub-id-type="pmid">22034630</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>X. H.</given-names></name> <name><surname>Zhao</surname> <given-names>H. J.</given-names></name> <name><surname>Liu</surname> <given-names>Q. L.</given-names></name> <name><surname>Frank</surname> <given-names>T.</given-names></name> <name><surname>Engel</surname> <given-names>K. H.</given-names></name> <name><surname>An</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Mutations of the multi-drug resistance-associated protein ABC transporter gene 5 result in reduction of phytic acid in rice seeds</article-title>. <source>Theor. Appl. Genet.</source> <volume>119</volume>, <fpage>75</fpage>&#x02013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1007/s00122-009-1018-1</pub-id><pub-id pub-id-type="pmid">19370321</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Yang</surname> <given-names>W.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>Zhou</surname> <given-names>X.</given-names></name> <name><surname>Jiang</surname> <given-names>L.</given-names></name> <name><surname>Ma</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Analysis of weighted co-regulatory networks in maize provides insights into new genes and regulatory mechanisms related to inositol phosphate metabolism</article-title>. <source>BMC Genomics</source> <volume>17</volume>:<fpage>129</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-016-2476-x</pub-id><pub-id pub-id-type="pmid">26911482</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>M.</given-names></name> <name><surname>Dahmen</surname> <given-names>J. L.</given-names></name> <name><surname>Stacey</surname> <given-names>G.</given-names></name> <name><surname>Cheng</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data</article-title>. <source>BMC Bioinformatics</source> <volume>14</volume>:<fpage>278</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-14-278</pub-id><pub-id pub-id-type="pmid">24053776</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup><ext-link ext-link-type="uri" xlink:href="https://phytozome.jgi.doe.gov/">https://phytozome.jgi.doe.gov/</ext-link></p></fn>
<fn id="fn0002"><p><sup>2</sup><ext-link ext-link-type="uri" xlink:href="https://soybase.org/">https://soybase.org/</ext-link></p></fn>
<fn id="fn0003"><p><sup>3</sup><ext-link ext-link-type="uri" xlink:href="http://www.plantcyc.org/databases/soycyc/7.0">http://www.plantcyc.org/databases/soycyc/7.0</ext-link></p></fn>
<fn id="fn0004"><p><sup>4</sup><ext-link ext-link-type="uri" xlink:href="http://www.plantcyc.org/">http://www.plantcyc.org/</ext-link></p></fn>
</fn-group>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term><italic>lpa</italic></term>
<def><p>Low phytic acid</p></def></def-item>
<def-item><term>MIPS</term>
<def><p><italic>myo-</italic>inositol phosphate synthase</p></def></def-item>
<def-item><term>MRP</term>
<def><p>Multi-drug resistance protein</p></def></def-item>
<def-item><term>DEG</term>
<def><p>Differentially expressed gene</p></def></def-item>
<def-item><term>RFO</term>
<def><p>Raffinose family oligosaccharide</p></def></def-item>
<def-item><term>ABA</term>
<def><p>Abscisic acid.</p></def></def-item>
</def-list>
</glossary>
<fn-group>
<fn fn-type="financial-disclosure"><p><bold>Funding.</bold> This project was funded by Bio-design and Bioprocessing Research Center, John Lee Pratt Fellowship Program, Agricultural Experiment Station Hatch Program, and Open Access Subvention Fund&#x02014;all at Virginia Tech.</p>
</fn>
</fn-group>
</back>
</article>