<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2011.00040</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Methods Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Mass Spectra-Based Framework for Automated Structural Elucidation of Metabolome Data to Explore Phytochemical Diversity</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Matsuda</surname> <given-names>Fumio</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Nakabayashi</surname> <given-names>Ryo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Sawada</surname> <given-names>Yuji</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Suzuki</surname> <given-names>Makoto</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Hirai</surname> <given-names>Masami Y.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Kanaya</surname> <given-names>Shigehiko</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Saito</surname> <given-names>Kazuki</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff5"><sup>5</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>RIKEN Plant Science Center</institution> <country>Yokohama, Japan</country></aff>
<aff id="aff2"><sup>2</sup><institution>Organization of Advanced Science and Technology, Kobe University</institution> <country>Kobe, Japan</country></aff>
<aff id="aff3"><sup>3</sup><institution>Japan Science and Technology Agency</institution> <country>CREST, Kawaguchi, Japan</country></aff>
<aff id="aff4"><sup>4</sup><institution>Graduate School of Information Science, Nara Institute of Science and Technology</institution> <country>Ikoma, Japan</country></aff>
<aff id="aff5"><sup>5</sup><institution>Graduate School of Pharmaceutical Sciences, Chiba University</institution> <country>Chiba, Japan</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Alisdair Fernie, Max Planck Institut for Plant Physiology, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Asaph Aharoni, Weizmann Institute of Science, Israel; Takayuki Tohge, Max Planck Institute of Molecular Plant Physiology, Germany</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Kazuki Saito, Metabolome Research Group, RIKEN Plant Science Center, Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan. e-mail: <email>ksaito&#x00040;psc.riken.jp</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Plant Physiology, a specialty of Frontiers in Plant Science.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>22</day>
<month>08</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="collection">
<year>2011</year>
</pub-date>
<volume>2</volume>
<elocation-id>40</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>04</month>
<year>2011</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>08</month>
<year>2011</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2011 Matsuda, Nakabayashi, Sawada, Suzuki, Hirai, Kanaya and Saito.</copyright-statement>
<copyright-year>2011</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article subject to a non-exclusive license between the authors and Frontiers Media SA, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and other Frontiers conditions are complied with.</p></license>
</permissions>
<abstract>
<p>A novel framework for automated elucidation of metabolite structures in liquid chromatography&#x02013;mass spectrometer metabolome data was constructed by integrating databases. High-resolution tandem mass spectra data automatically acquired from each metabolite signal were used for database searches. Three distinct databases, KNApSAcK, ReSpect, and the PRIMe standard compound database, were employed for the structural elucidation. The outputs were retrieved using the CAS metabolite identifier for identification and putative annotation. A simple metabolite ontology system was also introduced to attain putative characterization of the metabolite signals. The automated method was applied for the metabolome data sets obtained from the rosette leaves of 20 <italic>Arabidopsis</italic> accessions. Phenotypic variations in novel <italic>Arabidopsis</italic> metabolites among these accessions could be investigated using this method.
</p>
</abstract>
<kwd-group>
<kwd>metabolome analysis</kwd>
<kwd>liquid chromatography-mass spectrometry</kwd>
<kwd>structural elucidation</kwd>
<kwd>database searching</kwd>
<kwd>natural variations in secondary metabolite</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="48"/>
<page-count count="10"/>
<word-count count="6775"/>
</counts>
</article-meta>
</front>
<body><sec sec-type="introduction">
<title>Introduction</title>
<p>The ability to produce various secondary metabolites has evolved in plants for the purpose of self-defense, environmental adaptation, and interaction with other organisms. Because humans utilize phytochemicals as a rich resource for various purposes such as the production of pharmaceuticals, further understanding of the genetic background behind the diversity of secondary metabolites produced by plants will facilitate more intensive application of these compounds (Saito and Matsuda, <xref ref-type="bibr" rid="B39">2010</xref>). Recent progress in gene sequencing has enabled generation of a large volume of data on genetic polymorphisms that is related to natural variations in phytochemicals (Clark et al., <xref ref-type="bibr" rid="B6">2007</xref>; Ossowski et al., <xref ref-type="bibr" rid="B34">2008</xref>; Zeller et al., <xref ref-type="bibr" rid="B48">2008</xref>). Accordingly, it is expected that novel genes and functions of plant secondary metabolism as well as those involved in evolution could be investigated based on the association between genotypes and metabolic phenotypes (metabolotypes; Plantegenet et al., <xref ref-type="bibr" rid="B35">2009</xref>; Weigel and Mott, <xref ref-type="bibr" rid="B45">2009</xref>). Since the metabolotype data required for such analyses is both qualitative (structure of secondary metabolites) and quantitative (amount of metabolite), metabolic profiling analysis using liquid chromatography&#x02013;tandem mass spectrometry (LC&#x02013;MS) has been used to obtain comprehensive profiles of plant secondary metabolites (De Vos et al., <xref ref-type="bibr" rid="B9">2007</xref>). While qualitative data describing hundreds of metabolite signals have routinely been acquired during analysis (Keurentjes et al., <xref ref-type="bibr" rid="B16">2006</xref>), structural elucidation of the observed signals using LC&#x02013;MS is still difficult (Moco et al., <xref ref-type="bibr" rid="B28">2006</xref>; Bottcher et al., <xref ref-type="bibr" rid="B4">2007</xref>; Iijima et al., <xref ref-type="bibr" rid="B13">2008</xref>; Matsuda et al., <xref ref-type="bibr" rid="B22">2010a</xref>).</p>
<p>The structure-related information available from LC&#x02013;MS analysis includes the retention time, exact mass number, and tandem mass spectrum (MS/MS spectrum). The structure associated with each metabolite signal has been estimated by searching databases containing reference data using the information obtained from LC&#x02013;MS analysis (Moco et al., <xref ref-type="bibr" rid="B29">2007</xref>; Kind and Fiehn, <xref ref-type="bibr" rid="B17">2010</xref>; Neumann and Bocker, <xref ref-type="bibr" rid="B33">2010</xref>). The amount of information obtained from database searches varies among metabolite peaks; therefore, four levels of structural elucidation have been standardized by the metabolome standard initiative (MSI) as follows (Fiehn et al., <xref ref-type="bibr" rid="B11">2007</xref>; Sumner et al., <xref ref-type="bibr" rid="B43">2007</xref>): (1) Identified: a minimum of two independent data points relative to an authentic compound analyzed under identical experimental conditions. (2) Putatively annotated: without chemical reference standards, based on physicochemical properties and/or spectral similarity with public/commercial spectral libraries. (3) Putatively characterized: based on characteristic physicochemical properties of a chemical class of compounds, or spectral similarity to known compounds of a chemical class. (4) Unknown. Based on the standardized format, a framework for automated structural elucidation is required to explore the structural diversity of phytochemicals. However, several technical issues must be solved before database-assisted elucidation of metabolite structures (Kind and Fiehn, <xref ref-type="bibr" rid="B17">2010</xref>; Neumann and Bocker, <xref ref-type="bibr" rid="B33">2010</xref>). One bottleneck is represented by a shortage of standard compounds and their associated MS/MS spectra data. Owing to the poor availability of plant secondary metabolites, only a very low percentage of the observed metabolite signals can be assigned by comparison of the chromatographic behavior with chemical reference standards (Matsuda et al., <xref ref-type="bibr" rid="B22">2010a</xref>). Although great effort has been put into construction of the MS/MS spectral databases (Moco et al., <xref ref-type="bibr" rid="B28">2006</xref>; Wishart et al., <xref ref-type="bibr" rid="B47">2007</xref>; Horai et al., <xref ref-type="bibr" rid="B12">2010</xref>), further enrichment is required for structural elucidation of the wider range of metabolites. Another difficulty is the low reproducibility of the structure-related information. For instance, the fragment patterns in MS/MS spectra depend on the mass spectrometers and their operating conditions. The error derived from the analysis also exists in the high-resolution mass spectral data (Mihaleva et al., <xref ref-type="bibr" rid="B27">2008</xref>; Matsuda et al., <xref ref-type="bibr" rid="B25">2009b</xref>). Owing to these technical problems, elucidation of the structure associated with signals corresponding to metabolomes is time consuming, which has hampered the investigation of phytochemical diversity across plant species or ecotypes.</p>
<p>In this study, a novel framework for the automated elucidation of metabolite structures in LC&#x02013;MS metabolome data was constructed by integrating three different databases. To overcome the aforementioned problems, the MS/MS spectra databases were enriched using literature reported information. Additionally, the high-resolution MS/MS spectra data were redundantly acquired from each metabolite signal to improve the quality of structure-related information that was used to search the databases. The outputs were retrieved using the CAS metabolite identifier for identification and putative annotation. A simple metabolite ontology system was also introduced to enable putative characterization of the metabolite signals. The automated method developed here was applied for metabolome data sets obtained from the rosette leaves of 20 <italic>Arabidopsis</italic> accessions, from which phenotypic variations in novel <italic>Arabidopsis</italic> metabolites among these accessions could be investigated.</p>
</sec>
<sec sec-type="materials|methods" id="s1">
<title>Materials and Methods</title>
<sec>
<title>Plant materials</title>
<p>Seeds of 20 accessions of <italic>Arabidopsis</italic> <italic>thaliana</italic>, CS22676 Bay-0, CS22677 Bor-4, CS22678 Br-0, CS22679 Bur-0, CS22680 C24, CS22681 Col-0, CS22682 Cvi-0, CS22683 Est-1, CS22684 Fei-0, CS22685 Goettingen-7, CS22686 Ler-1, CS22687 NFA-8, CS22688 RRS-7, CS22689 RRS-10, CS22690 Sha, CS22691 Tamm-2, CS22692 Ts-1, CS22693 Tsu-1, CS22694 Van-0, and CS22695 Lov-5, were obtained from the ABRC. The seeds were soaked on MS agar plates and then incubated at 22&#x000B0;C under 16&#x02009;h day and 8&#x02009;h night conditions. At 18&#x02009;days after germination, the aerial parts of the seedlings were harvested.</p>
</sec>
<sec>
<title>Metabolome analysis using LC-ESI-Q-Tof/MS</title>
<p>The collected sample tissues were weighed and stored at &#x02212;80&#x000B0;C until analysis. The frozen tissues of independent plants were homogenized in five volumes of 80% aqueous methanol containing 0.1% acetic acid, 0.5&#x02009;mg/l of lidocaine, and <italic>d</italic>-camphor sulfonic acid (Tokyo Kasei, Tokyo, Japan) using a mixer mill (MM 300, Retsch) with a zirconia bead for 6&#x02009;min at 20&#x02009;Hz. Next, the samples were centrifuged at 15,000&#x02009;g for 10&#x02009;min and filtered (Ultrafree-MC filter, 0.2&#x02009;&#x003BC;m; Millipore, Bedford, MA, USA). The sample extracts were then applied to an HLB &#x003BC;Elution plate (Waters, Milford, MA, USA) that had been equilibrated with 80% aqueous methanol containing 0.1% acetic acid. The eluates (3&#x02009;&#x003BC;l) were subsequently subjected to metabolome analysis by LC coupled with electrospray quadrupole time-of-flight tandem MS using an Acquity BEH ODS column (LC-ESI-Q-Tof/MS, HPLC: Waters Acquity UPLC system; MS: Waters Q-Tof Premier). The metabolome analysis and data processing were conducted according to a previously described method (Matsuda et al., <xref ref-type="bibr" rid="B26">2009c</xref>, <xref ref-type="bibr" rid="B22">2010a</xref>). Briefly, the metabolome data were obtained in the negative ion mode (<italic>m</italic>/<italic>z</italic> 100&#x02013;2,000; dwell time: 0.45&#x02009;s; interscan delay: 0.05&#x02009;s, centroid), from which a data matrix was generated with the aid of MetAlign (De Vos et al., <xref ref-type="bibr" rid="B9">2007</xref>; Lommen, <xref ref-type="bibr" rid="B20">2009</xref>). In order to reduce a redundancy of the data matrix, fragment ions were removed by a following procedure. A metabolite signal was removed from the matrix when there is another intense peak eluted at similar retention times [within the retention time threshold (&#x0003C;0.5&#x02009;s)] with the highest correlation coefficient above the threshold value (&#x0003E;0.8). The analysis was conducted using five biological replicates of 20 accessions, from which a data matrix composed of 703 signals (peaks) was obtained (<bold>Table S1</bold> in Supplementary Material). The number of signals would not reflect an exact number of detected metabolites due to the complex nature of the metabolome data.</p>
<p>To construct MS2T libraries, the extracts of five ecotypes were mixed and utilized for the MS2T data acquisition. The analyses were repeatedly conducted for four mixtures by previously described methods (Matsuda et al., <xref ref-type="bibr" rid="B26">2009c</xref>). Each MS2T entry was assigned a unique accession code, such as ATH10n03690, in which ATH10n is the name of the library and 03690 is the entry number. All data obtained in this study are available at the PRIMe website<xref ref-type="fn" rid="fn1"><sup>1</sup></xref> (Akiyama et al., <xref ref-type="bibr" rid="B1">2008</xref>).</p>
</sec>
</sec>
<sec>
<title>Databases and Software</title>
<p>The ReSpect (RIKEN MS/MS spectra database for phytochemicals; 2011 January version), KNApSAcK (2010.12.24 version; Shinbo et al., <xref ref-type="bibr" rid="B41">2006</xref>; Takahashi et al., <xref ref-type="bibr" rid="B44">2008</xref>), and PRIMe standard compound database (2009 November version) were used in this study. The genetic polymorphism data from 20 <italic>Arabidopsis</italic> accessions were downloaded from the TAIR web site (Clark et al., <xref ref-type="bibr" rid="B6">2007</xref>; Poole, <xref ref-type="bibr" rid="B36">2007</xref>). All data processing procedures were conducted using the in-house script written with Perl. Structural elucidation work was performed in-batch search for all metabolite signals.</p>
<p>In the automated structural elucidation procedure, several thresholds were required to conduct the database searches. The thresholds used in this study are described in Figures <xref ref-type="fig" rid="F2">2</xref> and <xref ref-type="fig" rid="F3">3</xref>. To search the MS/MS spectra, the similarity scores were determined by employing dot product method with mass tolerance at 0.5&#x02009;Da (Stein and Scott, <xref ref-type="bibr" rid="B42">1994</xref>). The two spectra were considered to be the similar when the similarity score was greater than 0.6. For hierarchical clustering analysis, log2-transformed <italic>Z</italic>-scored signal intensity data were processed using MEV version 4.4 (Saeed et al., <xref ref-type="bibr" rid="B38">2003</xref>, <xref ref-type="bibr" rid="B37">2006</xref>).</p>
<sec>
<title>Results</title>
</sec>
<sec>
<title>Acquisition of metabolome data from 20 <italic>Arabidopsis</italic> accessions</title>
<p>To investigate variations in the composition of secondary metabolites among <italic>Arabidopsis</italic> strains (accessions), metabolic profile data were obtained from the rosette leaves of 20 accessions of <italic>Arabidopsis</italic> by LC-ESI-Q-Tof/MS analysis (Matsuda et al., <xref ref-type="bibr" rid="B26">2009c</xref>, <xref ref-type="bibr" rid="B22">2010a</xref>). The 20 diverse accessions evaluated herein were previously selected by Clark et al. (<xref ref-type="bibr" rid="B6">2007</xref>) to investigate the genetic variations within the population of <italic>Arabidopsis</italic>. The analysis was conducted using five biological replicates of 20 accessions, from which a data matrix composed of 703 metabolite signals (peaks) was obtained (<bold>Table S1</bold> in Supplementary Material). Here, the dataset was designed as AtMetExpress 20 Ecotypes and each metabolite signal was addressed by a unique ID, such as aen00884. Hierarchical clustering analysis of the dataset revealed that there were large variations in the metabolic profiles across 20 accessions, which should be derived from those genetic polymorphisms (Figure <xref ref-type="fig" rid="F1">1</xref>). To acquire information for structural elucidation of those metabolite signals, MS/MS spectra data were obtained from identical extracts by using the automated data acquisition methods described in Section <xref ref-type="sec" rid="s1">&#x0201C;Materials and Methods.&#x0201D;</xref> Since the analyses were conducted repeatedly, multiple MS/MS spectra data were recorded for each metabolite signal (Matsuda et al., <xref ref-type="bibr" rid="B26">2009c</xref>). Consequently, MS/MS spectral tag (MS2T) libraries containing 126,889 accessions were constructed (Table <xref ref-type="table" rid="T1">1</xref>). Each MS2T entry was assigned a unique ID such as ATH67n06391. Based on the MS2T data, the structure of each metabolite signal was elucidated by searching the databases.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Hierarchical clustering analysis of metabolic profile data of the AtMetExpress 20 ecotype dataset</bold>. Log2-transformed and <italic>Z</italic>-scored signal intensity data were hierarchically classified using the average linkage clustering methods.</p></caption>
<graphic xlink:href="fpls-02-00040-g001.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>List of databases and datasets used in this study</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Databases</th>
<th align="left">Description</th>
<th align="left">Number of accessions</th>
<th align="left">Data source</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">AtMetExpress 20 ecotype</td>
<td align="left" valign="top">Metabolic profile data obtained from 20 accessions of <italic>Arabidopsis</italic> strains</td>
<td align="left">100 metabolic profile data (20 accessions by five biological replicates) containing 703 metabolite signals</td>
<td align="left" valign="top"><uri xlink:href="http://prime.psc.riken.jp/?action&#x02009;&#x0003D;&#x02009;drop_index">http://prime.psc.riken.jp/?action&#x02009;&#x0003D;&#x02009;drop_index</uri></td>
</tr>
<tr>
<td align="left" valign="top">MS2T library</td>
<td align="left" valign="top">Library of high-resolution MS/MS spectra data obtained from the actual <italic>Arabidopsis</italic> extracts</td>
<td align="left">Subset of MS2T library containing 126,889 accessions obtained from the <italic>Arabidopsis</italic> ecotypes were used in this study</td>
<td align="left" valign="top"><uri xlink:href="http://prime.psc.riken.jp/lcms/ms2tview/ms2tview.html">http://prime.psc.riken.jp/lcms/ms2tview/ms2tview.html</uri></td>
</tr>
<tr>
<td align="left" valign="top">ReSpect for phytochemicals</td>
<td align="left" valign="top">MS/MS spectra database of standard and literature reported phytochemicals</td>
<td align="left">Literature data: 3,136 records corresponding to 2,741 metabolites Q-TOF/MS data 1,050 records/575 standard compounds QqQ/MS data: 4,258 records/861 standards. Total 8,444 records/3,595 metabolites</td>
<td align="left" valign="top"><uri xlink:href="http://spectra.psc.riken.jp/">http://spectra.psc.riken.jp/</uri></td>
</tr>
<tr>
<td align="left" valign="top">RIKEN Standard compound database</td>
<td align="left" valign="top">List of standard compounds and physicochemical data</td>
<td align="left" valign="top">LC&#x02013;MS/MS retention time and <italic>m</italic>/<italic>z</italic> data of 600 compounds</td>
<td align="left" valign="top"><uri xlink:href="http://prime.psc.riken.jp/lcms/data/StandardCompound/">http://prime.psc.riken.jp/lcms/data/StandardCompound/</uri></td>
</tr>
<tr>
<td align="left" valign="top">KNApSAcK</td>
<td align="left" valign="top">Comprehensive species&#x02013;metabolite relationship database</td>
<td align="left" valign="top">Collection of 50,048 unique metabolites and 101,500 metabolite&#x02013;species pairs</td>
<td align="left" valign="top"><uri xlink:href="http://kanaya.naist.jp/KNApSAcK_Family/">http://kanaya.naist.jp/KNApSAcK_Family/</uri></td>
</tr>
<tr>
<td align="left" valign="top">Metabolite ontology</td>
<td align="left" valign="top">Simple classification of phytochemicals</td>
<td align="left" valign="top">322 ontology terms are assigned for the ReSpect database</td>
<td align="left" valign="top">In preparation</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Preparation of standard compound databases and the compound ontology system</title>
<p>Three distinct databases, KNApSAcK, ReSpect, and the PRIMe standard compound database, were employed for the structural elucidation (Table <xref ref-type="table" rid="T1">1</xref>). ReSpect is a new web data resource that incorporates records from existing literature as well as the MS/MS data from our standard compounds. This database contains 8,444 records corresponding to 3,595 metabolites. ReSpect is the first tool for annotation of phytochemicals that is based on downloadable MS/MS data resources and databases (Sawada et al., in preparation). KNApSAcK is a comprehensive species&#x02013;metabolite relationship database developed by the Kanaya lab in NAIST (Shinbo et al., <xref ref-type="bibr" rid="B41">2006</xref>; Takahashi et al., <xref ref-type="bibr" rid="B44">2008</xref>). KNApSAcK contains the structural data of 50,048 metabolites and 101,500 metabolite&#x02013;species pairs. In this study, KNApSAcK was used to elucidate molecular formulas of candidate metabolites from the high-resolution mass spectra data. The PRIMe standard compound database contains a retention time and <italic>m</italic>/<italic>z</italic> data of 600 authentic compounds acquired using an identical analytical method (Matsuda et al., <xref ref-type="bibr" rid="B26">2009c</xref>). For the automated metabolite annotations, accessions in these databases were assigned with corresponding CAS identifiers.</p>
<p>Since CAS identifiers basically address a structurally confirmed metabolites (Matsuda et al., <xref ref-type="bibr" rid="B24">2009a</xref>), the metabolite annotation procedure based on the identifier cannot deal with information describing partially characterized metabolites. For example, the metabolite structures were often estimated to be from a compound class such as &#x0201C;kaempferol glycoside&#x0201D; and &#x0201C;amino acid derivative&#x0201D; (Bottcher et al., <xref ref-type="bibr" rid="B4">2007</xref>; Iijima et al., <xref ref-type="bibr" rid="B13">2008</xref>; Matsuda et al., <xref ref-type="bibr" rid="B22">2010a</xref>). In the case of gene annotation, each gene was tentatively annotated by gene ontology terms that were manually assigned or automatically estimated from the sequence similarities. Although detailed compound ontology systems and vocabularies have been developed using several databases such as CheBi and KEGG (Degtyarenko et al., <xref ref-type="bibr" rid="B10">2008</xref>; Kanehisa et al., <xref ref-type="bibr" rid="B15">2008</xref>; Matsuda et al., <xref ref-type="bibr" rid="B24">2009a</xref>), a simple compound ontology system was newly introduced in this study to cover the wide range of phytochemicals. Here entries in the PRIMe databases were classified within three levels, ranging from basic (Class 1) to detailed (Class 3) with considering the basic skeleton and modified parts of metabolites (<bold>Table S2</bold> in Supplementary Material). The ontology terms prepared in this study is not comprehensive, since the classification system was arbitrary prepared by manually curating the entries of ReSpect MS/MS spectra database for an assistance of structural elucidation of metabolome data. For instance, partially characterized metabolites could be classified as follows: kaempferol-3,7-dirhamnoside is a member of Class 1: flavonoid, Class 2: flavonol, and Class 3: kaempferol glycoside; tryptophan is a member of Class 1: amino acid and Class 2: tryptophan; and pinoresinol-dihexoside is a member of Class 1: phenylpropanoid, Class 2: lignan, and Class 3: pinoresinol glycoside.</p>
<p>These metabolite classifications have been assigned to all accessions in the ReSpect and PRIMe standard compound databases. A detailed classification study is currently in progress for KNApSAcK, and 60% of the accessions in this database have been assigned to Class 1 or 2.</p>
</sec>
<sec>
<title>Identification and putative annotation using cas identifiers</title>
<p>Based on the MS2T libraries and reference databases, the metabolite signals in the AtMetExpress 20 ecotypes dataset were identified or putatively annotated using the following automated procedure. For the case of a metabolite signal assigned as aen00884 (Rt 3.82 and <italic>m</italic>/<italic>z</italic> 422), the MS2T library contains 19 MS2T accessions acquired from the identical metabolite signal with various spectral quality (Figure <xref ref-type="fig" rid="F2">2</xref>A). In other words, the metabolite signal was tagged with 19 accessions of corresponding MS2Ts. Each MS2T accession consists of the exact mass number of the precursor ion and MS/MS spectra data. Thus, MS/MS spectra data were submitted to the ReSpect database to identify metabolites producing similar MS/MS spectra. In the case of the MS2T accession, ATH67n06391, the MS/MS spectrum was similar to that of 13 compounds whose CAS numbers are obtained as search results (Figure <xref ref-type="fig" rid="F2">2</xref>B). Additionally, the exact mass number of the precursor ion was used to search the KNApSAcK database to find metabolites possessing a highly similar mass number, by which the CAS number of 1 metabolite was obtained. A common CAS number (499-30-9, 2-phenylglucolsinolate) observed in the outputs of both the ReSpect and KNApSAcK searches indicated that it is a candidate structure of the metabolite signal deduced from the MS2T data. To improve the search quality, the procedure was repeated for all 19 MS2T accessions, and the same results were observed for 11 MS2Ts. Since identical metabolites were elucidated using two distinct search methods with high reproducibility (&#x0003E;50%), it is likely that the metabolite signal was derived from 2-phenylethylglucosinolate or its structural isomers. Based on the MSI standard, the metabolite signal could be putatively annotated using the automated structure elucidation procedure (Figure <xref ref-type="fig" rid="F2">2</xref>D).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Procedure for peak identification and putative annotation using CAS identifiers</bold>. For the case of a metabolite signal, aen00884, the MS2T library contains 19 MS2T accessions acquired from the identical metabolite signal <bold>(A)</bold>. MS/MS spectra data in MS2T were submitted to the ReSpect database <bold>(B)</bold>. The exact mass number data of the precursor ion was used to search the KNApSAcK database <bold>(C)</bold>. Since a common CAS number (499-30-9, 2-phenylglucolsinolate) in the outputs of both searches were observed for 11 MS2Ts, the metabolite signal was putatively annotated as 2-phenylethylglucosinolate <bold>(D)</bold>. Since the authentic compound of 2-phenylethylglucosinolate,CAS 499-30-9, was also detected at a similar retention time and mass number, the metabolite signal was identified as 2-phenylethylglucosinolate <bold>(F)</bold>.</p></caption>
<graphic xlink:href="fpls-02-00040-g002.tif"/>
</fig>
<p>Furthermore, an automated search of the PRIMe standard compound database revealed that the authentic compound of 2-phenylethylglucosinolate, CAS 499-30-9, was also detected at a similar retention time and mass number as the queried metabolite signal. Since three distinct pieces of information, including the MS/MS spectra, exact mass number, and chromatographic behavior, were matched to the identical metabolite, the metabolite signal was identified as 2-phenylethylglucosinolate (Figure <xref ref-type="fig" rid="F2">2</xref>F).</p>
<p>Among the 703 metabolite signals in the AtMetExpress 20 ecotype dataset, 25 and 106 peaks could be identified and putatively annotated, respectively, using the procedure described above (<bold>Table S1</bold> in Supplementary Material). Additionally, comparison with the manually curated results produced in our previous study (Matsuda et al., <xref ref-type="bibr" rid="B22">2010a</xref>) revealed no significant error among the 32 commonly annotated metabolite signals.</p>
</sec>
<sec>
<title>Processing of putatively characterized metabolites</title>
<p>In addition to the identification and putative annotation using the CAS metabolite identifiers, putative characterization of the metabolite signals was conducted by introducing the metabolite ontology system. The procedure is explained using the metabolite signal described above as an example (peak ID: aen008844). For each MS2T accession tagged to the metabolite signal, MS/MS spectra data and the exact mass number were used for ReSpect (Figure <xref ref-type="fig" rid="F3">3</xref>A) and KNApSAcK (Figure <xref ref-type="fig" rid="F3">3</xref>B) searches. The compound ontology information instead of CAS identifiers was obtained as outputs in these procedures. The outputs of KNAsSAcK and ReSpect searches were compared to identify a common result, which is a compound ontology estimated from the MS2T accession. Repeated searching for 19 MS2T accessions of aen008844 resulted in 11 MS2Ts being identified as glucosinolate based on the Class 1 ontology. The Class 2 ontology benzylglucosinolate was not accepted, because the result was estimated from only 2 MS2T accessions. Using the procedure, the metabolite signal was successfully characterized as glucosinolate based on the Class 1 ontology (Figure <xref ref-type="fig" rid="F3">3</xref>C).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Procedure for putative characterization of metabolite signal using metabolite ontology</bold>. For 19 MS2T accessions tagged to the metabolite signal, aen008844, MS/MS spectra data and the exact mass number were used for ReSpect <bold>(A)</bold> and KNApSAcK <bold>(B)</bold> searches. The compound ontology information in KNAsSAcK and ReSpect searches were compared to identify a common result. Repeated searching for 19 MS2T accessions resulted in 11 MS2Ts being identified as glucosinolate based on the Class 1 ontology <bold>(C)</bold>.</p></caption>
<graphic xlink:href="fpls-02-00040-g003.tif"/>
</fig>
<p>This procedure was conducted for all metabolite signals of the AtMetExpress 20 Ecotype dataset, and 188 among 703 metabolite signals were automatically characterized. In the case of Class 1 ontology, 1 alkaloids, 7 amino acids, 33 flavonoids, 68 glucosinolates, 47 phenylpropanoids, 4 terpenoids, and 28 other characterizations were assigned to the metabolome data (<bold>Table S2</bold> in Supplementary Material).</p>
</sec>
<sec>
<title>Structural elucidation of metabolite signals using the database search results</title>
<p>Based on the results obtained using the automated methods, the structures of the novel <italic>Arabidopsis</italic> metabolites were manually elucidated. Among the putatively characterized metabolite signals, the metabolite signal aen006966 (Rt 4.051&#x02009;min and <italic>m</italic>/<italic>z</italic> 369) was putatively characterized as being in Class 1: phenylpropanoid. The MS/MS spectral data for ATH67n05643 (Figure <xref ref-type="fig" rid="F4">4</xref>A) indicated that the metabolite would be a coumarin hexoside based on the fragment pattern. An additional KNApSAcK search suggested that a plausible candidate of the metabolite is fraxin (CAS 524-32-1), although the position of glycosylation is unclear. Using a similar procedure, a metabolites putatively characterized as Class 1: phenylpropanoid (aen012096: Rt 3.848&#x02009;min, <italic>m</italic>/<italic>z</italic> 501) were found to be malonyl-hexosyl-sinapate (Figure <xref ref-type="fig" rid="F4">4</xref>B).</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>MS/MS spectra of putatively characterized metabolites</bold>. The predicted molecular formulas of key fragments, neutral losses, and elucidated structures of <bold>(A)</bold> putative fraxin, <bold>(B)</bold> hexosylsinapoylmalate, and <bold>(C)</bold> hexosyl-coumarin are shown in the figure.</p></caption>
<graphic xlink:href="fpls-02-00040-g004.tif"/>
</fig>
<p>Structural elucidation of the putative phenylpropanoid aen006925 (Rt 5.762, <italic>m</italic>/<italic>z</italic> 367) indicated that this metabolite is a hexoside of an unknown aglycon (Figure <xref ref-type="fig" rid="F4">4</xref>C). Because the molecular formula of the aglycone was deduced to be C<sub>11</sub>H<sub>9</sub>O<sub>4</sub> (<italic>m</italic>/<italic>z</italic> 205.0502 <italic>obsd</italic>, <italic>m</italic>/<italic>z</italic> 205.0500 <italic>theor</italic>), the aglycone should be a methylated hydroxy-coumarin (according to the presence of four oxygen atoms, aglycone should contain at least two hydroxy-groups on the coumarin moiety), or dimethoxycoumarin. Thus, the compound aen006925 can be a glycoside (or <italic>C</italic>-glycoside) of these two aglycones, both of which are novel <italic>Arabidopsis</italic> metabolites. While strict structural elucidation must be conducted following the protocols accepted for natural product chemistry (Nakabayashi et al., <xref ref-type="bibr" rid="B31">2009</xref>; Matsuda et al., <xref ref-type="bibr" rid="B23">2010b</xref>), the results presented here demonstrate that a portion of the phytochemical diversity in <italic>Arabidopsis</italic> could be elucidated from MS/MS spectra via automated structural elucidation.</p>
</sec>
<sec>
<title>Phenotypic variations across <italic>Arabidopsis</italic> accessions</title>
<p>The structural elucidation based on the compound ontology information enabled us to deal with putatively characterized metabolite signals such as glucosinolates and flavonoids without strict metabolite identification or annotation. Here, the natural variations in accumulation levels among 20 <italic>Arabidopsis</italic> accessions were compared for metabolites belonging to lignan, amino acids, flavonoids, and glucosinolate (Figure <xref ref-type="fig" rid="F5">5</xref>). The metabolites assigned by lignan (Figure <xref ref-type="fig" rid="F5">5</xref>A) and amino acid (Figure <xref ref-type="fig" rid="F5">5</xref>B) were constitutively accumulated with small natural variations, suggesting that the production of those metabolites is essential for <italic>Arabidopsis</italic> (Matsuda et al., <xref ref-type="bibr" rid="B22">2010a</xref>). Indeed, more than 10 genes encoding the dirigent protein for lignan biosynthesis are present in the <italic>Arabidopsis</italic> genome (Burlat et al., <xref ref-type="bibr" rid="B5">2001</xref>; Davin and Lewis, <xref ref-type="bibr" rid="B7">2005</xref>; Nakatsubo et al., <xref ref-type="bibr" rid="B32">2008</xref>). This redundancy would contribute to the constitutive production of lignans, although the details regarding their physiological role in the growth of <italic>Arabidopsis</italic> remain unknown. In contrast, the metabolites identified as flavonoids (Figure <xref ref-type="fig" rid="F5">5</xref>C) and glucosinolates (Figure <xref ref-type="fig" rid="F5">5</xref>D) tended to show larger natural variations among the 20 accessions. These results suggest that the levels of flavonoids and glucosinolates in rosette leaves are controlled by genetic polymorphisms, which would contribute to the adaptation of each accession to local environments (Li et al., <xref ref-type="bibr" rid="B19">2008</xref>; Bednarek and Osbourn, <xref ref-type="bibr" rid="B3">2009</xref>; Janowitz et al., <xref ref-type="bibr" rid="B14">2009</xref>; Sawada et al., <xref ref-type="bibr" rid="B40">2009</xref>; Manzaneda et al., <xref ref-type="bibr" rid="B21">2010</xref>; De Kraker and Gershenzon, <xref ref-type="bibr" rid="B8">2011</xref>). To investigate the association between large variations in metabolic phenotypes and genetic polymorphisms, we considered the levels of 3-hydroxy-<italic>n</italic>-propylglucosinolate (aen007244) among 20 accessions. Despite significant production of Bor-4, Tsu-1, Bay-0, and Ler-1, the glucosinolate was not detected from other accessions, including Col-0 (Figure <xref ref-type="fig" rid="F6">6</xref>A). Single nucleotide polymorphisms (SNPs) that commonly occurred in Bor-4, Tsu-1, Bay-0, and Ler-1, as well as did not occurred in other accessions, were searched against the re-sequence data produced by Clark et al. (<xref ref-type="bibr" rid="B6">2007</xref>). The results revealed that 80 SNPs of 96 corresponding SNPs formed a linkage disequilibrium (LD) block along the long arm of chromosome 4 (Figure <xref ref-type="fig" rid="F6">6</xref>B). Among the 28 ORFs in the 11-kb region (from At4g02870 to At4g03090), there is an enzyme gene responsible for the last step of hydroxyalkylglucosinolate biosynthesis (AOP3, At4g03050). Although the biological meaning of the LD are unclear, the association between the natural variations in the 3-hydroxy-<italic>n</italic>-propylglucosinolate levels and genetic polymorphisms in the AOP3 gene has been reported (Kliebenstein et al., <xref ref-type="bibr" rid="B18">2001</xref>; Wentzell et al., <xref ref-type="bibr" rid="B46">2007</xref>).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Natural variation in metabolite levels among 20 <bold><italic>Arabidopsis</italic></bold> accessions belonging to lignan (A), amino acid (B), flavonoid (C), and glucosinolate (D)</bold>. The relative abundances of metabolites were determined by dividing each metabolite level by the average level.</p></caption>
<graphic xlink:href="fpls-02-00040-g005.tif"/>
</fig>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>Association between levels of 3-hydroxy-n-propylglucosinolate and single nucleotide polymorphisms (SNPs) across 20 accessions of <italic>Arabidopsis</italic></bold>. <bold>(A)</bold> Heat-map representation of 3-hydroxy-n-propylglucosinolate levels in each accession. <bold>(B)</bold> Positions of SNPs associated with 3-hydroxy-n-propylglucosinolate levels on the <italic>Arabidopsis</italic> genome. Blue triangles indicate positions of the SNPs.</p></caption>
<graphic xlink:href="fpls-02-00040-g006.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion">
<title>Discussion</title>
<p>A framework for the automated structural elucidation of LC&#x02013;MS metabolome data was developed to investigate the structural diversity of phytochemicals. Although the framework requires a large amount of structure-related information (MS2T library) and intensive searches of large databases (Figures <xref ref-type="fig" rid="F2">2</xref> and <xref ref-type="fig" rid="F3">3</xref>), the processing of the AtMetExpress 20 ecotype dataset (Figure <xref ref-type="fig" rid="F1">1</xref>) demonstrated that the method is able to reasonably estimate metabolite structures. By referring to the automatically assigned information, the effort required for the manual curation of metabolome data could be drastically reduced (Figure <xref ref-type="fig" rid="F4">4</xref>), which accelerated the investigation of natural variations in the <italic>Arabidopsis</italic> secondary metabolites (Figures <xref ref-type="fig" rid="F5">5</xref> and <xref ref-type="fig" rid="F6">6</xref>). These results demonstrated that the framework is effective for the structural elucidation of LC&#x02013;MS metabolome data, although several technical improvements are required for more comprehensive annotation of the metabolites.</p>
<p>Since the MS/MS spectra database is one of the most important kernels in the framework (Figures <xref ref-type="fig" rid="F3">3</xref> and <xref ref-type="fig" rid="F4">4</xref>), the search results are highly dependent on the database quality. For example, processing of the AtMetExpress 20 ecotype dataset failed to identify metabolites belonging to alkaloids and terpenoids, probably because the current version of ReSpect contains poor entries of those metabolites in contrast to the rich flavonoids and glucosinolates data<xref ref-type="fn" rid="fn2"><sup>2</sup></xref>. This bias is derived from the available standard compounds and published MS/MS spectra data. However, the data dependency indicated that further enrichment of the MS/MS spectra database by the addition of alkaloids, terpenoids, and other phytochemicals could directly improve the results of the structural elucidation. To promote the integration and sharing of spectral data, all ReSpect contents were opened to the public from the PRIME Web site (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<p>Structures elucidated by an automated method should contain incorrect hits derived from errors in mass analyses, indicating that the false discovery rate (FDR) of large-scale search results must be evaluated (Matsuda et al., <xref ref-type="bibr" rid="B25">2009b</xref>; Saito and Matsuda, <xref ref-type="bibr" rid="B39">2010</xref>). In the case of the homology searches of gene sequences, the levels of FDR could be controlled using a probability-based searching algorithm such as BLAST (Altschul and Erickson, <xref ref-type="bibr" rid="B2">1985</xref>). In this study, the cosine product (dot product) method was employed to search MS/MS spectra because it is robust enough to identify identical spectra (Stein and Scott, <xref ref-type="bibr" rid="B42">1994</xref>). A drawback of this method is a FDR control since the similarity score is not based on probability. To reduce false positives, the output obtained from the ReSpect search was compared with that derived from KNApSAcK to identify common results (Figures <xref ref-type="fig" rid="F2">2</xref> and <xref ref-type="fig" rid="F3">3</xref>). The cross-check strategy should reduce false-positive hits, but many metabolite signals were assigned with no structural information. In the case of the AtMetExpress 20 Ecotype dataset, 94% of 703 metabolite signals were tagged by at least one MS2T, and the metabolite structures could be somehow estimated for approximately 30% of the signals (<bold>Table S1</bold> in Supplementary Material). Further development of a probability-based algorithm to determine the similarity between MS/MS spectra is required to increase the numbers of structurally elucidated metabolite signals while controlling FDR (Mylonas et al., <xref ref-type="bibr" rid="B30">2009</xref>).</p>
<p>In the framework developed herein, putative characterization of the metabolite signal could be attained by introducing a new simple ontology system to cover the wider range of plant metabolites. Additionally, the performance of the ontology system was demonstrated for the AtMetExpress 20 ecotype datasets, which revealed the diversity of secondary metabolites in <italic>Arabidopsis</italic> based on structural elucidation using the putatively characterized information. The comparison of levels of putatively characterized metabolites revealed the genetic background of metabolotype variations, which would facilitate the analysis of these associations with genetic polymorphism and evolution.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>We would like to thank Drs. K. Hanada, K. Akiyama, T. Sakurai, R. Niida, and A. Takahashi (RIKEN PSC) for their useful comments regarding this manuscript and their technical support. This work was partly supported by a grant from the Ministry of Agriculture, Forestry and Fisheries of Japan (Genomics for Agricultural Innovation, NVR-0005).</p>
</ack>
<sec sec-type="supplementary-material">
<title>Supplementary Material</title>
<supplementary-material id="sm1">
<p>The Supplementary Material for this article can be found online at <uri xlink:href="http://www.frontiersin.org/Plant_Physiology/10.3389/fpls.2011.00040/abstract/">http://www.frontiersin.org/Plant_Physiology/10.3389/fpls.2011.00040/abstract/</uri></p>
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Akiyama</surname> <given-names>K.</given-names></name> <name><surname>Chikayama</surname> <given-names>E.</given-names></name> <name><surname>Yuasa</surname> <given-names>H.</given-names></name> <name><surname>Shimada</surname> <given-names>Y.</given-names></name> <name><surname>Tohge</surname> <given-names>T.</given-names></name> <name><surname>Shinozaki</surname> <given-names>K.</given-names></name> <name><surname>Hirai</surname> <given-names>M. Y.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name> <name><surname>Kikuchi</surname> <given-names>J.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name></person-group> (<year>2008</year>). <article-title>PRIMe: a web site that assembles tools for metabolomics and transcriptomics</article-title>. <source>In Silico Biol</source>. <volume>8</volume>, <fpage>339</fpage>&#x02013;<lpage>345</lpage>.<pub-id pub-id-type="pmid">19032166</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Altschul</surname> <given-names>S. F.</given-names></name> <name><surname>Erickson</surname> <given-names>B. W.</given-names></name></person-group> (<year>1985</year>). <article-title>Significance of nucleotide sequence alignments: a method for random sequence permutation that preserves dinucleotide and codon usage</article-title>. <source>Mol. Biol. Evol.</source> <volume>2</volume>, <fpage>526</fpage>&#x02013;<lpage>538</lpage>.<pub-id pub-id-type="pmid">3870875</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bednarek</surname> <given-names>P.</given-names></name> <name><surname>Osbourn</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Plant-microbe interactions: chemical diversity in plant defense</article-title>. <source>Science</source> <volume>324</volume>, <fpage>746</fpage>&#x02013;<lpage>748</lpage>.<pub-id pub-id-type="doi">10.1126/science.1171661</pub-id><pub-id pub-id-type="pmid">19423814</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bottcher</surname> <given-names>C.</given-names></name> <name><surname>Roepenack-Lahaye</surname> <given-names>E. V.</given-names></name> <name><surname>Willscher</surname> <given-names>E.</given-names></name> <name><surname>Scheel</surname> <given-names>D.</given-names></name> <name><surname>Clemens</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Evaluation of matrix effects in metabolite profiling based on capillary liquid chromatography electrospray ionization quadrupole time-of-flight mass spectrometry</article-title>. <source>Anal. Chem.</source> <volume>79</volume>, <fpage>1507</fpage>&#x02013;<lpage>1513</lpage>.<pub-id pub-id-type="doi">10.1021/ac061037q</pub-id><pub-id pub-id-type="pmid">17297948</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burlat</surname> <given-names>V.</given-names></name> <name><surname>Kwon</surname> <given-names>M.</given-names></name> <name><surname>Davin</surname> <given-names>L. B.</given-names></name> <name><surname>Lewis</surname> <given-names>N. G.</given-names></name></person-group> (<year>2001</year>). <article-title>Dirigent proteins and dirigent sites in lignifying tissues</article-title>. <source>Phytochemistry</source> <volume>57</volume>, <fpage>883</fpage>&#x02013;<lpage>897</lpage>.<pub-id pub-id-type="doi">10.1016/S0031-9422(01)00117-0</pub-id><pub-id pub-id-type="pmid">11423139</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>R. M.</given-names></name> <name><surname>Schweikert</surname> <given-names>G.</given-names></name> <name><surname>Toomajian</surname> <given-names>C.</given-names></name> <name><surname>Ossowski</surname> <given-names>S.</given-names></name> <name><surname>Zeller</surname> <given-names>G.</given-names></name> <name><surname>Shinn</surname> <given-names>P.</given-names></name> <name><surname>Warthmann</surname> <given-names>N.</given-names></name> <name><surname>Hu</surname> <given-names>T. T.</given-names></name> <name><surname>Fu</surname> <given-names>G.</given-names></name> <name><surname>Hinds</surname> <given-names>D. A.</given-names></name> <name><surname>Chen</surname> <given-names>H.</given-names></name> <name><surname>Frazer</surname> <given-names>K. A.</given-names></name> <name><surname>Huson</surname> <given-names>D. H.</given-names></name> <name><surname>Scholkopf</surname> <given-names>B.</given-names></name> <name><surname>Nordborg</surname> <given-names>M.</given-names></name> <name><surname>Ratsch</surname> <given-names>G.</given-names></name> <name><surname>Ecker</surname> <given-names>J. R.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana</article-title>. <source>Science</source> <volume>317</volume>, <fpage>338</fpage>&#x02013;<lpage>342</lpage>.<pub-id pub-id-type="doi">10.1126/science.1138632</pub-id><pub-id pub-id-type="pmid">17641193</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davin</surname> <given-names>L. B.</given-names></name> <name><surname>Lewis</surname> <given-names>N. G.</given-names></name></person-group> (<year>2005</year>). <article-title>Lignin primary structures and dirigent sites</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>16</volume>, <fpage>407</fpage>&#x02013;<lpage>415</lpage>.<pub-id pub-id-type="doi">10.1016/j.copbio.2005.06.011</pub-id><pub-id pub-id-type="pmid">16023847</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Kraker</surname> <given-names>J. W.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>From amino acid to glucosinolate biosynthesis: protein sequence changes in the evolution of methylthioalkylmalate synthase in Arabidopsis</article-title>. <source>Plant Cell</source> <volume>23</volume>, <fpage>38</fpage>&#x02013;<lpage>53</lpage>.<pub-id pub-id-type="doi">10.1105/tpc.110.079269</pub-id><pub-id pub-id-type="pmid">21205930</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Vos</surname> <given-names>R. C.</given-names></name> <name><surname>Moco</surname> <given-names>S.</given-names></name> <name><surname>Lommen</surname> <given-names>A.</given-names></name> <name><surname>Keurentjes</surname> <given-names>J. J.</given-names></name> <name><surname>Bino</surname> <given-names>R. J.</given-names></name> <name><surname>Hall</surname> <given-names>R. D.</given-names></name></person-group> (<year>2007</year>). <article-title>Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry</article-title>. <source>Nat. Protoc.</source> <volume>2</volume>, <fpage>778</fpage>&#x02013;<lpage>791</lpage>.<pub-id pub-id-type="doi">10.1038/nprot.2007.95</pub-id><pub-id pub-id-type="pmid">17446877</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Degtyarenko</surname> <given-names>K.</given-names></name> <name><surname>De Matos</surname> <given-names>P.</given-names></name> <name><surname>Ennis</surname> <given-names>M.</given-names></name> <name><surname>Hastings</surname> <given-names>J.</given-names></name> <name><surname>Zbinden</surname> <given-names>M.</given-names></name> <name><surname>Mcnaught</surname> <given-names>A.</given-names></name> <name><surname>Alcantara</surname> <given-names>R.</given-names></name> <name><surname>Darsow</surname> <given-names>M.</given-names></name> <name><surname>Guedj</surname> <given-names>M.</given-names></name> <name><surname>Ashburner</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>ChEBI: a database and ontology for chemical entities of biological interest</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>, <fpage>D344</fpage>&#x02013;<lpage>350</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkm791</pub-id><pub-id pub-id-type="pmid">17932057</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fiehn</surname> <given-names>O.</given-names></name> <name><surname>Robertson</surname> <given-names>D.</given-names></name> <name><surname>Griffin</surname> <given-names>J.</given-names></name> <name><surname>Werf</surname> <given-names>M. V. D.</given-names></name> <name><surname>Nikolau</surname> <given-names>B.</given-names></name> <name><surname>Morrison</surname> <given-names>N.</given-names></name> <name><surname>Sumner</surname> <given-names>L. W.</given-names></name> <name><surname>Goodacre</surname> <given-names>R.</given-names></name> <name><surname>Hardy</surname> <given-names>N. W.</given-names></name> <name><surname>Taylor</surname> <given-names>C.</given-names></name> <name><surname>Fostel</surname> <given-names>J.</given-names></name> <name><surname>Kristal</surname> <given-names>B.</given-names></name> <name><surname>Kaddurah-Daouk</surname> <given-names>R.</given-names></name> <name><surname>Mendes</surname> <given-names>P.</given-names></name> <name><surname>Ommen</surname> <given-names>B. V.</given-names></name> <name><surname>Lindon</surname> <given-names>J. C.</given-names></name> <name><surname>Sansone</surname> <given-names>S.-A.</given-names></name></person-group> (<year>2007</year>). <article-title>The metabolomics standards initiative (MSI)</article-title>. <source>Metabolomics</source> <volume>3</volume>, <fpage>175</fpage>&#x02013;<lpage>178</lpage>.<pub-id pub-id-type="doi">10.1007/s11306-007-0070-6</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horai</surname> <given-names>H.</given-names></name> <name><surname>Arita</surname> <given-names>M.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name> <name><surname>Nihei</surname> <given-names>Y.</given-names></name> <name><surname>Ikeda</surname> <given-names>T.</given-names></name> <name><surname>Suwa</surname> <given-names>K.</given-names></name> <name><surname>Ojima</surname> <given-names>Y.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Tanaka</surname> <given-names>S.</given-names></name> <name><surname>Aoshima</surname> <given-names>K.</given-names></name> <name><surname>Oda</surname> <given-names>Y.</given-names></name> <name><surname>Kakazu</surname> <given-names>Y.</given-names></name> <name><surname>Kusano</surname> <given-names>M.</given-names></name> <name><surname>Tohge</surname> <given-names>T.</given-names></name> <name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Sawada</surname> <given-names>Y.</given-names></name> <name><surname>Hirai</surname> <given-names>M. Y.</given-names></name> <name><surname>Nakanishi</surname> <given-names>H.</given-names></name> <name><surname>Ikeda</surname> <given-names>K.</given-names></name> <name><surname>Akimoto</surname> <given-names>N.</given-names></name> <name><surname>Maoka</surname> <given-names>T.</given-names></name> <name><surname>Takahashi</surname> <given-names>H.</given-names></name> <name><surname>Ara</surname> <given-names>T.</given-names></name> <name><surname>Sakurai</surname> <given-names>N.</given-names></name> <name><surname>Suzuki</surname> <given-names>H.</given-names></name> <name><surname>Shibata</surname> <given-names>D.</given-names></name> <name><surname>Neumann</surname> <given-names>S.</given-names></name> <name><surname>Iida</surname> <given-names>T.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Funatsu</surname> <given-names>K.</given-names></name> <name><surname>Matsuura</surname> <given-names>F.</given-names></name> <name><surname>Soga</surname> <given-names>T.</given-names></name> <name><surname>Taguchi</surname> <given-names>R.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Nishioka</surname> <given-names>T.</given-names></name></person-group> (<year>2010</year>). <article-title>MassBank: a public repository for sharing mass spectral data for life sciences</article-title>. <source>J. Mass Spectrom.</source> <volume>45</volume>, <fpage>703</fpage>&#x02013;<lpage>714</lpage>.<pub-id pub-id-type="doi">10.1002/jms.1777</pub-id><pub-id pub-id-type="pmid">20623627</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iijima</surname> <given-names>Y.</given-names></name> <name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Ogata</surname> <given-names>Y.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Sakurai</surname> <given-names>N.</given-names></name> <name><surname>Suda</surname> <given-names>K.</given-names></name> <name><surname>Suzuki</surname> <given-names>T.</given-names></name> <name><surname>Suzuki</surname> <given-names>H.</given-names></name> <name><surname>Okazaki</surname> <given-names>K.</given-names></name> <name><surname>Kitayama</surname> <given-names>M.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name> <name><surname>Aoki</surname> <given-names>K.</given-names></name> <name><surname>Shibata</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Metabolite annotations based on the integration of mass spectral information</article-title>. <source>Plant J.</source> <volume>54</volume>, <fpage>949</fpage>&#x02013;<lpage>962</lpage>.<pub-id pub-id-type="doi">10.1111/j.1365-313X.2008.03434.x</pub-id><pub-id pub-id-type="pmid">18266924</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Janowitz</surname> <given-names>T.</given-names></name> <name><surname>Trompetter</surname> <given-names>I.</given-names></name> <name><surname>Piotrowski</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Evolution of nitrilases in glucosinolate-containing plants</article-title>. <source>Phytochemistry</source> <volume>70</volume>, <fpage>1680</fpage>&#x02013;<lpage>1686</lpage>.<pub-id pub-id-type="doi">10.1016/j.phytochem.2009.07.028</pub-id><pub-id pub-id-type="pmid">19698961</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Araki</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name> <name><surname>Hattori</surname> <given-names>M.</given-names></name> <name><surname>Hirakawa</surname> <given-names>M.</given-names></name> <name><surname>Itoh</surname> <given-names>M.</given-names></name> <name><surname>Katayama</surname> <given-names>T.</given-names></name> <name><surname>Kawashima</surname> <given-names>S.</given-names></name> <name><surname>Okuda</surname> <given-names>S.</given-names></name> <name><surname>Tokimatsu</surname> <given-names>T.</given-names></name> <name><surname>Yamanishi</surname> <given-names>Y.</given-names></name></person-group> (<year>2008</year>). <article-title>KEGG for linking genomes to life and the environment</article-title>. <source>Nucleic Acids Res.</source> <volume>36</volume>, <fpage>D480</fpage>&#x02013;<lpage>D484</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkm882</pub-id><pub-id pub-id-type="pmid">18077471</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keurentjes</surname> <given-names>J. J.</given-names></name> <name><surname>Fu</surname> <given-names>J.</given-names></name> <name><surname>De Vos</surname> <given-names>C. H.</given-names></name> <name><surname>Lommen</surname> <given-names>A.</given-names></name> <name><surname>Hall</surname> <given-names>R. D.</given-names></name> <name><surname>Bino</surname> <given-names>R. J.</given-names></name> <name><surname>Van Der Plas</surname> <given-names>L. H.</given-names></name> <name><surname>Jansen</surname> <given-names>R. C.</given-names></name> <name><surname>Vreugdenhil</surname> <given-names>D.</given-names></name> <name><surname>Koornneef</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). <article-title>The genetics of plant metabolism</article-title>. <source>Nat. Genet.</source> <volume>38</volume>, <fpage>842</fpage>&#x02013;<lpage>849</lpage>.<pub-id pub-id-type="doi">10.1038/ng1815</pub-id><pub-id pub-id-type="pmid">16751770</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kind</surname> <given-names>T.</given-names></name> <name><surname>Fiehn</surname> <given-names>O.</given-names></name></person-group> (<year>2010</year>). <article-title>Advances in structure elucidation of small molecules using mass spectrometry</article-title>. <source>Bioanal. Rev.</source> <volume>2</volume>, <fpage>23</fpage>&#x02013;<lpage>60</lpage>.<pub-id pub-id-type="doi">10.1007/s12566-010-0015-9</pub-id><pub-id pub-id-type="pmid">21289855</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kliebenstein</surname> <given-names>D. J.</given-names></name> <name><surname>Lambrix</surname> <given-names>V. M.</given-names></name> <name><surname>Reichelt</surname> <given-names>M.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name> <name><surname>Mitchell-Olds</surname> <given-names>T.</given-names></name></person-group> (<year>2001</year>). <article-title>Gene duplication in the diversification of secondary metabolism: tandem 2-oxoglutarate-dependent dioxygenases control glucosinolate biosynthesis in Arabidopsis</article-title>. <source>Plant Cell</source> <volume>13</volume>, <fpage>681</fpage>&#x02013;<lpage>693</lpage>.<pub-id pub-id-type="doi">10.1105/tpc.13.3.681</pub-id><pub-id pub-id-type="pmid">11251105</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Hansen</surname> <given-names>B. G.</given-names></name> <name><surname>Ober</surname> <given-names>J. A.</given-names></name> <name><surname>Kliebenstein</surname> <given-names>D. J.</given-names></name> <name><surname>Halkier</surname> <given-names>B. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Subclade of flavin-monooxygenases involved in aliphatic glucosinolate biosynthesis</article-title>. <source>Plant Physiol.</source> <volume>148</volume>, <fpage>1721</fpage>&#x02013;<lpage>1733</lpage>.<pub-id pub-id-type="doi">10.1104/pp.108.123471</pub-id><pub-id pub-id-type="pmid">18799661</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lommen</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>MetAlign: interface-driven, versatile metabolomics tool for hyphenated full-scan mass spectrometry data preprocessing</article-title>. <source>Anal. Chem.</source> <volume>81</volume>, <fpage>3079</fpage>&#x02013;<lpage>3086</lpage>.<pub-id pub-id-type="doi">10.1021/ac900036d</pub-id><pub-id pub-id-type="pmid">19301908</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manzaneda</surname> <given-names>A. J.</given-names></name> <name><surname>Prasad</surname> <given-names>K. V.</given-names></name> <name><surname>Mitchell-Olds</surname> <given-names>T.</given-names></name></person-group> (<year>2010</year>). <article-title>Variation and fitness costs for tolerance to different types of herbivore damage in Boechera stricta genotypes with contrasting glucosinolate structures</article-title>. <source>New Phytol.</source> <volume>188</volume>, <fpage>464</fpage>&#x02013;<lpage>477</lpage>.<pub-id pub-id-type="doi">10.1111/j.1469-8137.2010.03385.x</pub-id><pub-id pub-id-type="pmid">20663059</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Hirai</surname> <given-names>M. Y.</given-names></name> <name><surname>Sasaki</surname> <given-names>E.</given-names></name> <name><surname>Akiyama</surname> <given-names>K.</given-names></name> <name><surname>Yonekura-Sakakibara</surname> <given-names>K.</given-names></name> <name><surname>Provart</surname> <given-names>N. J.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name> <name><surname>Shimada</surname> <given-names>Y.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name></person-group> (<year>2010a</year>). <article-title>AtMetExpress development: a phytochemical atlas of Arabidopsis development</article-title>. <source>Plant Physiol.</source> <volume>152</volume>, <fpage>566</fpage>&#x02013;<lpage>578</lpage>.<pub-id pub-id-type="doi">10.1104/pp.109.148031</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Ishihara</surname> <given-names>A.</given-names></name> <name><surname>Takanashi</surname> <given-names>K.</given-names></name> <name><surname>Morino</surname> <given-names>K.</given-names></name> <name><surname>Miyazawa</surname> <given-names>H.</given-names></name> <name><surname>Wakasa</surname> <given-names>K.</given-names></name> <name><surname>Miyagawa</surname> <given-names>H.</given-names></name></person-group> (<year>2010b</year>). <article-title>Metabolic profiling analysis of genetically modified rice seedlings that overproduce tryptophan reveals the occurrence of its inter-tissue translocation</article-title>. <source>Plant Biotechnol.</source> <volume>27</volume>, <fpage>17</fpage>&#x02013;<lpage>27</lpage>.<pub-id pub-id-type="doi">10.5511/plantbiotechnology.27.17</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Redestig</surname> <given-names>H.</given-names></name> <name><surname>Sawada</surname> <given-names>Y.</given-names></name> <name><surname>Shinbo</surname> <given-names>Y.</given-names></name> <name><surname>Hirai</surname> <given-names>M. Y.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name></person-group> (<year>2009a</year>). <article-title>Visualization of metabolite identifier information</article-title>. <source>Plant Biotechnol.</source> <volume>26</volume>, <fpage>479</fpage>&#x02013;<lpage>483</lpage>.<pub-id pub-id-type="doi">10.5511/plantbiotechnology.26.479</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Shinbo</surname> <given-names>Y.</given-names></name> <name><surname>Oikawa</surname> <given-names>A.</given-names></name> <name><surname>Hira</surname> <given-names>M. Y.</given-names></name> <name><surname>Fiehn</surname> <given-names>O.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name></person-group> (<year>2009b</year>). <article-title>Assessment of metabolome annotation quality: a method for evaluating the false discovery rate of elemental composition searches</article-title>. <source>PLoS ONE</source> <volume>4</volume>, <fpage>e7490</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pone.0007490</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsuda</surname> <given-names>F.</given-names></name> <name><surname>Yonekura-Sakakibara</surname> <given-names>K.</given-names></name> <name><surname>Niida</surname> <given-names>R.</given-names></name> <name><surname>Kuromori</surname> <given-names>T.</given-names></name> <name><surname>Shinozaki</surname> <given-names>K.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name></person-group> (<year>2009c</year>). <article-title>MS/MS spectral tag (MS2T)-based annotation of non-targeted profile of plant secondary metabolites</article-title>. <source>Plant J.</source> <volume>57</volume>, <fpage>555</fpage>&#x02013;<lpage>577</lpage>.<pub-id pub-id-type="doi">10.1111/j.1365-313X.2008.03663.x</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mihaleva</surname> <given-names>V. V.</given-names></name> <name><surname>Vorst</surname> <given-names>O.</given-names></name> <name><surname>Maliepaard</surname> <given-names>C.</given-names></name> <name><surname>Verhoeven</surname> <given-names>H. A.</given-names></name> <name><surname>Vos</surname> <given-names>R. C. H. D.</given-names></name> <name><surname>Hall</surname> <given-names>R. D.</given-names></name> <name><surname>Ham</surname> <given-names>R. C. H. J. V.</given-names></name></person-group> (<year>2008</year>). <article-title>Accurate mass error correction in liquid chromatography time-of-flight mass spectrometry based metabolomics</article-title>. <source>Metabolomics</source> <volume>4</volume>, <fpage>171</fpage>&#x02013;<lpage>182</lpage>.<pub-id pub-id-type="doi">10.1007/s11306-008-0108-4</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moco</surname> <given-names>S.</given-names></name> <name><surname>Bino</surname> <given-names>R. J.</given-names></name> <name><surname>Vorst</surname> <given-names>O.</given-names></name> <name><surname>Verhoeven</surname> <given-names>H. A.</given-names></name> <name><surname>De Groot</surname> <given-names>J.</given-names></name> <name><surname>Van Beek</surname> <given-names>T. A.</given-names></name> <name><surname>Vervoort</surname> <given-names>J.</given-names></name> <name><surname>De Vos</surname> <given-names>C. H.</given-names></name></person-group> (<year>2006</year>). <article-title>A liquid chromatography-mass spectrometry-based metabolome database for tomato</article-title>. <source>Plant Physiol.</source> <volume>141</volume>, <fpage>1205</fpage>&#x02013;<lpage>1218</lpage>.<pub-id pub-id-type="doi">10.1104/pp.106.078428</pub-id><pub-id pub-id-type="pmid">16896233</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moco</surname> <given-names>S.</given-names></name> <name><surname>Bino</surname> <given-names>R. J.</given-names></name> <name><surname>Vos</surname> <given-names>R. C. H. D.</given-names></name> <name><surname>Vervoort</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Metabolomics technologies and metabolite identification</article-title>. <source>Trends Analyt. Chem.</source> <volume>26</volume>, <fpage>855</fpage>&#x02013;<lpage>866</lpage>.<pub-id pub-id-type="doi">10.1016/j.trac.2007.08.003</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mylonas</surname> <given-names>R.</given-names></name> <name><surname>Mauron</surname> <given-names>Y.</given-names></name> <name><surname>Masselot</surname> <given-names>A.</given-names></name> <name><surname>Binz</surname> <given-names>P. A.</given-names></name> <name><surname>Budin</surname> <given-names>N.</given-names></name> <name><surname>Fathi</surname> <given-names>M.</given-names></name> <name><surname>Viette</surname> <given-names>V.</given-names></name> <name><surname>Hochstrasser</surname> <given-names>D. F.</given-names></name> <name><surname>Lisacek</surname> <given-names>F.</given-names></name></person-group> (<year>2009</year>). <article-title>X-Rank: a robust algorithm for small molecule identification using tandem mass spectrometry</article-title>. <source>Anal. Chem.</source> <volume>81</volume>, <fpage>7604</fpage>&#x02013;<lpage>7610</lpage>.<pub-id pub-id-type="doi">10.1021/ac900954d</pub-id><pub-id pub-id-type="pmid">19702277</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakabayashi</surname> <given-names>R.</given-names></name> <name><surname>Kusano</surname> <given-names>M.</given-names></name> <name><surname>Kobayashi</surname> <given-names>M.</given-names></name> <name><surname>Tohge</surname> <given-names>T.</given-names></name> <name><surname>Yonekura-Sakakibara</surname> <given-names>K.</given-names></name> <name><surname>Kogure</surname> <given-names>N.</given-names></name> <name><surname>Yamazaki</surname> <given-names>M.</given-names></name> <name><surname>Kitajima</surname> <given-names>M.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Takayama</surname> <given-names>H.</given-names></name></person-group> (<year>2009</year>). <article-title>Metabolomics-oriented isolation and structure elucidation of 37 compounds including two anthocyanins from Arabidopsis thaliana</article-title>. <source>Phytochemistry</source> <volume>70</volume>, <fpage>1017</fpage>&#x02013;<lpage>1029</lpage>.<pub-id pub-id-type="doi">10.1016/j.phytochem.2009.03.021</pub-id><pub-id pub-id-type="pmid">19497599</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakatsubo</surname> <given-names>T.</given-names></name> <name><surname>Mizutani</surname> <given-names>M.</given-names></name> <name><surname>Suzuki</surname> <given-names>S.</given-names></name> <name><surname>Hattori</surname> <given-names>T.</given-names></name> <name><surname>Umezawa</surname> <given-names>T.</given-names></name></person-group> (<year>2008</year>). <article-title>Characterization of Arabidopsis thaliana pinoresinol reductase, a new type of enzyme involved in lignan biosynthesis</article-title>. <source>J. Biol. Chem.</source> <volume>283</volume>, <fpage>15550</fpage>&#x02013;<lpage>15557</lpage>.<pub-id pub-id-type="doi">10.1074/jbc.M801131200</pub-id><pub-id pub-id-type="pmid">18347017</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neumann</surname> <given-names>S.</given-names></name> <name><surname>Bocker</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Computational mass spectrometry for metabolomics: identification of metabolites and small molecules</article-title>. <source>Anal. Bioanal. Chem.</source> <volume>398</volume>, <fpage>2779</fpage>&#x02013;<lpage>2788</lpage>.<pub-id pub-id-type="doi">10.1007/s00216-010-4142-5</pub-id><pub-id pub-id-type="pmid">20936272</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ossowski</surname> <given-names>S.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Clark</surname> <given-names>R. M.</given-names></name> <name><surname>Lanz</surname> <given-names>C.</given-names></name> <name><surname>Warthmann</surname> <given-names>N.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Sequencing of natural strains of Arabidopsis thaliana with short reads</article-title>. <source>Genome Res.</source> <volume>18</volume>, <fpage>2024</fpage>&#x02013;<lpage>2033</lpage>.<pub-id pub-id-type="doi">10.1101/gr.080200.108</pub-id><pub-id pub-id-type="pmid">18818371</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plantegenet</surname> <given-names>S.</given-names></name> <name><surname>Weber</surname> <given-names>J.</given-names></name> <name><surname>Goldstein</surname> <given-names>D. R.</given-names></name> <name><surname>Zeller</surname> <given-names>G.</given-names></name> <name><surname>Nussbaumer</surname> <given-names>C.</given-names></name> <name><surname>Thomas</surname> <given-names>J.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name> <name><surname>Harshman</surname> <given-names>K.</given-names></name> <name><surname>Hardtke</surname> <given-names>C. S.</given-names></name></person-group> (<year>2009</year>). <article-title>Comprehensive analysis of Arabidopsis expression level polymorphisms with simple inheritance</article-title>. <source>Mol. Syst. Biol.</source> <volume>5</volume>, <fpage>242</fpage>.<pub-id pub-id-type="doi">10.1038/msb.2008.79</pub-id><pub-id pub-id-type="pmid">19225455</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poole</surname> <given-names>R. L.</given-names></name></person-group> (<year>2007</year>). <article-title>The TAIR database</article-title>. <source>Methods Mol. Biol.</source> <volume>406</volume>, <fpage>179</fpage>&#x02013;<lpage>212</lpage>.<pub-id pub-id-type="doi">10.1007/978-1-59745-535-0_8</pub-id><pub-id pub-id-type="pmid">18287693</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saeed</surname> <given-names>A. I.</given-names></name> <name><surname>Bhagabati</surname> <given-names>N. K.</given-names></name> <name><surname>Braisted</surname> <given-names>J. C.</given-names></name> <name><surname>Liang</surname> <given-names>W.</given-names></name> <name><surname>Sharov</surname> <given-names>V.</given-names></name> <name><surname>Howe</surname> <given-names>E. A.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Thiagarajan</surname> <given-names>M.</given-names></name> <name><surname>White</surname> <given-names>J. A.</given-names></name> <name><surname>Quackenbush</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>TM4 microarray software suite</article-title>. <source>Meth. Enzymol.</source> <volume>411</volume>, <fpage>134</fpage>&#x02013;<lpage>193</lpage>.<pub-id pub-id-type="doi">10.1016/S0076-6879(06)11009-5</pub-id><pub-id pub-id-type="pmid">16939790</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saeed</surname> <given-names>A. I.</given-names></name> <name><surname>Sharov</surname> <given-names>V.</given-names></name> <name><surname>White</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>W.</given-names></name> <name><surname>Bhagabati</surname> <given-names>N.</given-names></name> <name><surname>Braisted</surname> <given-names>J.</given-names></name> <name><surname>Klapa</surname> <given-names>M.</given-names></name> <name><surname>Currier</surname> <given-names>T.</given-names></name> <name><surname>Thiagarajan</surname> <given-names>M.</given-names></name> <name><surname>Sturn</surname> <given-names>A.</given-names></name> <name><surname>Snuffin</surname> <given-names>M.</given-names></name> <name><surname>Rezantsev</surname> <given-names>A.</given-names></name> <name><surname>Popov</surname> <given-names>D.</given-names></name> <name><surname>Ryltsov</surname> <given-names>A.</given-names></name> <name><surname>Kostukovich</surname> <given-names>E.</given-names></name> <name><surname>Borisovsky</surname> <given-names>I.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Vinsavich</surname> <given-names>A.</given-names></name> <name><surname>Trush</surname> <given-names>V.</given-names></name> <name><surname>Quackenbush</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>TM4: a free, open-source system for microarray data management and analysis</article-title>. <source>Biotechniques</source> <volume>34</volume>, <fpage>374</fpage>&#x02013;<lpage>378</lpage>.<pub-id pub-id-type="pmid">12613259</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Matsuda</surname> <given-names>F.</given-names></name></person-group> (<year>2010</year>). <article-title>Metabolomics for functional genomics, systems biology, and biotechnology</article-title>. <source>Annu. Rev. Plant Biol.</source> <volume>61</volume>, <fpage>463</fpage>&#x02013;<lpage>489</lpage>.<pub-id pub-id-type="doi">10.1146/annurev.arplant.043008.092035</pub-id><pub-id pub-id-type="pmid">19152489</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sawada</surname> <given-names>Y.</given-names></name> <name><surname>Kuwahara</surname> <given-names>A.</given-names></name> <name><surname>Nagano</surname> <given-names>M.</given-names></name> <name><surname>Narisawa</surname> <given-names>T.</given-names></name> <name><surname>Sakata</surname> <given-names>A.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Hirai</surname> <given-names>M. Y.</given-names></name></person-group> (<year>2009</year>). <article-title>Omics-based approaches to methionine side chain elongation in Arabidopsis: characterization of the genes encoding methylthioalkylmalate isomerase and methylthioalkylmalate dehydrogenase</article-title>. <source>Plant Cell Physiol.</source> <volume>50</volume>, <fpage>1181</fpage>&#x02013;<lpage>1190</lpage>.<pub-id pub-id-type="doi">10.1093/pcp/pcp058</pub-id><pub-id pub-id-type="pmid">19493961</pub-id></citation></ref>
<ref id="B41"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Shinbo</surname> <given-names>Y.</given-names></name> <name><surname>Nakamura</surname> <given-names>Y.</given-names></name> <name><surname>Altaf-Ul-Amin</surname> <given-names>M.</given-names></name> <name><surname>Asahi</surname> <given-names>H.</given-names></name> <name><surname>Kurokawa</surname> <given-names>K.</given-names></name> <name><surname>Arita</surname> <given-names>M.</given-names></name> <name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Ohta</surname> <given-names>D.</given-names></name> <name><surname>Shibata</surname> <given-names>D.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>&#x0201C;KNApSAcK: A comprehensive species-metabolite relationship database,&#x0201D;</article-title> in <source>Biotechnology in Agriculture and Forestry 57 Plant Metabolomics</source>, eds <person-group person-group-type="editor"><name><surname>Saito</surname> <given-names>K.</given-names></name> <name><surname>Dixon</surname> <given-names>R. A.</given-names></name> <name><surname>Willmitzer</surname> <given-names>L.</given-names></name></person-group> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>165</fpage>&#x02013;<lpage>181</lpage>.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>S. E.</given-names></name> <name><surname>Scott</surname> <given-names>D. R.</given-names></name></person-group> (<year>1994</year>). <article-title>Optimization and testing of mass-spectral library search algorithms for compound identification</article-title>. <source>J. Am. Soc. Mass Spectrom.</source> <volume>5</volume>, <fpage>859</fpage>&#x02013;<lpage>866</lpage>.<pub-id pub-id-type="doi">10.1016/1044-0305(94)87009-8</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sumner</surname> <given-names>L. W.</given-names></name> <name><surname>Amberg</surname> <given-names>A.</given-names></name> <name><surname>Barrett</surname> <given-names>D.</given-names></name> <name><surname>Beale</surname> <given-names>M. H.</given-names></name> <name><surname>Beger</surname> <given-names>R.</given-names></name> <name><surname>Daykin</surname> <given-names>C. A.</given-names></name> <name><surname>Fan</surname> <given-names>T. W.-M.</given-names></name> <name><surname>Fiehn</surname> <given-names>O.</given-names></name> <name><surname>Goodacre</surname> <given-names>R.</given-names></name> <name><surname>Griffin</surname> <given-names>J. L.</given-names></name> <name><surname>Hankemeier</surname> <given-names>T.</given-names></name> <name><surname>Hardy</surname> <given-names>N.</given-names></name> <name><surname>Harnly</surname> <given-names>J.</given-names></name> <name><surname>Higashi</surname> <given-names>R.</given-names></name> <name><surname>Kopka</surname> <given-names>J.</given-names></name> <name><surname>Lane</surname> <given-names>A. N.</given-names></name> <name><surname>Lindon</surname> <given-names>J. C.</given-names></name> <name><surname>Marriott</surname> <given-names>P.</given-names></name> <name><surname>Nicholls</surname> <given-names>A. W.</given-names></name> <name><surname>Reily</surname> <given-names>M. D.</given-names></name> <name><surname>Thaden</surname> <given-names>J. J.</given-names></name> <name><surname>Viant</surname> <given-names>M. R.</given-names></name></person-group> (<year>2007</year>). <article-title>Proposed minimum reporting standards for chemical analysis</article-title>. <source>Metabolomics</source> <volume>3</volume>, <fpage>211</fpage>&#x02013;<lpage>221</lpage>.<pub-id pub-id-type="doi">10.1007/s11306-007-0082-2</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takahashi</surname> <given-names>H.</given-names></name> <name><surname>Kai</surname> <given-names>K.</given-names></name> <name><surname>Shinbo</surname> <given-names>Y.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Ohta</surname> <given-names>D.</given-names></name> <name><surname>Oshima</surname> <given-names>T.</given-names></name> <name><surname>Altaf-Ul-Amin</surname> <given-names>M.</given-names></name> <name><surname>Kurokawa</surname> <given-names>K.</given-names></name> <name><surname>Ogasawara</surname> <given-names>N.</given-names></name> <name><surname>Kanaya</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Metabolomics approach for determining growth-specific metabolites based on Fourier transform ion cyclotron resonance mass spectrometry</article-title>. <source>Anal. Bioanal. Chem.</source> <volume>391</volume>, <fpage>2769</fpage>&#x02013;<lpage>2782</lpage>.<pub-id pub-id-type="doi">10.1007/s00216-008-2195-5</pub-id><pub-id pub-id-type="pmid">18560811</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weigel</surname> <given-names>D.</given-names></name> <name><surname>Mott</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>The 1001 genomes project for Arabidopsis thaliana</article-title>. <source>Genome Biol.</source> <volume>10</volume>, <fpage>107</fpage>.<pub-id pub-id-type="doi">10.1186/gb-2009-10-10-r107</pub-id><pub-id pub-id-type="pmid">19519932</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wentzell</surname> <given-names>A. M.</given-names></name> <name><surname>Rowe</surname> <given-names>H. C.</given-names></name> <name><surname>Hansen</surname> <given-names>B. G.</given-names></name> <name><surname>Ticconi</surname> <given-names>C.</given-names></name> <name><surname>Halkier</surname> <given-names>B. A.</given-names></name> <name><surname>Kliebenstein</surname> <given-names>D. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Linking metabolic QTLs with network and cis-eQTLs controlling biosynthetic pathways</article-title>. <source>PLoS Genet.</source> <volume>3</volume>, <fpage>e162</fpage>.<pub-id pub-id-type="doi">10.1371/journal.pgen.0030162</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wishart</surname> <given-names>D. S.</given-names></name> <name><surname>Tzur</surname> <given-names>D.</given-names></name> <name><surname>Knox</surname> <given-names>C.</given-names></name> <name><surname>Eisner</surname> <given-names>R.</given-names></name> <name><surname>Guo</surname> <given-names>A. C.</given-names></name> <name><surname>Young</surname> <given-names>N.</given-names></name> <name><surname>Cheng</surname> <given-names>D.</given-names></name> <name><surname>Jewell</surname> <given-names>K.</given-names></name> <name><surname>Arndt</surname> <given-names>D.</given-names></name> <name><surname>Sawhney</surname> <given-names>S.</given-names></name> <name><surname>Fung</surname> <given-names>C.</given-names></name> <name><surname>Nikolai</surname> <given-names>L.</given-names></name> <name><surname>Lewis</surname> <given-names>M.</given-names></name> <name><surname>Coutouly</surname> <given-names>M. A.</given-names></name> <name><surname>Forsythe</surname> <given-names>I.</given-names></name> <name><surname>Tang</surname> <given-names>P.</given-names></name> <name><surname>Shrivastava</surname> <given-names>S.</given-names></name> <name><surname>Jeroncic</surname> <given-names>K.</given-names></name> <name><surname>Stothard</surname> <given-names>P.</given-names></name> <name><surname>Amegbey</surname> <given-names>G.</given-names></name> <name><surname>Block</surname> <given-names>D.</given-names></name> <name><surname>Hau</surname> <given-names>D. D.</given-names></name> <name><surname>Wagner</surname> <given-names>J.</given-names></name> <name><surname>Miniaci</surname> <given-names>J.</given-names></name> <name><surname>Clements</surname> <given-names>M.</given-names></name> <name><surname>Gebremedhin</surname> <given-names>M.</given-names></name> <name><surname>Guo</surname> <given-names>N.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Duggan</surname> <given-names>G. E.</given-names></name> <name><surname>Macinnis</surname> <given-names>G. D.</given-names></name> <name><surname>Weljie</surname> <given-names>A. M.</given-names></name> <name><surname>Dowlatabadi</surname> <given-names>R.</given-names></name> <name><surname>Bamforth</surname> <given-names>F.</given-names></name> <name><surname>Clive</surname> <given-names>D.</given-names></name> <name><surname>Greiner</surname> <given-names>R.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Marrie</surname> <given-names>T.</given-names></name> <name><surname>Sykes</surname> <given-names>B. D.</given-names></name> <name><surname>Vogel</surname> <given-names>H. J.</given-names></name> <name><surname>Querengesser</surname> <given-names>L.</given-names></name></person-group> (<year>2007</year>). <article-title>HMDB: the human metabolome database</article-title>. <source>Nucleic Acids Res.</source> <volume>35</volume>, <fpage>D521</fpage>&#x02013;<lpage>D526</lpage>.<pub-id pub-id-type="doi">10.1093/nar/gkl923</pub-id><pub-id pub-id-type="pmid">17202168</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeller</surname> <given-names>G.</given-names></name> <name><surname>Clark</surname> <given-names>R. M.</given-names></name> <name><surname>Schneeberger</surname> <given-names>K.</given-names></name> <name><surname>Bohlen</surname> <given-names>A.</given-names></name> <name><surname>Weigel</surname> <given-names>D.</given-names></name> <name><surname>Ratsch</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Detecting polymorphic regions in Arabidopsis thaliana with resequencing microarrays</article-title>. <source>Genome Res.</source> <volume>18</volume>, <fpage>918</fpage>&#x02013;<lpage>929</lpage>.<pub-id pub-id-type="doi">10.1101/gr.070169.107</pub-id><pub-id pub-id-type="pmid">18323538</pub-id></citation></ref>
</ref-list>
<fn-group><fn id="fn1"><p><sup>1</sup><uri xlink:href="http://prime.psc.riken.jp/">http://prime.psc.riken.jp/</uri></p></fn>
<fn id="fn2"><p><sup>2</sup><uri xlink:href="http://spectra.psc.riken.jp/menta.cgi/analyses/explains?chapter&#x0003D;1">http://spectra.psc.riken.jp/menta.cgi/analyses/explains?chapter&#x0003D;1</uri></p></fn>
</fn-group>
</back>
</article>
