<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Archiving and Interchange DTD v2.3 20070202//EN" "archivearticle.dtd">
<article article-type="methods-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">857839</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2022.857839</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>CPIELA: Computational Prediction of Plant Protein&#x2013;Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information</article-title>
<alt-title alt-title-type="left-running-head">Li et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Prediction of Plant PPIs</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Li</surname>
<given-names>Li-Ping</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1640730/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Zhang</surname>
<given-names>Bo</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Cheng</surname>
<given-names>Li</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>College of Grassland and Environment Sciences</institution>, <institution>Xinjiang Agricultural University</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Xinjiang Key Laboratory of Grassland Resources and Ecology</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>Xinjiang Technical Institute of Physics and Chemistry</institution>, <institution>Chinese Academy of Science</institution>, <addr-line>Urumqi</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/674022/overview">Pu-Feng Du</ext-link>, Tianjin University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/862620/overview">Tiantian He</ext-link>, Agency for Science, Technology and Research (A&#x2217;STAR), Singapore</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/749958/overview">Bin Liu</ext-link>, Beijing Institute of Technology, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Li-Ping Li, <email>cs2bioinformatics@gmail.com</email>; Bo Zhang, <email>xjauzb@sina.com</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>11</day>
<month>03</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>857839</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>01</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>02</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Li, Zhang and Cheng.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Li, Zhang and Cheng</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Identification and characterization of plant protein&#x2013;protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: <italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic>. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant&#x20;PPIs.</p>
</abstract>
<kwd-group>
<kwd>plant</kwd>
<kwd>protein&#x2013;protein interactions</kwd>
<kwd>machine learning</kwd>
<kwd>sequence</kwd>
<kwd>evolutionary information</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Aerospace Science Foundation of China<named-content content-type="fundref-id">10.13039/501100010012</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Plant protein&#x2013;protein interactions (PPIs) participate in almost all aspects of cellular processes such as homeostasis control, signal transduction, organ formation, and plant defense (<xref ref-type="bibr" rid="B33">Morsy et&#x20;al., 2008</xref>; <xref ref-type="bibr" rid="B52">Yuan et&#x20;al., 2008</xref>; <xref ref-type="bibr" rid="B15">Fukao, 2012</xref>; <xref ref-type="bibr" rid="B43">Sheth and Thaker, 2014</xref>; <xref ref-type="bibr" rid="B10">Cheng et&#x20;al., 2021</xref>). Thus, understanding plant PPIs could provide important insights into the pathological processes and the regulation of plant developmental processes. Consequently, constructing a PPI network at the system level is one of the key tasks to elucidate molecular mechanisms. In the past decades, several innovative high-throughput techniques, such as the yeast two-hybrid (Y2H) (<xref ref-type="bibr" rid="B8">Causier and Davies, 2002</xref>), bimolecular fluorescence complementation (BiFC) (<xref ref-type="bibr" rid="B5">Bracha-Drori et&#x20;al., 2010</xref>), affinity purification coupled to mass spectrometry (AP-MS) (<xref ref-type="bibr" rid="B39">Puig et&#x20;al., 2001</xref>), and protein microarrays (<xref ref-type="bibr" rid="B24">Hultschig et&#x20;al., 2006</xref>), have been designed to detect plant PPIs. However, the aforementioned high throughput biological experiments have some unavoidable technical limitations (<xref ref-type="bibr" rid="B53">Yuan-Ke et&#x20;al., 2019</xref>). For example, the number of PPIs obtained by high-throughput biological experiments is still much smaller than the number of expected PPIs (<xref ref-type="bibr" rid="B1">Aloy and Russell, 2004</xref>). It is believed that, for the most studied organisms (yeast), the number of PPIs is still underestimated (<xref ref-type="bibr" rid="B42">Sambourg and Thierry-Mieg, 2010</xref>). Furthermore, the techniques employed to detect plant PPIs are expensive and time-consuming, limiting the wide application of these approaches. In addition, most experimental techniques are often associated with high levels of a false-positive&#x20;rate.</p>
<p>To conquer the disadvantages of previous biological approaches in a rapid and convenient way, computational approaches have become a hot research topic for predicting PPIs in proteomics research (<xref ref-type="bibr" rid="B46">Xiaoli et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B28">Lenz et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B22">He et&#x20;al., 2021a</xref>; <xref ref-type="bibr" rid="B18">Green et&#x20;al., 2021</xref>). In recent years, several public databases have been constructed to store the plant PPIs detected by biological experiments. For example, Dreze et&#x20;al. constructed a proteome-wide binary PPI network of <italic>Arabidopsis thaliana</italic> consisting of more than 6,000 highly reliable PPIs among about 2,700 proteins (<xref ref-type="bibr" rid="B14">Dreze et&#x20;al., 2011</xref>). Over the past decades, several computational methods that predict PPIs have been proposed by exploiting features ranging from network topology, protein sequence, phylogenetic profile, protein domain, and function annotation, among others (<xref ref-type="bibr" rid="B51">You et&#x20;al., 2016a</xref>; <xref ref-type="bibr" rid="B47">Yi et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B30">Liu et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B29">Li et&#x20;al., 2021</xref>). Min et&#x20;al. generated a high-confident database of plant PPIs derived from the published studies and several databases (<xref ref-type="bibr" rid="B32">Min et&#x20;al., 2010</xref>). Ding et&#x20;al. used domain and ortholog identification combination approach to infer the genome-wide protein&#x2013;protein interactions for <italic>Citrus sinensis</italic> (<xref ref-type="bibr" rid="B13">Ding et&#x20;al., 2014</xref>). Geisler-Lee et&#x20;al. presented a PPI network for <italic>Arabidopsis thaliana</italic>, predicted from interacting orthologs in <italic>Caenorhabditis elegans</italic>, <italic>Saccharomyces cerevisiae</italic>, <italic>Homo sapiens</italic>, and <italic>Drosophila melanogaster</italic> (<xref ref-type="bibr" rid="B16">Geisler-Lee et&#x20;al., 2007</xref>). In another work by Brandao et&#x20;al., a user-friendly tool, AtPIN, aggregated information on PPIs of <italic>Arabidopsis thaliana</italic>, sub-cellular localization, and ontology to map PPIs in <italic>Arabidopsis thaliana</italic> (<xref ref-type="bibr" rid="B6">Brand&#xe3;o et&#x20;al., 2009</xref>). Zhu et&#x20;al. constructed a genome-scale PPI network named PRIN in <italic>Oryza sativa</italic> by employing the InParanoid method based on the interolog approach. The PRIN approach integrated more than 533,000 PPIs among about 48,150 proteins from six organisms and detected more than 76,500 predicted rice PPIs among about 5,050 proteins (<xref ref-type="bibr" rid="B55">Zhu et&#x20;al., 2011</xref>).</p>
<p>This work introduces a novel sequence-based computational approach, CPIELA, to predict potential plant protein&#x2013;protein interactions. More specifically, we first converted the plant protein sequence into a position-specific scoring matrix (PSSM). Then, to fully capture the evolutionary information of the plant protein, we performed the local optimal-oriented pattern (LOOP) on the PSSM to extract the local textural descriptor. Although the LOOP algorithm is widely applied in image processing, to the best of our knowledge, this is the first work where LOOP is used in plant biology to predict PPIs. Finally, an efficient and powerful classification model, rotation forest (ROF), is used to identify the possible plant PPIs. The main contributions of this methodology are as follows: 1) based on the evolutionary history of proteins, the proposed method extracts the evolutionary features from the PSSM of the protein with known sequences, enabling our method to have more power for predicting plant PPIs than other sequence-based algorithms; 2) the proposed method does not depend on known PPIs samples and does not bias toward specific subspaces in the examined proteomic space because it directly captures features from the PSSMs of the plant protein sequence; and 3) we applied the ensemble ROF classifier to infer potential plant PPIs, which can truly improve the predictive accuracy compared with existing approaches. The proposed CPIELA method is well investigated on three plant PPIs datasets (<italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic>) and yields high average accuracies of 98.63%, 98.09%, and 94.02%, respectively. In order to further verify the predictive performance of CPIELA, we compare it with the popular support vector machine (SVM) and random forest (RF) classifier. The experimental results illustrated that the CPIELA could be a complementary tool for plant PPIs prediction.</p>
</sec>
<sec sec-type="results|discussion" id="s2">
<title>Results and Discussions</title>
<sec id="s2-1">
<title>Evaluation Measures</title>
<p>In the experiment, the fivefold cross-validation technique is used to evaluate the predictive performance of the CPIELA model. Cross-validation is a widely used approach to estimate the generalization performance of the prediction model. The <italic>k</italic>-fold cross-validation method usually randomly separates the instances into <italic>k</italic> equal-sized disjoint groups. Then, the <italic>k</italic>-1 groups are used as a training dataset, and the remaining group is retained as the testing samples. This process is repeated <italic>k</italic> times. The predictive results of the proposed method are evaluated using five criteria, including precision (Prec.), accuracy (Acc.), sensitivity (Sen.), specificity (Spec.), and Matthews correlation coefficient (MCC). The calculation formulas are listed as follows:<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>c</mml:mi>
<mml:mi>u</mml:mi>
<mml:mo>.</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>n</mml:mi>
<mml:mo>.</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(2)</label>
</disp-formula>
<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mo>.</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(3)</label>
</disp-formula>
<disp-formula id="e4">
<mml:math id="m4">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>p</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>c</mml:mi>
<mml:mo>.</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(4)</label>
</disp-formula>
<disp-formula id="e5">
<mml:math id="m5">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mi>C</mml:mi>
<mml:mi>C</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>T</mml:mi>
<mml:mi>P</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>F</mml:mi>
<mml:mi>N</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(5)</label>
</disp-formula>where <italic>TP</italic>, <italic>FP</italic>, <italic>TN</italic>, and <italic>FN</italic> represent the number of true-positive, false-positive, true-negative, and false-negative samples, respectively. Furthermore, the Receiver Operating Characteristic (ROC) curve is employed to describe and compare the performance of a prediction model (<xref ref-type="bibr" rid="B7">Broadhurst and Kell, 2006</xref>). The <italic>y</italic>-axis and <italic>x</italic>-axis of the ROC curve are the sensitivity (the true positive rate, TPR) and 1&#x20;&#x2212; specificity (the false positive rate, FPR), respectively. The area under the ROC curve (AUC) is a frequently used measure of performance for classification. An AUC of 0.5 means a random classifier, while the ideal value of AUC would be 1.0. For the convenience of presentation, the specific steps of the CPIELA method for identifying plant PPIs are shown in <xref ref-type="fig" rid="F1">Figure&#x20;1</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>The flowchart of the proposed CPIELA method.</p>
</caption>
<graphic xlink:href="fgene-13-857839-g001.tif"/>
</fig>
</sec>
<sec id="s2-2">
<title>Evaluation of Model Predictive Ability</title>
<p>To verify the high predictive performance of the CPIELA model, we performed it on three plant PPIs datasets: <italic>Arabidopsis thaliana</italic>, <italic>Oryza sativa</italic>, and <italic>Zea mays</italic>. To guarantee the stability of the predictive results, the fivefold cross-validation technique is used to estimate the generalization capacity of the proposed learning model. Because the predictive performance of a rotation forest (ROF) ensemble is highly associated with the number <italic>L</italic> of decision trees (DT) and the number <italic>K</italic> of feature subset, a grid search method is conducted for tuning multiple parameters of the RF model. Considering the tradeoff between the computational complexity and accuracy rate, we set the number of decision trees to 3 and the number of feature subsets to 10 for all experiments.</p>
<p>The experimental results on the <italic>Arabidopsis thaliana</italic> dataset are outlined in <xref ref-type="table" rid="T1">Table&#x20;1</xref>. It can be seen from <xref ref-type="table" rid="T1">Table&#x20;1</xref> that the average accuracy of the proposed method is as high as 98.63%. In order to further quantify the prediction performance of the proposed method, some other evaluation measures are calculated. From <xref ref-type="table" rid="T1">Table&#x20;1</xref>, we can observe that the overall sensitivity, precision, specificity, MCC, and AUC are 97.56%, 99.69%, 99.70%, 97.30%, and 0.9954, respectively. The standard deviations of them are 0.43%, 0.10%, 0.09%, 0.42%, and 0.0009, respectively.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>The fivefold cross-validation results achieved on the <italic>A. thaliana</italic> dataset using the proposed CPIELA method.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Testing set</th>
<th align="center">Accu. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">Spec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1</td>
<td align="center">98.43</td>
<td align="center">97.23</td>
<td align="center">99.56</td>
<td align="center">99.58</td>
<td align="center">96.90</td>
<td align="char" char=".">0.9957</td>
</tr>
<tr>
<td align="left">2</td>
<td align="center">98.78</td>
<td align="center">97.99</td>
<td align="center">99.61</td>
<td align="center">99.60</td>
<td align="center">97.59</td>
<td align="char" char=".">0.9961</td>
</tr>
<tr>
<td align="left">3</td>
<td align="center">98.39</td>
<td align="center">97.04</td>
<td align="center">99.76</td>
<td align="center">99.77</td>
<td align="center">96.83</td>
<td align="char" char=".">0.9936</td>
</tr>
<tr>
<td align="left">4</td>
<td align="center">98.89</td>
<td align="center">97.98</td>
<td align="center">99.76</td>
<td align="center">99.77</td>
<td align="center">97.80</td>
<td align="char" char=".">0.9957</td>
</tr>
<tr>
<td align="left">5</td>
<td align="center">98.67</td>
<td align="center">97.58</td>
<td align="center">99.76</td>
<td align="center">99.77</td>
<td align="center">97.37</td>
<td align="char" char=".">0.9956</td>
</tr>
<tr>
<td align="left">Average</td>
<td align="center">
<bold>98.63&#x20;&#xb1; 0.22</bold>
</td>
<td align="center">
<bold>97.56&#x20;&#xb1; 0.43</bold>
</td>
<td align="center">
<bold>99.69&#x20;&#xb1; 0.10</bold>
</td>
<td align="center">
<bold>99.70&#x20;&#xb1; 0.09</bold>
</td>
<td align="center">
<bold>97.30&#x20;&#xb1; 0.42</bold>
</td>
<td align="char" char=".">
<bold>0.9954</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>For the <italic>Zea mays</italic> dataset, it can be observed from <xref ref-type="table" rid="T2">Table&#x20;2</xref> that the proposed CPIELA achieved good performance of accuracy 98.09%, precision 99.03%, sensitivity 97.13%, specificity 99.05%, MCC 96.25%, and AUC 0.9912, respectively. We also tested the CPIELA method on the <italic>Oryza sativa</italic> dataset. <xref ref-type="table" rid="T3">Table&#x20;3</xref> lists the predictive results of fivefold cross-validation. We achieved the high accuracy of 94.02%, the precision value of 94.39%, the sensitivity value of 93.63%, the specificity value of 94.43%, the MCC value of 88.79%, and the AUC value of 0.9581 on the <italic>Oryza sativa</italic> dataset. Furthermore, from <xref ref-type="table" rid="T3">Table&#x20;3</xref>, we can also see that the standard deviations of accuracy, precision, sensitivity, specificity, MCC, and AUC are 1.45%, 2.20%, 1.08%, 2.19%, 2.61%, and 0.014, respectively.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>The fivefold cross-validation results achieved on the <italic>Zea mays</italic> dataset using the proposed CPIELA method.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Testing set</th>
<th align="center">Accu. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">Spec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1</td>
<td align="center">97.82</td>
<td align="center">96.59</td>
<td align="center">99.07</td>
<td align="center">99.08</td>
<td align="center">95.74</td>
<td align="char" char=".">0.9914</td>
</tr>
<tr>
<td align="left">2</td>
<td align="center">98.28</td>
<td align="center">97.34</td>
<td align="center">99.22</td>
<td align="center">99.22</td>
<td align="center">96.62</td>
<td align="char" char=".">0.992</td>
</tr>
<tr>
<td align="left">3</td>
<td align="center">97.98</td>
<td align="center">97.05</td>
<td align="center">98.89</td>
<td align="center">98.91</td>
<td align="center">96.04</td>
<td align="char" char=".">0.9902</td>
</tr>
<tr>
<td align="left">4</td>
<td align="center">98.00</td>
<td align="center">97.00</td>
<td align="center">98.91</td>
<td align="center">98.96</td>
<td align="center">96.07</td>
<td align="char" char=".">0.9893</td>
</tr>
<tr>
<td align="left">5</td>
<td align="center">98.37</td>
<td align="center">97.65</td>
<td align="center">99.07</td>
<td align="center">99.09</td>
<td align="center">96.79</td>
<td align="char" char=".">0.9931</td>
</tr>
<tr>
<td align="left">Average</td>
<td align="center">
<bold>98.09&#x20;&#xb1; 0.23</bold>
</td>
<td align="center">
<bold>97.13&#x20;&#xb1; 0.40</bold>
</td>
<td align="center">
<bold>99.03&#x20;&#xb1; 0.14</bold>
</td>
<td align="center">
<bold>99.05&#x20;&#xb1; 0.12</bold>
</td>
<td align="center">
<bold>96.25&#x20;&#xb1; 0.44</bold>
</td>
<td align="char" char=".">
<bold>0.9912</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>The fivefold cross-validation results achieved on the <italic>Oryza sativa</italic> dataset using the proposed CPIELA method.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Testing set</th>
<th align="center">Accu. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">Spec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1</td>
<td align="center">93.70</td>
<td align="center">93.74</td>
<td align="center">93.45</td>
<td align="center">93.65</td>
<td align="center">88.19</td>
<td align="char" char=".">0.9558</td>
</tr>
<tr>
<td align="left">2</td>
<td align="center">93.59</td>
<td align="center">92.17</td>
<td align="center">95.17</td>
<td align="center">95.09</td>
<td align="center">88.00</td>
<td align="char" char=".">0.9516</td>
</tr>
<tr>
<td align="left">3</td>
<td align="center">93.33</td>
<td align="center">93.54</td>
<td align="center">93.15</td>
<td align="center">93.13</td>
<td align="center">87.56</td>
<td align="char" char=".">0.952</td>
</tr>
<tr>
<td align="left">4</td>
<td align="center">96.56</td>
<td align="center">95.21</td>
<td align="center">97.86</td>
<td align="center">97.91</td>
<td align="center">93.36</td>
<td align="char" char=".">0.9826</td>
</tr>
<tr>
<td align="left">5</td>
<td align="center">92.92</td>
<td align="center">93.49</td>
<td align="center">92.32</td>
<td align="center">92.36</td>
<td align="center">86.84</td>
<td align="char" char=".">0.9484</td>
</tr>
<tr>
<td align="left">Average</td>
<td align="center">
<bold>94.02&#x20;&#xb1; 1.45</bold>
</td>
<td align="center">
<bold>93.63&#x20;&#xb1; 1.08</bold>
</td>
<td align="center">
<bold>94.39&#x20;&#xb1; 2.20</bold>
</td>
<td align="center">
<bold>94.43&#x20;&#xb1; 2.19</bold>
</td>
<td align="center">
<bold>88.79&#x20;&#xb1; 2.61</bold>
</td>
<td align="char" char=".">
<bold>0.9581</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>
<xref ref-type="fig" rid="F2">Figures 2A&#x2013;C</xref> plot the ROC curves generated by the CPIELA method on the <italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic> datasets. It can be seen from the above experimental results that the CPIELA method is effective for predicting plant PPIs. The better prediction performance mainly comes from the discriminative LOOP descriptors and the powerful ROF classifier. More specifically, the PSSM not only encodes the sequence into the matrix but also obtains sufficient evolutionary information on plant proteins, which can significantly improve the prediction accuracy. As a popular ensemble classifier, the ROF model has a considerably high predictive capability for identifying potential PPIs, making us more convinced that the proposed CPIELA can be a useful tool for predicting plant&#x20;PPIs.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>The predictive performance of the proposed CPIELA method via fivefold cross-validation. <bold>(A&#x2013;C)</bold> The Receiver Operating Characteristic (ROC) curves of <italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic> datasets. <bold>(D)</bold> The ROC curves performed by the CPIELA method on three plant PPIs datasets.</p>
</caption>
<graphic xlink:href="fgene-13-857839-g002.tif"/>
</fig>
</sec>
<sec id="s2-3">
<title>Comparison of the Proposed Model With Different Classifiers and Descriptors</title>
<p>In this section, we conduct an experiment to compare the prediction performance of the state-of-the-art SVM classifier (<xref ref-type="bibr" rid="B11">Chih-Chung and Chih-Jen, 2011</xref>), the standard random forest (RF), and the rotation forest (ROF). The experimental results of the above-mentioned classifiers combined with the LOOP descriptor are listed in <xref ref-type="table" rid="T4">Table&#x20;4</xref>. It can be seen from <xref ref-type="table" rid="T4">Table&#x20;4</xref> that the average accuracies of SVM, RF, and ROF classifier on the <italic>Arabidopsis thaliana</italic> dataset are 89.37%, 97.21%, and 98.63%, respectively. To demonstrate the predictive ability of the proposed CPIELA more comprehensively, we also computed the values of sensitivity, precision, MCC, and AUC. As observed from <xref ref-type="table" rid="T4">Table&#x20;4</xref>, the proposed CPIELA model achieved the highest performance on the <italic>Arabidopsis thaliana</italic> dataset with the sensitivity value of 97.56%, precision value of 99.69%, MCC value of 97.30%, and AUC value of 0.9954. In addition, we could observe in detail from <xref ref-type="table" rid="T4">Table&#x20;4</xref> that the corresponding standard deviation of accuracy, precision, sensitivity, MCC, and AUC is 0.22%, 0.10%, 0.43%, 0.42%, and 0.0009, respectively.</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>The fivefold cross-validation results achieved by different classifiers on the three plant datasets.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Dataset</th>
<th align="center">Classifier</th>
<th align="center">Acc. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">
<italic>A. thaliana</italic>
</td>
<td align="left">SVM</td>
<td align="char" char="plusmn">89.37&#x20;&#xb1; 0.25</td>
<td align="char" char="plusmn">83.95&#x20;&#xb1; 0.51</td>
<td align="char" char="plusmn">94.16&#x20;&#xb1; 0.41</td>
<td align="char" char="plusmn">80.89&#x20;&#xb1; 0.39</td>
<td align="char" char="plusmn">0.9495&#x20;&#xb1; 0.0038</td>
</tr>
<tr>
<td align="left">RF</td>
<td align="char" char="plusmn">97.21&#x20;&#xb1; 0.12</td>
<td align="char" char="plusmn">96.15&#x20;&#xb1; 0.19</td>
<td align="char" char="plusmn">98.22&#x20;&#xb1; 0.33</td>
<td align="char" char="plusmn">94.58&#x20;&#xb1; 0.22</td>
<td align="char" char="plusmn">0.9720&#x20;&#xb1; 0.0011</td>
</tr>
<tr>
<td align="left">Our method</td>
<td align="char" char="plusmn">
<bold>98.63&#x20;&#xb1; 0.22</bold>
</td>
<td align="char" char="plusmn">
<bold>97.56&#x20;&#xb1; 0.43</bold>
</td>
<td align="char" char="plusmn">
<bold>99.69&#x20;&#xb1; 0.10</bold>
</td>
<td align="char" char="plusmn">
<bold>97.30&#x20;&#xb1; 0.42</bold>
</td>
<td align="char" char="plusmn">
<bold>0.9954&#x20;&#xb1; 0.0009</bold>
</td>
</tr>
<tr>
<td rowspan="3" align="left">
<italic>Zea mays</italic>
</td>
<td align="left">SVM</td>
<td align="char" char="plusmn">84.46&#x20;&#xb1; 0.20</td>
<td align="char" char="plusmn">77.55&#x20;&#xb1; 0.94</td>
<td align="char" char="plusmn">89.98&#x20;&#xb1; 0.47</td>
<td align="char" char="plusmn">73.5&#x20;&#xb1; 0.34</td>
<td align="char" char="plusmn">0.9179&#x20;&#xb1; 0.0048</td>
</tr>
<tr>
<td align="left">RF</td>
<td align="char" char="plusmn">94.65&#x20;&#xb1; 0.60</td>
<td align="char" char="plusmn">94.28&#x20;&#xb1; 0.66</td>
<td align="char" char="plusmn">94.98&#x20;&#xb1; 0.81</td>
<td align="char" char="plusmn">89.87&#x20;&#xb1; 1.07</td>
<td align="char" char="plusmn">0.9472&#x20;&#xb1; 0.0060</td>
</tr>
<tr>
<td align="left">Our method</td>
<td align="char" char="plusmn">
<bold>98.09&#x20;&#xb1; 0.23</bold>
</td>
<td align="char" char="plusmn">
<bold>97.13&#x20;&#xb1; 0.40</bold>
</td>
<td align="char" char="plusmn">
<bold>99.03&#x20;&#xb1; 0.14</bold>
</td>
<td align="char" char="plusmn">
<bold>96.25&#x20;&#xb1; 0.44</bold>
</td>
<td align="char" char="plusmn">
<bold>0.9912&#x20;&#xb1; 0.0015</bold>
</td>
</tr>
<tr>
<td rowspan="3" align="left">
<italic>Oryza sativa</italic>
</td>
<td align="left">SVM</td>
<td align="char" char="plusmn">88.95&#x20;&#xb1; 1.44</td>
<td align="char" char="plusmn">83.23&#x20;&#xb1; 2.52</td>
<td align="char" char="plusmn">94.00&#x20;&#xb1; 0.72</td>
<td align="char" char="plusmn">80.24&#x20;&#xb1; 2.28</td>
<td align="char" char="plusmn">0.9445&#x20;&#xb1; 0.0068</td>
</tr>
<tr>
<td align="left">RF</td>
<td align="char" char="plusmn">90.90&#x20;&#xb1; 1.30</td>
<td align="char" char="plusmn">90.45&#x20;&#xb1; 1.58</td>
<td align="char" char="plusmn">91.29&#x20;&#xb1; 2.10</td>
<td align="char" char="plusmn">83.47&#x20;&#xb1; 2.11</td>
<td align="char" char="plusmn">0.9113&#x20;&#xb1; 0.0122</td>
</tr>
<tr>
<td align="left">Our method</td>
<td align="char" char="plusmn">
<bold>94.02&#x20;&#xb1; 1.45</bold>
</td>
<td align="char" char="plusmn">
<bold>93.63&#x20;&#xb1; 1.08</bold>
</td>
<td align="char" char="plusmn">
<bold>94.39&#x20;&#xb1; 2.20</bold>
</td>
<td align="char" char="plusmn">
<bold>88.79&#x20;&#xb1; 2.61</bold>
</td>
<td align="char" char="plusmn">
<bold>0.9581&#x20;&#xb1; 0.0140</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>The precision, sensitivity, MCC, and AUC of the SVM classifier are 94.16%, 83.95%, 80.89%, and 0.9495, respectively. The precision, sensitivity, MCC, and AUC of the RF model are 98.22%, 96.15%, 94.58%, and 0.9720, respectively. It is evident that the SVM model achieved poor accuracy compared to the RF and ROF classifiers. It is specifically notable in the case of MCC. The proposed CPIELA method is the model with the best predictive results in terms of MCC for <italic>Arabidopsis thaliana</italic> PPIs datasets.</p>
<p>We also pay attention to the other two plant PPIs datasets. <xref ref-type="table" rid="T4">Table&#x20;4</xref> shows the experimental results obtain on the <italic>Zea mays</italic> dataset, from which we can observe that the average accuracies of SVM, RF, and ROF classifiers are 84.46%, 94.65%, and 98.09%, respectively. Here, it could also be observed that the average accuracies obtained by the SVM, RF, and ROF models on the <italic>Oryza sativa</italic> dataset are 88.95%, 90.90%, and 94.02%, respectively.</p>
<p>
<xref ref-type="fig" rid="F3">Figures 3A&#x2013;C</xref> show the ROC curve generated by different classifiers with the LOOP descriptor on the <italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic> PPIs datasets, respectively.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Prediction performance comparison of different classifiers using ROC curves in predicting plant protein&#x2013;protein interactions. Shown in the plot are the ROC curves for <bold>(A)</bold> <italic>Arabidopsis thaliana</italic>, <bold>(B)</bold> <italic>Zea mays</italic>, <bold>(C)</bold> <italic>Oryza sativa</italic> datasets using RF (blue line), ROF (green line), SVM (red line), respectively. <bold>(D)</bold> ROC curves of different descriptors on three plant PPIs datasets.</p>
</caption>
<graphic xlink:href="fgene-13-857839-g003.tif"/>
</fig>
<p>In order to further evaluate the predictive performance of CPIELA, we also compared it with several other protein descriptors. In the experiment, local phase quantization (LPQ), first proposed by <xref ref-type="bibr" rid="B35">Ojansivu et&#x20;al. (2008)</xref>, <xref ref-type="bibr" rid="B23">Heikkil&#xe4; et&#x20;al. (2014)</xref>, is employed to evaluate the performance of predicting plant PPIs on <italic>Arabidopsis thaliana</italic>, <italic>Zea mays</italic>, and <italic>Oryza sativa</italic> datasets, respectively. The fivefold cross-validation results of the LOOP and LPQ descriptor combined with ROF classifier on three plant PPIs datasets are summarized in <xref ref-type="table" rid="T5">Table&#x20;5</xref>. It can be observed that the LPQ descriptor achieved 73.17% average accuracy, 72.55% average sensitivity, 73.46% average precision, 73.79% average specificity, 60.74% average MCC, and 0.7873 average AUC on the <italic>Arabidopsis thaliana</italic> dataset. Meanwhile, the LOOP descriptor achieved 98.63% average accuracy, 97.56% average sensitivity, 99.69% average precision, 99.70% average specificity, 97.30% average MCC, and 0.9954 average AUC on the <italic>Arabidopsis thaliana</italic> dataset.</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>The fivefold cross-validation results achieved on the three plant PPIs dataset among different descriptors using the proposed method.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Dataset</th>
<th align="center">Methods</th>
<th align="center">Acc. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">Spec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">
<italic>A. thaliana</italic>
</td>
<td align="left">LPQ &#x2b; RoF</td>
<td align="center">73.17&#x20;&#xb1; 0.72</td>
<td align="center">72.55&#x20;&#xb1; 0.86</td>
<td align="center">73.46&#x20;&#xb1; 0.84</td>
<td align="center">73.79&#x20;&#xb1; 0.64</td>
<td align="center">60.74&#x20;&#xb1; 0.69</td>
<td align="center">0.7873&#x20;&#xb1; 0.0090</td>
</tr>
<tr>
<td align="left">LOOP &#x2b; RoF</td>
<td align="center">
<bold>98.63&#x20;&#xb1; 0.22</bold>
</td>
<td align="center">
<bold>97.56&#x20;&#xb1; 0.43</bold>
</td>
<td align="center">
<bold>99.69&#x20;&#xb1; 0.10</bold>
</td>
<td align="center">
<bold>99.70&#x20;&#xb1; 0.09</bold>
</td>
<td align="center">
<bold>97.30&#x20;&#xb1; 0.42</bold>
</td>
<td align="center">
<bold>0.9954 &#xb1;</bold> <bold>0.0009</bold>
</td>
</tr>
<tr>
<td rowspan="2" align="left">
<italic>Zea mays</italic>
</td>
<td align="left">LPQ &#x2b; RoF</td>
<td align="center">94.17&#x20;&#xb1; 0.40</td>
<td align="center">93.4&#x20;&#xb1; 0.64</td>
<td align="center">94.86&#x20;&#xb1; 0.53</td>
<td align="center">94.93&#x20;&#xb1; 0.50</td>
<td align="center">89.02&#x20;&#xb1; 0.72</td>
<td align="center">0.9639&#x20;&#xb1; 0.0031</td>
</tr>
<tr>
<td align="left">LOOP &#x2b; RoF</td>
<td align="center">
<bold>98.09&#x20;&#xb1; 0.23</bold>
</td>
<td align="center">
<bold>97.13&#x20;&#xb1; 0.40</bold>
</td>
<td align="center">
<bold>99.03&#x20;&#xb1; 0.14</bold>
</td>
<td align="center">
<bold>99.05&#x20;&#xb1; 0.12</bold>
</td>
<td align="center">
<bold>96.25&#x20;&#xb1; 0.44</bold>
</td>
<td align="center">
<bold>0.9912&#x20;&#xb1; 0.0015</bold>
</td>
</tr>
<tr>
<td rowspan="2" align="left">
<italic>Oryza sativa</italic>
</td>
<td align="left">LPQ &#x2b; RoF</td>
<td align="center">91.89&#x20;&#xb1; 0.64</td>
<td align="center">92.14&#x20;&#xb1; 1.57</td>
<td align="center">91.70&#x20;&#xb1; 0.87</td>
<td align="center">91.65&#x20;&#xb1; 1.01</td>
<td align="center">85.09&#x20;&#xb1; 1.07</td>
<td align="center">0.9474&#x20;&#xb1; 0.0041</td>
</tr>
<tr>
<td align="left">LOOP &#x2b; RoF</td>
<td align="center">
<bold>94.02&#x20;&#xb1; 1.45</bold>
</td>
<td align="center">
<bold>93.63&#x20;&#xb1; 1.08</bold>
</td>
<td align="center">
<bold>94.39&#x20;&#xb1; 2.20</bold>
</td>
<td align="center">
<bold>94.43&#x20;&#xb1; 2.19</bold>
</td>
<td align="center">
<bold>88.79&#x20;&#xb1; 2.61</bold>
</td>
<td align="center">
<bold>0.9581&#x20;&#xb1; 0.0140</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>As we can see in <xref ref-type="fig" rid="F3">Figure&#x20;3D</xref>, for <italic>Arabidopsis thaliana</italic>, the area under the ROC curve corresponding to LOOP is significantly larger than that of the LPQ descriptor. In terms of the indicator AUC, the AUC value of LOOP reaches 0.9957, which is 26.42% higher than that of the LPQ method. The experimental results also demonstrate that the LOOP descriptor exhibited significantly better performance than the LPQ descriptor on the other two plant PPIs datasets. Furthermore, the higher prediction accuracies and lower standard deviations indicate that the LOOP descriptor can effectively extract the features from protein sequence and significantly improve the predictive performance in plant PPIs prediction.</p>
</sec>
<sec id="s2-4">
<title>Comparison With Existing Method</title>
<p>In the previous works, some researchers have put forward several computational approaches to solve the problem of plant PPIs prediction (<xref ref-type="bibr" rid="B36">Pan et&#x20;al., 2021a</xref>; <xref ref-type="bibr" rid="B37">Pan et&#x20;al., 2021b</xref>). Therefore, we compare the predictive performance of CPIELA against the recently proposed approaches. Experimental results of predictive performance comparison on <italic>Oryza sativa</italic> dataset between CPIELA and several related models are demonstrated in <xref ref-type="table" rid="T6">Table&#x20;6</xref>. It can be clearly observed from this table that the range of AUC generated by other approaches is from 0.7931 to 0.9440, the range of MCC obtained is from 37.39% to 78.26%, the range of accuracy generated by other models is from 66.63% to 82.60%, and the corresponding values obtained by CPIELA are 0.9581, 88.79%, and 94.02%. It shows that the predictive performance (AUC, MCC, accuracy) of CPIELA is better than that of existing models. We can see from <xref ref-type="table" rid="T6">Table&#x20;6</xref> that the CPIELA model also gives better performance than the above-mentioned models for sensitivity, precision, and specificity metrics. Overall, the proposed CPIELA model shows better predictive performance than the previous prediction model on the <italic>Oryza sativa</italic> dataset.</p>
<table-wrap id="T6" position="float">
<label>TABLE 6</label>
<caption>
<p>The predictive performance comparison of different methods on the <italic>Oryza sativa</italic> dataset.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Methods</th>
<th align="center">Accu. (%)</th>
<th align="center">Sen. (%)</th>
<th align="center">Prec. (%)</th>
<th align="center">Spec. (%)</th>
<th align="center">MCC (%)</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">
<bold>DHT &#x2b; KNN</bold>
</td>
<td align="center">N/A</td>
<td align="char" char="plusmn">89.28&#x20;&#xb1; 0.78</td>
<td align="char" char="plusmn">76.41&#x20;&#xb1; 1.55</td>
<td align="char" char="plusmn">72.44&#x20;&#xb1; 1.58</td>
<td align="char" char="plusmn">68.59&#x20;&#xb1; 1.17</td>
<td align="char" char="plusmn">0.8680&#x20;&#xb1; 0.8900</td>
</tr>
<tr>
<td align="left">
<bold>DHT &#x2b; RF</bold>
</td>
<td align="center">N/A</td>
<td align="char" char="plusmn">88.00&#x20;&#xb1; 1.34</td>
<td align="char" char="plusmn">87.30&#x20;&#xb1; 1.35</td>
<td align="char" char="plusmn">87.22&#x20;&#xb1; 1.16</td>
<td align="char" char="plusmn">78.26&#x20;&#xb1; 1.28</td>
<td align="char" char="plusmn">0.9199&#x20;&#xb1; 0.5800</td>
</tr>
<tr>
<td align="left">
<bold>DHT &#x2b; DNN</bold>
</td>
<td align="center">82.60&#x20;&#xb1; 1.79</td>
<td align="char" char="plusmn">95.89&#x20;&#xb1; 0.91</td>
<td align="char" char="plusmn">75.79&#x20;&#xb1; 2.43</td>
<td align="char" char="plusmn">69.31&#x20;&#xb1; 3.53</td>
<td align="char" char="plusmn">67.65&#x20;&#xb1; 2.98</td>
<td align="char" char="plusmn">0.9440&#x20;&#xb1; 0.5800</td>
</tr>
<tr>
<td align="left">
<bold>FFT &#x2b; DNN</bold>
</td>
<td align="center">75.31&#x20;&#xb1; 1.37</td>
<td align="char" char="plusmn">93.34&#x20;&#xb1; 1.59</td>
<td align="char" char="plusmn">68.61&#x20;&#xb1; 1.03</td>
<td align="char" char="plusmn">57.23&#x20;&#xb1; 2.90</td>
<td align="char" char="plusmn">54.26&#x20;&#xb1; 2.81</td>
<td align="char" char="plusmn">0.8760&#x20;&#xb1; 0.0096</td>
</tr>
<tr>
<td align="left">
<bold>DWT &#x2b; DNN</bold>
</td>
<td align="center">81.54&#x20;&#xb1; 3.05</td>
<td align="char" char="plusmn">94.81&#x20;&#xb1; 0.65</td>
<td align="char" char="plusmn">75.10&#x20;&#xb1; 3.84</td>
<td align="char" char="plusmn">68.26&#x20;&#xb1; 6.61</td>
<td align="char" char="plusmn">65.50&#x20;&#xb1; 4.99</td>
<td align="char" char="plusmn">0.9309&#x20;&#xb1; 0.0052</td>
</tr>
<tr>
<td align="left">
<bold>AC &#x2b; DNN</bold>
</td>
<td align="center">66.63&#x20;&#xb1; 4.48</td>
<td align="char" char="plusmn">88.42&#x20;&#xb1; 4.77</td>
<td align="char" char="plusmn">62.02&#x20;&#xb1; 4.91</td>
<td align="char" char="plusmn">45.02&#x20;&#xb1; 12.49</td>
<td align="char" char="plusmn">37.39&#x20;&#xb1; 5.39</td>
<td align="char" char="plusmn">0.7931&#x20;&#xb1; 0.0126</td>
</tr>
<tr>
<td align="left">
<bold>DCT &#x2b; DNN</bold>
</td>
<td align="center">80.95&#x20;&#xb1; 1.10</td>
<td align="char" char="plusmn">96.12&#x20;&#xb1; 1.15</td>
<td align="char" char="plusmn">73.70&#x20;&#xb1; 1.41</td>
<td align="char" char="plusmn">65.64&#x20;&#xb1; 2.40</td>
<td align="char" char="plusmn">64.99&#x20;&#xb1; 1.97</td>
<td align="char" char="plusmn">0.9360&#x20;&#xb1; 0.0017</td>
</tr>
<tr>
<td align="left">Our method</td>
<td align="center">
<bold>94.02&#x20;&#xb1; 1.45</bold>
</td>
<td align="char" char="plusmn">
<bold>93.63&#x20;&#xb1; 1.08</bold>
</td>
<td align="char" char="plusmn">
<bold>94.39&#x20;&#xb1; 2.20</bold>
</td>
<td align="char" char="plusmn">
<bold>94.43&#x20;&#xb1; 2.19</bold>
</td>
<td align="char" char="plusmn">
<bold>88.79&#x20;&#xb1; 2.61</bold>
</td>
<td align="char" char="plusmn">
<bold>0.9581&#x20;&#xb1; 0.0140</bold>
</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>DHT: discrete Hilbert transform (<xref ref-type="bibr" rid="B12">Cizek, 1970</xref>); KNN: k-nearest neighbors; RF: random forest; FFT: fast Fourier transform; DWT: discrete wavelet transform; AC: auto covariance; DCT: discrete cosine transform.</p>
</fn>
<fn>
<p>The bold values in these Tables mean the highest value in every column.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="conclusion" id="s3">
<title>Conclusion</title>
<p>Protein&#x2013;protein interactions are involved in almost all aspects of plant cellular processes. Thus, identifying plant PPIs is an important step toward understanding the molecular mechanisms and biological systems. This article developed a novel computational approach called CPIELA for predicting plant PPIs using the specifically designed protein representation method LOOP and ROF-based framework. The local optimal-oriented pattern (LOOP) descriptor is proposed to conquer some of the disadvantages in the previous feature descriptor, local directional pattern (LDP), and local binary pattern (LBP), by integrating the strength of these two descriptors. Thus, the LOOP-based features from PSSM are useful for predictive accuracy improvement. A highly accurate rotation forest algorithm is used to predict the potential plant PPIs. Experimental results on three plant PPIs datasets showed that the proposed CPIELA method outperforms all existing methods, demonstrating the feasibility and effectiveness of the proposed protein representation LOOP and the ROF-based classifier for predicting plant PPIs. The proposed sequence-based prediction method enables the systematic identification of possible PPIs in plants.</p>
</sec>
<sec sec-type="materials|methods" id="s4">
<title>Materials and Methodology</title>
<sec id="s4-1">
<title>Golden Standard Datasets</title>
<p>With the rapid advances of high-throughput biological technologies, many resources currently provide plant PPIs for different species. To construct a plant PPIs prediction model and compare it with existing prediction approaches, three plant PPIs datasets (<italic>Zea mays</italic>, <italic>Oryza sativa</italic>, and <italic>Arabidopsis thaliana</italic>) are employed in this work. For the interactome of <italic>Zea mays</italic>, 14,230 experimentally verified PPIs are downloaded from the Protein-Protein Interaction Database for Maize (PPIM) (<xref ref-type="bibr" rid="B54">Zhu et&#x20;al., 2017</xref>) and agriGO (<xref ref-type="bibr" rid="B44">Tian et&#x20;al., 2017</xref>). Because there is no available confirmed non-interacting plant PPIs, constructing negative PPIs dataset remains a challenging task in PPIs prediction. In order to build the negative dataset, 14,230 maize protein pairs located in different subcellular localization are randomly chose in this study. Consequently, the whole <italic>Zea mays</italic> dataset consists of 28,460 protein&#x20;pairs.</p>
<p>A total of 4,800&#x20;non-redundant <italic>Oryza sativa</italic> protein interaction pairs among 1,834 rice proteins are downloaded from the PRIN database (<ext-link ext-link-type="uri" xlink:href="http://bis.zju.edu.cn/prin">http://bis.zju.edu.cn/prin</ext-link>) (<xref ref-type="bibr" rid="B20">Gu et&#x20;al., 2011</xref>). The <italic>Arabidopsis thaliana</italic> PPIs dataset is collected from the public databases of BioGrid (<xref ref-type="bibr" rid="B41">Rose et&#x20;al., 2018</xref>), TAIR (<xref ref-type="bibr" rid="B48">Yon et&#x20;al., 2003</xref>), and IntAct (<xref ref-type="bibr" rid="B27">Kerrien et&#x20;al., 2011</xref>). Meanwhile, the protein pairs containing a protein with fewer than fifty amino acids or having &#x2264;40% sequence identity are removed. Finally, the 28,110 protein pairs from 7,437&#x20;<italic>Arabidopsis thaliana</italic> proteins comprise the positive dataset. The 28,110 protein pairs occurring in two different subcellular localizations are generated as a negative PPIs dataset. In this way, the whole <italic>Arabidopsis thaliana</italic> dataset is constructed by more than 56,220 protein pairs. The summary of plant PPIs used in this study is shown in <xref ref-type="table" rid="T7">Table&#x20;7</xref>.</p>
<table-wrap id="T7" position="float">
<label>TABLE 7</label>
<caption>
<p>Summary of plant PPIs and proteins in different species.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Species name</th>
<th align="center">Common name</th>
<th align="center">Number of proteins</th>
<th align="center">Number of PPIs</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">
<bold>
<italic>Arabidopsis thaliana</italic>
</bold>
</td>
<td align="left">Thale cress</td>
<td align="center">7, 437</td>
<td align="center">56, 220</td>
</tr>
<tr>
<td align="left">
<bold>
<italic>Zea mays</italic>
</bold>
</td>
<td align="left">Maize</td>
<td align="center">4, 841</td>
<td align="center">28, 460</td>
</tr>
<tr>
<td align="left">
<bold>
<italic>Oryza sativa</italic>
</bold>
</td>
<td align="left">Rice</td>
<td align="center">1, 834</td>
<td align="center">9, 600</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-2">
<title>Position-Specific Scoring Matrix</title>
<p>The position-specific scoring matrix (PSSM) was first proposed by Gribskov et&#x20;al. to detect distantly related proteins and is now widely applied for the representation and prediction of PPIs (<xref ref-type="bibr" rid="B19">Gribskov et&#x20;al., 1987</xref>; <xref ref-type="bibr" rid="B50">You et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B45">Wong et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B49">You et&#x20;al., 2016b</xref>). A PSSM for a given protein is a 20&#xd7;<italic>M</italic> matrix <inline-formula id="inf1">
<mml:math id="m6">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1,2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mn>20</mml:mn>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>a</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>d</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1,2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, where <italic>M</italic> is the length of the target protein sequence. The PSSM matrix <italic>p</italic> can be represented as follows:<disp-formula id="e6">
<mml:math id="m7">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>1,1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>1,2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>2,1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>2,2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x22ee;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x22ee;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x22ee;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>&#x22ee;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>20,1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>20,2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x22ef;</mml:mo>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mn>20</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(6)</label>
</disp-formula>where each element denotes the log-likelihood of the particular amino acid substitution at that position in the template. For example, it assigns a value <inline-formula id="inf2">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> for the <italic>i</italic>th residue in the <italic>j</italic>th position of the query protein sequence with a small score representing a weekly conserved position and a large score indicating a highly conserved position.</p>
<p>In the experiment, we employed the position-specific iterated BLAST (PSI-BLAST) tool and SwissProt database to build the PSSM for each protein amino acid sequence (<xref ref-type="bibr" rid="B3">Altschul et&#x20;al., 1997</xref>; <xref ref-type="bibr" rid="B2">Altschul and Koonin, 1998</xref>; <xref ref-type="bibr" rid="B4">Amos and Rolf, 1999</xref>). The PSI-BLAST approach is highly sensitive in discovering similar proteins in distantly related species and new members of the protein family. To obtain high homologous sequences, we set the number of iterations to three, the e-value to 0.001, and the default value to the other parameters. The PSI-BLAST tool was downloaded from <ext-link ext-link-type="uri" xlink:href="http://blast.ncbi.nlm.nih.gov/Blast.cgi">http://blast.ncbi.nlm.nih.gov/Blast.cgi</ext-link>.</p>
</sec>
<sec id="s4-3">
<title>Local Optimal-Oriented Pattern</title>
<p>Tapabrata et&#x20;al. presented the local optimal-oriented pattern (LOOP) as a novel binary local pattern descriptor that encodes rotation invariance into the main formulation of the local binary descriptor (<xref ref-type="bibr" rid="B9">Chakraborti et&#x20;al., 2018</xref>). The LOOP descriptor is an improvement designed on local binary pattern (LBP) (<xref ref-type="bibr" rid="B34">Ojala et&#x20;al., 1994</xref>) and local directional pattern (LDP) (<xref ref-type="bibr" rid="B25">Jabid et&#x20;al., 2010</xref>).</p>
<p>Given an image <inline-formula id="inf3">
<mml:math id="m9">
<mml:mi>I</mml:mi>
</mml:math>
</inline-formula>, let <inline-formula id="inf4">
<mml:math id="m10">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> be the intensity at pixel <inline-formula id="inf5">
<mml:math id="m11">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mtext>x</mml:mtext>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Suppose <inline-formula id="inf6">
<mml:math id="m12">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> represents the intensity of a pixel in the <inline-formula id="inf7">
<mml:math id="m13">
<mml:mrow>
<mml:mn>3</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> neighborhood of <inline-formula id="inf8">
<mml:math id="m14">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mtext>x</mml:mtext>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> keeping out the pixel <inline-formula id="inf9">
<mml:math id="m15">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. <xref ref-type="fig" rid="F4">Figure&#x20;4</xref> shows the Kirsch edge detectors centered at <inline-formula id="inf10">
<mml:math id="m16">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mtext>x</mml:mtext>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> in eight directions. Let <inline-formula id="inf11">
<mml:math id="m17">
<mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> be the eight responses of the Kirsch masks, corresponding to pixels with intensity <inline-formula id="inf12">
<mml:math id="m18">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Suppose <inline-formula id="inf13">
<mml:math id="m19">
<mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is the <italic>k</italic>th highest Kirsch activation. An exponential <inline-formula id="inf14">
<mml:math id="m20">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> for each of these pixels is assigned based on the rank of the magnitude of <inline-formula id="inf15">
<mml:math id="m21">
<mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> amongst the eight Kirsch mask outputs. Finally, the value of LOOP for the pixel <inline-formula id="inf16">
<mml:math id="m22">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mtext>x</mml:mtext>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is calculated as follows:<disp-formula id="e7">
<mml:math id="m23">
<mml:mrow>
<mml:mtext>LOOP</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>x</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mn>7</mml:mn>
</mml:munderover>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mn>.2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mi>n</mml:mi>
</mml:msub>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(7)</label>
</disp-formula>where<disp-formula id="e8">
<mml:math id="m24">
<mml:mrow>
<mml:mi>s</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mi>f</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi mathvariant="normal">o</mml:mi>
<mml:mi mathvariant="normal">t</mml:mi>
<mml:mi mathvariant="normal">h</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
<mml:mi mathvariant="normal">r</mml:mi>
<mml:mi mathvariant="normal">w</mml:mi>
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">s</mml:mi>
<mml:mi mathvariant="normal">e</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(8)</label>
</disp-formula>where <inline-formula id="inf17">
<mml:math id="m25">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes the intensity of the center pixel <inline-formula id="inf18">
<mml:math id="m26">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtext>x</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. In our study, the input PSSM is a 20&#xd7;<italic>M</italic> matrix. Thus, each protein sequence is represented by a 256-dimensional feature vector after employing the LOOP descriptor.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>The masks of Kirsch&#x2019;s edge detector which is used for calculating responses in eight possible directions.</p>
</caption>
<graphic xlink:href="fgene-13-857839-g004.tif"/>
</fig>
</sec>
<sec id="s4-4">
<title>Rotation Forest</title>
<p>Rotation forest (ROF) is a popular ensemble classifier firstly proposed by <xref ref-type="bibr" rid="B40">Rodriguez et&#x20;al. (2006)</xref>. Compared with other classifiers, the ROF model is successfully used in dealing with many computational biology problems (<xref ref-type="bibr" rid="B21">He et&#x20;al., 2021b</xref>). The basic idea of the rotation forest model is to simultaneously improve both individual accuracy and member diversity within an ensemble classifier. The success of the ROF method is attributed to the base classifier and rotation matrix created by the transformation algorithms, including principal component analysis (PCA) (<xref ref-type="bibr" rid="B26">Jolliffe, 2002</xref>), local fisher discriminant analysis (LFDA) (<xref ref-type="bibr" rid="B31">Masashi et&#x20;al., 2010</xref>), maximum noise fraction (MNF) (<xref ref-type="bibr" rid="B17">Gordon, 2000</xref>), and independent component analysis (ICA) (<xref ref-type="bibr" rid="B38">Prasad, 2001</xref>). The framework of the ROF model is described as follows.</p>
<p>Let <inline-formula id="inf19">
<mml:math id="m27">
<mml:mi>X</mml:mi>
</mml:math>
</inline-formula> be the training samples in the form of an <inline-formula id="inf20">
<mml:math id="m28">
<mml:mrow>
<mml:mi>N</mml:mi>
<mml:mo>&#xd7;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix, where <italic>N</italic> represents the number of samples and <inline-formula id="inf21">
<mml:math id="m29">
<mml:mi>n</mml:mi>
</mml:math>
</inline-formula> denotes the number of features, respectively. Let a vector <inline-formula id="inf22">
<mml:math id="m30">
<mml:mrow>
<mml:mi>Y</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>N</mml:mi>
</mml:msub>
<mml:mo>]</mml:mo>
</mml:mrow>
<mml:mi>T</mml:mi>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> be the corresponding class label, where <inline-formula id="inf23">
<mml:math id="m31">
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. Let <italic>F</italic> be the feature set, and <italic>F</italic> is randomly split into <italic>K</italic> equal subset. Suppose <italic>L</italic> is the number of base decision trees in the ensemble model, which could be represented as <inline-formula id="inf24">
<mml:math id="m32">
<mml:mrow>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mi>L</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, respectively. It should be noticed that the number of base classifiers (<italic>L</italic>) and the number of feature subsets (<italic>K</italic>) are the two important tuning parameters for the ROF classifier. The training dataset for a single classifier <inline-formula id="inf25">
<mml:math id="m33">
<mml:mrow>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is preprocessed as follows:<list list-type="simple">
<list-item>
<p>1) Randomly divide <italic>F</italic> into <italic>K</italic> disjointed feature sets, each subset containing <inline-formula id="inf26">
<mml:math id="m34">
<mml:mrow>
<mml:mi>M</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> features.</p>
</list-item>
<list-item>
<p>2) Let <inline-formula id="inf27">
<mml:math id="m35">
<mml:mrow>
<mml:msub>
<mml:mtext>F</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> be the <inline-formula id="inf28">
<mml:math id="m36">
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mtext>th&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> feature subset for the training dataset of classifier <inline-formula id="inf29">
<mml:math id="m37">
<mml:mrow>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and a new matrix <inline-formula id="inf30">
<mml:math id="m38">
<mml:mrow>
<mml:msub>
<mml:mtext>X</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is built by selecting the corresponding column of the features in the subset <inline-formula id="inf31">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mtext>F</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> from the training dataset <inline-formula id="inf32">
<mml:math id="m40">
<mml:mi>X</mml:mi>
</mml:math>
</inline-formula>. Then, a bootstrap subset of objects is selected with the size of 75 percent of the dataset <inline-formula id="inf33">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mtext>X</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> to form a new training dataset <inline-formula id="inf34">
<mml:math id="m42">
<mml:mrow>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>&#x27;</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>3) The principal component analysis (PCA) technique is used on <inline-formula id="inf35">
<mml:math id="m43">
<mml:mrow>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mo>&#x27;</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> to obtain the coefficients in a matrix&#x20;<inline-formula id="inf36">
<mml:math id="m44">
<mml:mrow>
<mml:msub>
<mml:mtext>C</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
</list-item>
<list-item>
<p>4) A sparse rotation matrix <inline-formula id="inf37">
<mml:math id="m45">
<mml:mrow>
<mml:msub>
<mml:mtext>R</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is constructed using the coefficients obtained in the matrix <inline-formula id="inf38">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mtext>C</mml:mtext>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, which is expressed as follows:</p>
</list-item>
</list>
<disp-formula id="e9">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mi>R</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x22ee;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x22ee;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x22f1;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x22ee;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>K</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:msub>
<mml:mi>M</mml:mi>
<mml:mi>K</mml:mi>
</mml:msub>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
<mml:mo>]</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(9)</label>
</disp-formula>
</p>
<p>The columns of <inline-formula id="inf39">
<mml:math id="m48">
<mml:mrow>
<mml:msub>
<mml:mtext>R</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> should be rearranged to <inline-formula id="inf40">
<mml:math id="m49">
<mml:mrow>
<mml:msubsup>
<mml:mi>R</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>a</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> according to the original feature set. Then, the transformed training dataset for classifier <inline-formula id="inf41">
<mml:math id="m50">
<mml:mrow>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> will become <inline-formula id="inf42">
<mml:math id="m51">
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:msubsup>
<mml:mi>R</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>a</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula>. In this way, all classifiers are trained in parallel.</p>
<p>In the prediction phase, provided a testing sample <inline-formula id="inf43">
<mml:math id="m52">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula>, let <inline-formula id="inf44">
<mml:math id="m53">
<mml:mrow>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:msubsup>
<mml:mi>R</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>a</mml:mi>
</mml:msubsup>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> be the probability generated by the classifier <inline-formula id="inf45">
<mml:math id="m54">
<mml:mrow>
<mml:msub>
<mml:mtext>&#x393;</mml:mtext>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> to the hypothesis that <inline-formula id="inf46">
<mml:math id="m55">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula> belongs to class <inline-formula id="inf47">
<mml:math id="m56">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3c9;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. Then, the confidence of each class is calculated by means of the average combination as follows:<disp-formula id="e10">
<mml:math id="m57">
<mml:mrow>
<mml:msub>
<mml:mi>&#x3bc;</mml:mi>
<mml:mi>k</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>L</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>L</mml:mi>
</mml:munderover>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:msubsup>
<mml:mi>R</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>a</mml:mi>
</mml:msubsup>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>c</mml:mi>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(10)</label>
</disp-formula>
</p>
<p>Finally, the testing sample <inline-formula id="inf48">
<mml:math id="m58">
<mml:mi>x</mml:mi>
</mml:math>
</inline-formula> is assigned to the class with the largest confidence.</p>
</sec>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>L-PL performed experiments and wrote the manuscript. BZ and LC designed, performed, and analyzed experiments and wrote the manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported in part by the National Science Foundation of China, under Grant 61873212, and the Science and Technology Innovation 2030-New Generation Artificial Intelligence Major Project (no. 2018AAA0100103).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>The authors would like to thank all reviewers for their constructive advice.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aloy</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Russell</surname>
<given-names>R. B.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Ten Thousand Interactions for the Molecular Biologist</article-title>. <source>Nat. Biotechnol.</source> <volume>22</volume> (<issue>10</issue>), <fpage>1317</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1038/nbt1018</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>S. F.</given-names>
</name>
<name>
<surname>Koonin</surname>
<given-names>E. V.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Iterated Profile Searches with PSI-BLAST-A Tool for Discovery in Protein Databases</article-title>. <source>Trends Biochem. Sci.</source> <volume>23</volume> (<issue>11</issue>), <fpage>444</fpage>&#x2013;<lpage>447</lpage>. <pub-id pub-id-type="doi">10.1016/s0968-0004(98)01298-5</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>T. L.</given-names>
</name>
<name>
<surname>Schffer</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Webb</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>1997</year>). <article-title>Gapped BLAST and PSI-BLAST: a New Generation of Protein Database Search Programs</article-title>. <source>Nucleic Acids Res.</source> <volume>25</volume> (<issue>17</issue>), <fpage>3389</fpage>&#x2013;<lpage>3402</lpage>. <pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amos</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Rolf</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>The SWISS-PROT Protein Sequence Data Bank and its Supplement TrEMBL in 1999</article-title>. <source>Nucleic Acids Res.</source> (<issue>1</issue>), <fpage>49</fpage>. </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bracha-Drori</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Shichrur</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Katz</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Oliva</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Angelovici</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Yalovsky</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>Detection of Protein-Protein Interactions in Plants Using Bimolecular Fluorescence Complementation</article-title>. <source>Plant J.</source> <volume>40</volume> (<issue>3</issue>), <fpage>419</fpage>&#x2013;<lpage>427</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2004.02206.x</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brand&#xe3;o</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Dantas</surname>
<given-names>L. L.</given-names>
</name>
<name>
<surname>Silva-Filho</surname>
<given-names>M. C.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>AtPIN: <italic>Arabidopsis thaliana</italic> Protein Interaction Network</article-title>. <source>Bmc Bioinformatics</source> <volume>10</volume>, <fpage>454</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-454</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Broadhurst</surname>
<given-names>D. I.</given-names>
</name>
<name>
<surname>Kell</surname>
<given-names>D. B.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Statistical Strategies for Avoiding False Discoveries in Metabolomics and Related Experiments</article-title>. <source>Metabolomics</source> <volume>2</volume> (<issue>4</issue>), <fpage>171</fpage>&#x2013;<lpage>196</lpage>. <pub-id pub-id-type="doi">10.1007/s11306-006-0037-z</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Causier</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Davies</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Analysing Protein-Protein Interactions with the Yeast Two-Hybrid System</article-title>. <source>Plant Mol. Biol.</source> <volume>50</volume> (<issue>6</issue>), <fpage>855</fpage>&#x2013;<lpage>870</lpage>. <pub-id pub-id-type="doi">10.1023/a:1021214007897</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chakraborti</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>McCane</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Mills</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>LOOP Descriptor: Local Optimal-Oriented Pattern</article-title>. <source>IEEE Signal. Process. Lett.</source> <volume>25</volume>, <fpage>635</fpage>&#x2013;<lpage>639</lpage>. <pub-id pub-id-type="doi">10.1109/lsp.2018.2817176</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Loscalzo</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Comprehensive Characterization of Protein&#x2013;Protein Interactions Perturbed by Disease Mutations</article-title>. <source>Nat. Genet.</source> <volume>53</volume> (<issue>3</issue>), <fpage>1</fpage>&#x2013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-020-00774-y</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Chih-Chung</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Chih-Jen</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2011</year>). <source>Libsvm: A Library for Support Vector Machines</source>. </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cizek</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>1970</year>). <article-title>Discrete Hilbert Transform</article-title>. <source>IEEE Trans. Audio Electroacoust.</source> <volume>18</volume> (<issue>4</issue>), <fpage>340</fpage>&#x2013;<lpage>343</lpage>. <pub-id pub-id-type="doi">10.1109/tau.1970.1162139</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ding</surname>
<given-names>Y.-D.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>J.-W.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Q.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Prediction and Functional Analysis of the Sweet orange Protein-Protein Interaction Network</article-title>. <source>BMC Plant Biol.</source> <volume>14</volume> (<issue>1</issue>), <fpage>213</fpage>. <pub-id pub-id-type="doi">10.1186/s12870-014-0213-7</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dreze</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Carvunis</surname>
<given-names>A-R.</given-names>
</name>
<name>
<surname>Charloteaux</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Galli</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Evidence for Network Evolution in an Arabidopsis Interactome Map</article-title>. <source>Science</source> <volume>333</volume> (<issue>6042</issue>), <fpage>601</fpage>&#x2013;<lpage>607</lpage>. <pub-id pub-id-type="doi">10.1126/science.1203877</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fukao</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Protein-protein Interactions in Plants</article-title>. <source>Plant Cel Physiol.</source> <volume>53</volume> (<issue>4</issue>), <fpage>617</fpage>&#x2013;<lpage>625</lpage>. <pub-id pub-id-type="doi">10.1093/pcp/pcs026</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Geisler-Lee</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>O&#x27;Toole</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Ammar</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Provart</surname>
<given-names>N. J.</given-names>
</name>
<name>
<surname>Millar</surname>
<given-names>A. H.</given-names>
</name>
<name>
<surname>Geisler</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>A Predicted Interactome for Arabidopsis</article-title>. <source>Plant Physiol.</source> <volume>145</volume> (<issue>2</issue>), <fpage>317</fpage>&#x2013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.1104/pp.107.103465</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gordon</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>A Generalization of the Maximum Noise Fraction Transform</article-title>. <source>IEEE Trans. Geosci. Remote Sensing</source> <volume>38</volume> (<issue>1</issue>), <fpage>608</fpage>&#x2013;<lpage>610</lpage>. <pub-id pub-id-type="doi">10.1109/36.823955</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Green</surname>
<given-names>A. G.</given-names>
</name>
<name>
<surname>Elhabashy</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Brock</surname>
<given-names>K. P.</given-names>
</name>
<name>
<surname>Maddamsetti</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Kohlbacher</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Marks</surname>
<given-names>D. S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Large-scale Discovery of Protein Interactions at Residue Resolution Using Co-evolution Calculated from Genomic Sequences</article-title>. <source>Nat. Commun.</source> <volume>12</volume> (<issue>1</issue>), <fpage>1396</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-21636-z</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gribskov</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>McLachlan</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Eisenberg</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Profile Analysis: Detection of Distantly Related Proteins</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>84</volume> (<issue>13</issue>), <fpage>4355</fpage>&#x2013;<lpage>4358</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.84.13.4355</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Jiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>PRIN: a Predicted rice Interactome Network</article-title>. <source>Bmc Bioinformatics</source> <volume>12</volume>, <fpage>161</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-161</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ong</surname>
<given-names>Y. S.</given-names>
</name>
</person-group> (<year>2021b</year>). <article-title>Vicinal Vertex Allocation for Matrix Factorization in Networks</article-title>. <source>IEEE T Cybern</source> (<issue>99</issue>). <publisher-loc>Piscataway, NJ</publisher-loc>: <publisher-name>IEEE (The Institute of Electrical and Electronics Engineers)</publisher-name>. <pub-id pub-id-type="doi">10.1109/tcyb.2021.3051606</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ong</surname>
<given-names>Y. S.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2021a</year>). <source>Learning Conjoint Attentions for Graph Neural Nets</source>. <publisher-loc>San Diego, CA</publisher-loc>: <publisher-name>NIPS; The Neural Information Processing Systems (NIPS) Foundation</publisher-name>. </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heikkil&#xe4;</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Rahtu</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Ojansivu</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Local Phase Quantization for Blur Insensitive Texture Description</article-title>. <source>Stud. Comput. Intelligence</source> <volume>506</volume>, <fpage>49</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-39289-4_3</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hultschig</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Kreutzberger</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Seitz</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Konthur</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Bussow</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lehrach</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Recent Advances of Protein Microarrays</article-title>. <source>Curr. Opin. Chem. Biol.</source> <volume>10</volume> (<issue>1</issue>), <fpage>4</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1016/j.cbpa.2005.12.011</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jabid</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kabir</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Chae</surname>
<given-names>O.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Gender Classification Using Local Directional Pattern (LDP)</article-title>.in&#x201d; <conf-name>20th International Conference on Pattern Recognition, ICPR 2010</conf-name>, <volume>23-26</volume>. <pub-id pub-id-type="doi">10.1109/icpr.2010.373</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jolliffe</surname>
<given-names>I. T.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Principal Component Analysis</article-title>. <source>J.&#x20;Marketing Res.</source> <volume>87</volume> (<issue>4</issue>), <fpage>513</fpage>. </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kerrien</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Aranda</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Breuza</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Bridge</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Broackes-Carter</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>The IntAct Molecular Interaction Database in 2012</article-title>. <source>Nucleic Acids Res.</source> <volume>40</volume> (<issue>Database issue</issue>), <fpage>D841</fpage>&#x2013;<lpage>D846</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1088</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Lenz</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sinn</surname>
<given-names>L. R.</given-names>
</name>
<name>
<surname>O&#x27;Reilly</surname>
<given-names>F. J.</given-names>
</name>
<name>
<surname>Fischer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rappsilber</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <source>Reliable Identification of Protein-Protein Interactions by Crosslinking Mass Spectrometry</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Nature Publishing Group</publisher-name>. </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>H.-L.</given-names>
</name>
<name>
<surname>Pang</surname>
<given-names>Y.-H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
</person-group>: <article-title>BioSeq-BLM: a Platform for Analyzing DNA, RNA and Protein Sequences Based on Biological Language Models</article-title>. <year>2021</year>, <volume>49</volume>(<issue>22</issue>):<fpage>e129</fpage>.<pub-id pub-id-type="doi">10.1093/nar/gkab829</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>BioSeq-Analysis2.0: an Updated Platform for Analyzing DNA, RNA and Protein Sequences at Sequence Level and Residue Level Based on Machine Learning Approaches</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>e127</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz740</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Masashi</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Sugiyama</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Id&#xe9;</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>NakajimaJun</surname>
</name>
</person-group> (<year>2010</year>). <article-title>Semi-supervised Local Fisher Discriminant Analysis for Dimensionality Reduction</article-title>. <source>Mach Learn</source>. </citation>
</ref>
<ref id="B32">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Min</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2010</year>). &#x201c;<article-title>A Database of Protein-Protein Interactions in Plants</article-title>,&#x201d; in <conf-name>International Conference on Bioinformatics &#x26; Biomedical Engineering</conf-name>, <conf-loc>Wuhan, China</conf-loc>, <conf-date>May 10&#x2013;12, 2011</conf-date>. </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morsy</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gouthu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>OrchardHarper</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Thorneycroft</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Harper</surname>
<given-names>J.&#x20;F.</given-names>
</name>
<name>
<surname>Mittler</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Charting Plant Interactomes: Possibilities and Challenges</article-title>. <source>Trends Plant Sci.</source> <volume>13</volume> (<issue>4</issue>), <fpage>183</fpage>&#x2013;<lpage>191</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2008.01.006</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Ojala</surname>
<given-names>T.</given-names>
</name>
</person-group> <person-group person-group-type="author">
<name>
<surname>Pietikainen</surname>
<given-names>M.</given-names>
</name>
</person-group>, and <person-group person-group-type="author">
<name>
<surname>Harwood</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1994</year>). <source>Performance Evaluation of Texture Measures with Classification Based on Kullback Discrimination of Distributions</source>.<publisher-name>IEEE</publisher-name> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ojansivu</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Rahtu</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Heikkila&#xa8;</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Rotation Invariant Local Phase Quantization for Blur Insensitive Texture Analysis</article-title>.&#x201d;in <conf-name>19th International Conference on Pattern Recognition</conf-name>. <pub-id pub-id-type="doi">10.1109/icpr.2008.4761377</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L-P.</given-names>
</name>
<name>
<surname>You</surname>
<given-names>Z-H.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>C-Q.</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>Z-H.</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>Y-J.</given-names>
</name>
</person-group> (<year>2021a</year>). <article-title>Prediction of Protein&#x2013;Protein Interactions in Arabidopsis, Maize, and Rice by Combining Deep Neural Network with Discrete Hilbert Transform</article-title>. <source>Front. Genet.</source> <volume>2021</volume> (<issue>1678</issue>), <fpage>12</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2021.745228</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L-P.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>C-Q.</given-names>
</name>
<name>
<surname>You</surname>
<given-names>Z-H.</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>Z-H.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J-Y.</given-names>
</name>
<etal/>
</person-group> (<year>2021b</year>). <article-title>A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method</article-title>. <source>Scientific Programming</source> <volume>2021</volume>, <fpage>1607946</fpage>. <pub-id pub-id-type="doi">10.1155/2021/1607946</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Prasad</surname>
<given-names>P. S.</given-names>
</name>
</person-group> (<year>2001</year>). <source>Independent Component Analysis</source>. <publisher-name>Cambridge University Press</publisher-name>. </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Puig</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Caspary</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Rigaut</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Rutz</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bouveret</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Bragado-Nilsson</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2001</year>). <article-title>The Tandem Affinity Purification (TAP) Method: A General Procedure of Protein Complex Purification</article-title>. <source>Methods</source> <volume>24</volume> (<issue>3</issue>), <fpage>218</fpage>&#x2013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1006/meth.2001.1183</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rodriguez</surname>
<given-names>J.&#x20;J.</given-names>
</name>
<name>
<surname>Kuncheva</surname>
<given-names>L. I.</given-names>
</name>
<name>
<surname>Alonso</surname>
<given-names>C. J.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Rotation forest: A New Classifier Ensemble Method</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>28</volume> (<issue>10</issue>), <fpage>1619</fpage>&#x2013;<lpage>1630</lpage>. <pub-id pub-id-type="doi">10.1109/tpami.2006.211</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rose</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Chris</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bobby-Joe</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jennifer</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Lorrie</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Christie</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>The BioGRID Interaction Database: 2019 Update</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D529</fpage>&#x2013;<lpage>D541</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1079</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sambourg</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Thierry-Mieg</surname>
<given-names>N.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>New Insights into Protein-Protein Interaction Data lead to Increased Estimates of the <italic>S. cerevisiae</italic> Interactome Size</article-title>. <source>Bmc Bioinformatics</source> <volume>11</volume> (<issue>1</issue>), <fpage>605</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-11-605</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sheth</surname>
<given-names>B. P.</given-names>
</name>
<name>
<surname>Thaker</surname>
<given-names>V. S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Plant Systems Biology: Insights, Advances and Challenges</article-title>. <source>Planta: Int. J.&#x20;Plant Biol.</source> <pub-id pub-id-type="doi">10.1007/s00425-014-2059-5</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tian</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>You</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Yi</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Du</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>agriGO v2.0: a GO Analysis Toolkit for the Agricultural Community, 2017 Update</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume> (<issue>W1</issue>), <fpage>W122</fpage>&#x2013;<lpage>W129</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx382</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wong</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>You</surname>
<given-names>Z. H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>Y. A.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2015</year>). &#x201c;<article-title>Detection of Protein-Protein Interactions from Amino Acid Sequences Using a Rotation Forest Model with a Novel PR-LPQ Descriptor</article-title>,&#x201d; in <conf-name>International Conference on Intelligent Computing</conf-name>. <pub-id pub-id-type="doi">10.1007/978-3-319-22053-6_75</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiaoli</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Xiucai</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Pu-Feng</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ran</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Leyi</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>CPPred-FL: a Sequence-Based Predictor for Large-Scale Identification of Cell-Penetrating Peptides by Feature Representation Learning</article-title>. <source>Brief. Bioinformatics</source>. </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yi</surname>
<given-names>H. C.</given-names>
</name>
<name>
<surname>You</surname>
<given-names>Z. H.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>T. H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L. P.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information</article-title>. <source>Mol. Ther. Nucleic Acids</source> <volume>11</volume> (<issue>C</issue>), <fpage>337</fpage>&#x2013;<lpage>344</lpage>. <pub-id pub-id-type="doi">10.1016/j.omtn.2018.03.001</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yon</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>William</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Berardini</surname>
<given-names>T. Z.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>David</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Aisling</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2003</year>). <article-title>The Arabidopsis Information Resource (TAIR): a Model Organism Database Providing a Centralized, Curated Gateway to Arabidopsis Biology, Research Materials and Community</article-title>. <source>Nucleic Acids Res.</source> <volume>31</volume> (<issue>1</issue>), <fpage>224</fpage> </citation>
</ref>
<ref id="B49">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>You</surname>
<given-names>Z-H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Chan</surname>
<given-names>K. C.</given-names>
</name>
</person-group> (<year>2016b</year>). <source>An Improved Sequence-Based Prediction Protocol for Protein-Protein Interactions Using Amino Acids Substitution Matrix and Rotation forest Ensemble Classifiers</source>. <publisher-name>Neurocomputing.</publisher-name> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>You</surname>
<given-names>Z. H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Large-Scale Protein-Protein Interactions Detection by Integrating Big Biosensing Data with Computational Model</article-title>. <source>Biomed. Res. Int.</source> <volume>2014</volume>, <fpage>598129</fpage>. <pub-id pub-id-type="doi">10.1155/2014/598129</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>You</surname>
<given-names>Z. H.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2016a</year>). <source>Highly Efficient Framework for Predicting Interactions between Proteins</source>. <publisher-name>IEEE T Cybern.</publisher-name> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan</surname>
<given-names>J.&#x20;S.</given-names>
</name>
<name>
<surname>Galbraith</surname>
<given-names>D. W.</given-names>
</name>
<name>
<surname>Dai</surname>
<given-names>S. Y.</given-names>
</name>
<name>
<surname>Griffin</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Stewart</surname>
<given-names>C. N.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Plant Systems Biology Comes of Age</article-title>. <source>Trends Plant Sci.</source> <volume>13</volume> (<issue>4</issue>), <fpage>165</fpage>&#x2013;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1016/j.tplants.2008.02.003</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yuan-Ke</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zi-Ang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tao</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Pu-Feng</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Predicting lncRNA-Protein Interactions with miRNAs as Mediators in a Heterogeneous Network Model</article-title>. <source>Front. Genet.</source> <volume>10</volume>, <fpage>1341</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2019.01341</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Rui</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X. M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>PPIM: A Protein-Protein Interaction Database for Maize</article-title>.in&#x201d; <conf-name>13th IEEE Conference on Automation Science and Engineering (IEEE CASE 2017), sponsored by the IEEE Robotics and Automation Society (RAS)</conf-name>, <conf-loc>Xi&#x0027;an, China</conf-loc>, <conf-date>20&#x2013;23 Auguest 2017</conf-date>. <pub-id pub-id-type="doi">10.1109/coase.2017.8256085</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Gu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Jiao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Computational Identification of Protein-Protein Interactions in Rice Based on the Predicted Rice Interactome Network</article-title>. <source>Genomics Proteomics Bioinformatics</source> <volume>9</volume> (<issue>4</issue>), <fpage>128</fpage>&#x2013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1016/S1672-0229(11)60016-8</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>