<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2021.680117</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Application of Machine Learning for Drug&#x2013;Target Interaction Prediction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Xu</surname> <given-names>Lei</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/561110/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ru</surname> <given-names>Xiaoqing</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/684750/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Song</surname> <given-names>Rong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>School of Electronic and Communication Engineering, Shenzhen Polytechnic</institution>, <addr-line>Shenzhen</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Computer Science, University of Tsukuba</institution>, <addr-line>Tsukuba</addr-line>, <country>Japan</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Quan Zou, University of Electronic Science and Technology of China, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Ying Hong Li, Chongqing University of Posts and Telecommunications, China; Changli Feng, Taishan University, China</p></fn>
<corresp id="c001">&#x002A;Correspondence: Rong Song, <email>sr1@szpt.edu.cn</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>06</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>680117</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>03</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>05</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Xu, Ru and Song.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Xu, Ru and Song</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Exploring drug&#x2013;target interactions by biomedical experiments requires a lot of human, financial, and material resources. To save time and cost to meet the needs of the present generation, machine learning methods have been introduced into the prediction of drug&#x2013;target interactions. The large amount of available drug and target data in existing databases, the evolving and innovative computer technologies, and the inherent characteristics of various types of machine learning have made machine learning techniques the mainstream method for drug&#x2013;target interaction prediction research. In this review, details of the specific applications of machine learning in drug&#x2013;target interaction prediction are summarized, the characteristics of each algorithm are analyzed, and the issues that need to be further addressed and explored for future research are discussed. The aim of this review is to provide a sound basis for the construction of high-performance models.</p>
</abstract>
<kwd-group>
<kwd>machine learning</kwd>
<kwd>drug&#x2013;target interactions</kwd>
<kwd>data</kwd>
<kwd>features</kwd>
<kwd>task algorithms</kwd>
<kwd>drug development</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="98"/>
<page-count count="8"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Tens of thousands of known diseases threatening human health, and new ones are being added every year. They include emerging diseases (e.g., the currently prevalent COVID-19) and diseases that have plagued the public for many years and have no cure so far (e.g., Parkinson&#x2019;s disease and Alzheimer&#x2019;s disease) (<xref ref-type="bibr" rid="B75">Xu et al., 2018a</xref>, <xref ref-type="bibr" rid="B77">2019</xref>). Rapidly and accurately discovering drugs that can effectively treat diseases is very important for the development of society. Long cycle and high cost are common phenomena in current drug development, but these fail to guarantee a high success rate. Many steps are required from drug development to final marketing, including drug discovery, preclinical and clinical trials, and marketing approval (<xref ref-type="bibr" rid="B54">Srivastava et al., 2019</xref>; <xref ref-type="bibr" rid="B35">Li Z. et al., 2020</xref>). The overall success rate of drug discovery and preclinical studies, which are part of the laboratory development phase, is approximately 0.05&#x2013;0.1%, and less than 1% of the candidate compounds are likely to have the expected effect and proceed to the clinical trial phase. Investigating drug&#x2013;target interactions is an important step in the drug discovery process and can improve the success rate of new drug discovery (<xref ref-type="bibr" rid="B3">Chen et al., 2019</xref>; <xref ref-type="bibr" rid="B22">Huang et al., 2020</xref>; <xref ref-type="bibr" rid="B83">Zeng et al., 2020b</xref>). These not only signal the need to expend significant resources to find and test candidate compounds one by one during the drug development phase to confirm that they meet expectations, but also demonstrate the importance of drug&#x2013;target interaction prediction in the overall drug development process. Supplementally, an obvious drawback of biomedical experiment is that it does not allow for rapidly finding and solving problems, which can be detrimental to the treatment of emerging and highly infectious diseases. Therefore, machine learning methods have been introduced into the prediction of drug&#x2013;target interactions.</p>
<p>Machine learning, a computer technology for data analysis designed to build predictive models using datasets, has become an important means of modern biological research (<xref ref-type="bibr" rid="B76">Xu et al., 2018b</xref>; <xref ref-type="bibr" rid="B80">Yang et al., 2018</xref>; <xref ref-type="bibr" rid="B38">Liu et al., 2019</xref>, <xref ref-type="bibr" rid="B42">2020</xref>; <xref ref-type="bibr" rid="B59">Tang et al., 2020</xref>; <xref ref-type="bibr" rid="B82">Zeng et al., 2020a</xref>). It has become a mainstream technique for analyzing and solving problems involved in drug&#x2013;target interaction prediction studies (<xref ref-type="bibr" rid="B2">Cai et al., 2018</xref>; <xref ref-type="bibr" rid="B55">Stephenson et al., 2019</xref>; <xref ref-type="bibr" rid="B85">Zeng et al., 2019</xref>; <xref ref-type="bibr" rid="B15">Fu et al., 2020</xref>; <xref ref-type="bibr" rid="B64">Wang J. et al., 2020</xref>).</p>
</sec>
<sec id="S2">
<title>Three Factors</title>
<p>The existing data background, powerful toolkits, and current status and requirements have promoted machine learning to become the mainstream method of drug&#x2013;target interaction prediction.</p>
<p>(1) Existing databases. With the emergence of sequencing technology, high-throughput technology and computer-aided drug design method, a large number of proteins have been sequenced and many compounds have been synthesized. On the basis of existing related works and accumulated experience, relevant data has been organized and various databases have been constructed. Most of the data in these databases are publicly available and free to download, which provides a good data foundation for solving drug&#x2013;target interaction prediction problems by machine learning. Researchers can collect datasets from databases that cover different information according to their needs (<xref ref-type="bibr" rid="B93">Zheng et al., 2019</xref>, <xref ref-type="bibr" rid="B94">2020</xref>). Some representative databases are briefly described here.</p>
<p>UniProt database<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> : UniProt is supported by many institutions, and is the most informative and comprehensive protein database (<xref ref-type="bibr" rid="B9">Consortium, 2015</xref>). It consists of five sub-databases: Swiss-Prot, TrEMBL, UniRef, UniParc, and Proteomes. Each sub-database has its own unique function. For example, Swiss-Prot is a high-quality, manually annotated, non-redundant database, in which protein annotations are derived mainly from the literature or E-value verification calculation analysis results. Proteomes is a database that provides proteomic information for species with fully sequenced genomes.</p>
<p>PubChem database<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> : PubChem is an open chemistry database that collects information including chemical structures, identifiers, physicochemical properties, and biological activities of chemical molecules (<xref ref-type="bibr" rid="B27">Kim et al., 2016</xref>, <xref ref-type="bibr" rid="B26">2021</xref>). It is the world&#x2019;s largest database with free access to chemical information, and currently covers 109 million compounds. PubChem has become an important chemical information resource for scientists, students, and the public.</p>
<p>DrugBank database<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> : As a bioinformatics and cheminformatics resource, DrugBank combines detailed drug data (i.e., chemical, pharmacological, and pharmaceutical) with comprehensive target information (i.e., sequence, structure, and pathway) (<xref ref-type="bibr" rid="B72">Wishart et al., 2018</xref>). The latest DrugBank release (version 5.1.8.) contains 14,443 drug molecules and 5,244 non-redundant protein sequences associated with these drugs. The database describes not only clinical information on drugs, namely drug side effects and drug&#x2013;drug interactions, but also contains molecular-level data, such as chemical structures of drugs and proteins targeted by drugs (<xref ref-type="bibr" rid="B73">Wishart et al., 2008</xref>). One significant function of DrugBank is that it supports comprehensive and complex searches, so it is used widely by the pharmaceutical industry, medicinal chemists, pharmacists, physicians, students, and the general public.</p>
<p>KEGG database<sup><xref ref-type="fn" rid="footnote4">4</xref></sup> : KEGG was established in 1995 by the Kanehisa Laboratories at the Bioinformatics Center, Kyoto University, Japan, and is now one of the most commonly used international bioinformatics databases (<xref ref-type="bibr" rid="B25">Kanehisa and Goto, 2000</xref>). KEGG is a database used to understand the high-level functions and practicability of biological systems from molecular-level information (<xref ref-type="bibr" rid="B32">Li H. et al., 2020</xref>; <xref ref-type="bibr" rid="B62">Wang et al., 2021a</xref>) (especially large-scale molecular datasets generated by genome sequencing and other high-throughput techniques), of which the data information can be roughly classified into four major categories: system information, genetic information, chemical information, and medical information.</p>
<p>BindingDB database<sup><xref ref-type="fn" rid="footnote5">5</xref></sup> : BindingDB is a publicly available, web-accessible database for measuring binding affinity, focusing on the interactions between proteins considered to be drug targets and drug-like small molecules (<xref ref-type="bibr" rid="B41">Liu et al., 2007</xref>). BindingDB currently contains 2,114,159 binding data between 8,202 protein targets and 928,022 small molecules.</p>
<p>(2) Powerful toolkits and web servers. Bioinformatics and cheminformatics are emerging interdisciplinary fields that use computers to solve biological and chemical problems. Many toolkits and web servers have been developed (<xref ref-type="bibr" rid="B98">Zuo et al., 2017</xref>; <xref ref-type="bibr" rid="B96">Zou et al., 2019</xref>; <xref ref-type="bibr" rid="B36">Lin et al., 2020</xref>; <xref ref-type="bibr" rid="B47">Pang and Liu, 2020</xref>; <xref ref-type="bibr" rid="B51">Shao et al., 2021</xref>), which can help to solve problems in drug&#x2013;target interaction prediction.</p>
<p>STITCH<sup><xref ref-type="fn" rid="footnote6">6</xref></sup> : STITCH not only includes experimentally validated drug&#x2013;target interaction data, but also integrates predicted drug&#x2013;target relationships (<xref ref-type="bibr" rid="B29">Kuhn et al., 2007</xref>). This website can clearly depict the protein&#x2013;protein interactions, protein&#x2013;compound interactions, and the strength of the interactions.</p>
<p>SwissTargetPrediction<sup><xref ref-type="fn" rid="footnote7">7</xref></sup> : SwissTargetPrediction can estimate the most likely macromolecule to be targeted by a biologically active small molecule and count the percentage of each target type targeted by the small molecule (<xref ref-type="bibr" rid="B16">Gfeller et al., 2014</xref>).</p>
<p>RDkit<sup><xref ref-type="fn" rid="footnote8">8</xref></sup> : RDkit is a powerful python toolkit for chemical information, which has functions such as acquiring molecule information from multiple formats, obtaining information about atoms, bonds, and rings in molecules, generating molecular descriptors and molecular fingerprints of compounds, and calculating similarities of compound structures (<xref ref-type="bibr" rid="B30">Landrum, 2013</xref>).</p>
<p>OpenChem<sup><xref ref-type="fn" rid="footnote9">9</xref></sup> : OpenChem is a pytorch-based deep learning toolkit for computational chemistry and drug design, which contains Feature2Label, Smiles2Label, Graph2Label, SiameseModel, GenerativeRNN, and MolecularRNN (<xref ref-type="bibr" rid="B28">Korshunova et al., 2021</xref>). Users can train predictive models for classification, regression, and multi-task problems, and develop generative models for generating novel molecules with optimized properties. Its goal is to make deep learning an easy-to-use tool for researchers in computational chemistry and drug design.</p>
<p>iFeature<sup><xref ref-type="fn" rid="footnote10">10</xref></sup> : iFeature is a python toolkit that can compute various structural and physicochemical property descriptors from protein and peptide sequences. iFeature can compute and extract comprehensive spectra for 18 major sequence coding schemes, including 53 different types of feature descriptors. In addition, iFeature integrates 12 different types of commonly used feature clustering, selection, and dimensionality reduction algorithms (<xref ref-type="bibr" rid="B4">Chen et al., 2018</xref>).</p>
<p>Pse-in-one<sup><xref ref-type="fn" rid="footnote11">11</xref></sup> : Pse-in-one is a python toolkit that generates all possible pseudo-components of DNA, RNA, and protein sequences. It covers a total of 28 different patterns, 14 for DNA sequences, 6 for RNA sequences, and 8 for protein sequences (<xref ref-type="bibr" rid="B39">Liu et al., 2015</xref>, <xref ref-type="bibr" rid="B40">2017</xref>). This toolkit is widely and increasingly used by researchers to tackle various problems in computational biology, and a more specific and detailed version BioSeq-Analysis (<xref ref-type="bibr" rid="B37">Liu, 2019</xref>) has recently been released.</p>
<p>(3) Current status and requirements. With the development of high-throughput technologies, many compounds and proteins have been mined. The human genome contains more than 20,000 genes, and approximately 80% of them can encode one or more proteins. Only a small number of proteins have been identified as pharmacologically active and are targets for currently approved drugs. The pharmacological functions of most proteins remain to be demonstrated. This is also true for most compounds. For example, there are currently 111 million compounds in the PubChem database, but proteins that could interact with many of these compounds are unknown. In addition, it is obvious that the traditional approach of wet experiments is not feasible for some emerging, highly infectious and destructive new pathogens, such as the SARS, H7N9, Ebola, Mers, and COVID-19 viruses (<xref ref-type="bibr" rid="B7">Cheng et al., 2021</xref>). Considering the huge amounts of available data and large numbers of diseases that cause serious social health risks, using computational chemistry-related theories and computer simulation methods to computationally predict drug&#x2013;target interaction can effectively improve efficiency. Machine learning-based methods have become effective ways to compensate for the shortcomings of traditional biochemical experimental methods.</p>
</sec>
<sec id="S3">
<title>Applications</title>
<p>The current drug&#x2013;target interaction prediction procedures are shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. Existing studies on drug&#x2013;target interaction prediction have shown that using different calculation or optimization methods in the steps of data set acquisition, feature extraction and processing, and task algorithm selection can build models with good performance.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Steps for predicting drug-target interactions. The two- and three-dimensional structure diagrams of the drug are from PubChem.</p></caption>
<graphic xlink:href="fgene-12-680117-g001.tif"/>
</fig>
<p>(1) Dataset acquisition. Redundant data, unbalanced categories, and unrepresentative samples can lead to long experimental cycles, as well as inaccurate and biased experimental results. Different data acquisition methods have been used to avoid or reduce the impact of these problems on model construction. For example, <xref ref-type="bibr" rid="B66">Wang et al. (2010)</xref> collected negative examples by random selection to solve the data imbalance problem. <xref ref-type="bibr" rid="B65">Wang et al. (2018)</xref> also used random selection to extract negative examples, and this operation was performed five times to reduce the impact of the unverified negative samples. Pdti-EssB (<xref ref-type="bibr" rid="B44">Mahmud et al., 2020</xref>) used random under-sampling and under-sampling clustering to address the data imbalance problem.</p>
<p>Currently, most target molecules are proteins, of which four protein families [kinases, G protein-coupled receptors (GPCRs), ion channels, and nuclear receptors] account for 44% of the target molecules, and 70% of the currently developed drugs are targeted to these four protein families. Datasets established by <xref ref-type="bibr" rid="B78">Yamanishi et al. (2008)</xref>, which contain the interactions between these four proteins and drugs, have been widely used (<xref ref-type="bibr" rid="B46">&#x00D6;zt&#x00FC;rk et al., 2018</xref>; <xref ref-type="bibr" rid="B44">Mahmud et al., 2020</xref>). The relevant data can be downloaded from <ext-link ext-link-type="uri" xlink:href="http://web.kuicr.kyoto-u.ac.jp/supp/yoshi/drugtarget/">http://web.kuicr.kyoto-u.ac.jp/supp/yoshi/drugtarget/</ext-link>. Most of the computational approaches based on these datasets have focused on binary classification, that is, they only explore whether a drug can interact with a particular protein. To further accelerate process and reduce cost, drug&#x2013;target affinity has been explored in some studies. Drug&#x2013;target affinity is a key property that determines the strength of the interaction between the small molecule drug and the target. The commonly used datasets for predicting drug&#x2013;target affinity are the Kinase (<xref ref-type="bibr" rid="B10">Davis et al., 2011</xref>) and KIBA (<xref ref-type="bibr" rid="B58">Tang et al., 2014</xref>) datasets.</p>
<p>(2) Feature extraction and processing. Accurate and comprehensive descriptions of the biological or chemical functional information of drugs and targets in numerical form play an important role in the construction of high-performance models. Feature extraction of drugs and targets can be performed from different perspectives (<xref ref-type="bibr" rid="B5">Cheng, 2019</xref>; <xref ref-type="bibr" rid="B91">Zhao T. et al., 2020</xref>). For example, iGPCR-Drug (<xref ref-type="bibr" rid="B74">Xiao et al., 2013</xref>) obtains drug features by discrete Fourier transform of drug molecular fingerprints and extracts GPCR features according to pseudo amino acid compositions. DrugE-Rank (<xref ref-type="bibr" rid="B81">Yuan et al., 2016</xref>) represents drug features according to general descriptors and extracts target features according to amino acid composition, transformation, and distribution. TargetGDrug (<xref ref-type="bibr" rid="B20">Hu J. et al., 2016</xref>) extracts drug features by applying wavelet transform to drug molecular fingerprints and extracts GPCR features according to evolutionary information. <xref ref-type="bibr" rid="B49">Ru et al. (2020)</xref> extracted protein features using the distance-based top-n-gram algorithm and obtained drug features according to general descriptors. Chemical databases store information in a textual representation and the simplified molecular input line entry specification (SMILES) format is a common standard used in many cheminformatics software. Each SMILES string encodes structural information that can be used to predict complex chemical properties, and a large number of machine learning models can extract molecular features of compounds according to SMILES strings. Recently, convolutional neural networks (CNNs) and recurrent neural networks have been used for molecular feature extraction. <xref ref-type="bibr" rid="B19">Hirohara et al. (2018)</xref> transformed SMILES strings into two-dimensional matrices and used CNNs to extract molecular features. <xref ref-type="bibr" rid="B17">Goh et al. (2017)</xref> applied natural language processing to SMILES feature extraction and used recurrent neural networks for molecular strings.</p>
<p>The presence of invalid or redundant features not only reduces the accuracy of the experiment result but also lengthens the experimental period. Low-dimensional and comprehensive information feature sets are expected. Therefore, a variety of methods for processing features have been applied to related rearch (<xref ref-type="bibr" rid="B95">Zou et al., 2016a</xref>, <xref ref-type="bibr" rid="B97">b</xref>; <xref ref-type="bibr" rid="B18">Guo et al., 2020</xref>; <xref ref-type="bibr" rid="B88">Zhang G. et al., 2020</xref>; <xref ref-type="bibr" rid="B92">Zhao X. et al., 2020</xref>). For example, to reduce the noise between features, <xref ref-type="bibr" rid="B34">Li et al. (2017)</xref> used principal component analysis (PCA) to reduce the dimensionality of drugs and targets features. <xref ref-type="bibr" rid="B57">Tabei et al. (2012)</xref> combined 881 substructures of drugs and 876 Pfam domain structures of targets by tensor product to form feature vectors of drug&#x2013;target pairs. MFDR (<xref ref-type="bibr" rid="B21">Hu P.-W. et al., 2016</xref>) used autoencoders as the building blocks of a deep network to reconstruct drug and protein features into a low-dimensional new representation. DeepConv-DT (<xref ref-type="bibr" rid="B31">Lee et al., 2019</xref>) used CNNs on raw protein sequences to capture local amino acid residue information by convolving amino acid subsequences of various lengths.</p>
<p>(3) Selection of task algorithms. Several task algorithms have been used for drug&#x2013;target interaction prediction, such as classification algorithms, learning to rank algorithms, and deep learning algorithms (<xref ref-type="bibr" rid="B8">Cheng et al., 2019</xref>; <xref ref-type="bibr" rid="B43">Lv et al., 2019</xref>; <xref ref-type="bibr" rid="B60">Tao et al., 2020</xref>; <xref ref-type="bibr" rid="B90">Zhang Y. et al., 2020</xref>).</p>
<p>Most of the existing studies treat drug&#x2013;target interaction prediction as binary tasks, and different classification algorithms have been applied. For example, <xref ref-type="bibr" rid="B1">Bleakley and Yamanishi (2009)</xref> proposed a bipartite local model (BLM) based on a support vector machine (SVM) kernel to predict drug&#x2013;target relationships. LRF-DTI (<xref ref-type="bibr" rid="B53">Shi et al., 2019</xref>) is a drug&#x2013;target interaction prediction method using Lasso for feature extraction and random forest for classification. <xref ref-type="bibr" rid="B79">Yamanishi et al. (2010)</xref> used a distance learning algorithm as a classifier. Pred-binding (<xref ref-type="bibr" rid="B52">Shar et al., 2016</xref>) extracted molecular structure and protein sequence features, and used support vector machines and random forests to classify whether drugs and targets can be docked.</p>
<p>Drug&#x2013;target interaction prediction can be regarded as a ranking task. Exploring the strength of drug&#x2013;target interactions can shorten the drug development process and save expenses. <xref ref-type="bibr" rid="B89">Zhang et al. (2015)</xref> applied six learning to rank algorithms (Prank, RankNet, RankBoost, SVMRank, AdaRank, and ListNet) to virtual screening of drugs, their study showed that learning to rank is an effective computational strategy, especially because of its novel use in cross-target virtual screening and heterogeneous data integration. DrugE-Rank (<xref ref-type="bibr" rid="B81">Yuan et al., 2016</xref>) used protein amino acid composition, transformation and distribution information, compound descriptor information, and output information of six classifiers as features to be input into the learning to ranking algorithm to improve the performance of drug-target interaction prediction.</p>
<p>Neural networks have also been used to solve related problems in the prediction of drug&#x2013;target interactions. <xref ref-type="bibr" rid="B48">Prado-Prado et al. (2011)</xref> used the entropy information of drug&#x2013;protein complexes and neural networks to predict drug&#x2013;target affinity values. DeepDTA (<xref ref-type="bibr" rid="B46">&#x00D6;zt&#x00FC;rk et al., 2018</xref>) proposed a deep-learning based model that used only sequence information of both targets and drugs, One novel approach used in this work is the modeling of protein sequences and compound 1D representations with CNNs. GraphDTA (<xref ref-type="bibr" rid="B45">Nguyen et al., 2019</xref>) focused on the fact that molecules are by nature formed by chemical bonding of atoms, and used graph convolutional network to learn drug-target binding affinity.</p>
</sec>
<sec id="S4">
<title>Discussion</title>
<p>Under the background of the existing chemical and biological computing theory, big data and rapid development of computer technology, the use of machine learning for drug-target interaction prediction does have many benefits, but there are still some problems that need to be further explored.</p>
<p>(1) Data heterogeneity. Most of the existing studies are based on publicly available data in databases that collect data with different focuses, and each database has its own criteria for judging the data. Drugs, targets, and related data from different databases often have different terminological descriptions and different organization structures, such inconsistencies make data integration difficult.</p>
<p>(2) Effective representation of biological and chemical features. Feature engineering is a key concern in building machine learning models. There are often technical difficulties in how to effectively extract key features and how to deal with data with high dimensionality. Existing studies have shown that the features of proteins and drugs can be extracted from a variety of angles, and the combination of information from these angles can achieve complementary effects. Most drug&#x2013;target interaction prediction studies only extract relatively one-sided information, and do not comprehensively consider the information from multiple perspectives. In addition, most studies have focused on extracting drug molecule and protein features separately, ignoring the potentially valid association that may exist between drug and target. Moreover, the direct concatenation of biologically unrelated features may lead to a decrease in prediction accuracy.</p>
<p>(3) Characteristics of task algorithms. The classification, ranking, or deep learning methods used in drug&#x2013;target interaction prediction all have their own characteristics. Different computational approaches can be used to solve different problems in drug&#x2013;target interaction prediction, however, these algorithms also have shortcomings. Classification is the simplest and most understandable task. However, there is an obvious and long-standing defect in this task that it is necessary to collect negative samples. Most existing classification studies take experimentally validated drug&#x2013;target pairs with known interactions as positive samples, and unvalidated or unknown drug&#x2013;target pairs as negative examples. Among these negative examples, there may be positive samples that have not been accurately validated, the performance of a model that is based on such a dataset will be biased.</p>
<p>On the basis of the existence of one-to-many or many-to-many relationships between queries and documents, learning to rank can be used in multi-target drug discovery. Early drug development followed the &#x201C;one drug, one target&#x201D; principle, with the aim of finding high-affinity, high-selective drugs for a specific receptor associated with a particular disease. However, the number of complex diseases is increasing and the proteins associated with these diseases are not limited to one, therefore drug combinations are used to achieve the optimal therapeutic effect. Clinical pharmacology studies have shown that drug combinations greatly increase the incidence of adverse drug reactions, but because of the lack of multi-target drugs, such risks have to be taken. Multi-target drugs are undoubtedly an important area for future research. Therefore, using the characteristics of learning to rank to tackle the multi-target problem of drugs deserves to be explored further. Learning to rank was originally applied for information retrieval. Its output is a relative score of correlation between queries and documents (<xref ref-type="bibr" rid="B6">Cheng, 2020</xref>; <xref ref-type="bibr" rid="B50">Ru et al., 2021</xref>). This is not sufficient for studies that require accurate prediction of drug&#x2013;target affinities.</p>
<p>The use of neural networks for predicting accurate drug&#x2013;target affinity values has shown great potential in this research area. Neural networks can fuse drug and target features, which have changed the current situation of simple concatenation or tensor products of drug and target features. Deep learning contains more neural network structures with multiple implicit layers compared with traditional machine learning, which allows deep learning to handle large datasets and identify complex patterns from the learning process. But for the same reason, neural networks require much more execution time than classification or ranking algorithms. It will lead to overfitting when the drug and target feature dimensions are high.</p>
<p>Although existing machine learning methods have opened a new area in drug&#x2013;target interaction prediction, they have not achieved satisfactory results so far. Therefore, there is still a need to develop new theoretical and computational methods for drug&#x2013;target interaction prediction.</p>
</sec>
<sec id="S5">
<title>Conclusion</title>
<p>Drug&#x2013;target interaction prediction can help to screen out unsuitable compounds and is an important step in the development of new drugs. In this review, we describe the importance of drug&#x2013;target interaction prediction, analyze in detail the three main reasons why machine learning has become a mainstream technique, summarize the specific applications of machine learning methods in each step of building machine learning models, analyze the shortcomings of existing research methods, and discuss several aspects that can be further explored (<xref ref-type="bibr" rid="B68">Wei et al., 2014</xref>, <xref ref-type="bibr" rid="B69">2017a</xref>, <xref ref-type="bibr" rid="B71">2017b</xref>, <xref ref-type="bibr" rid="B67">2018</xref>, <xref ref-type="bibr" rid="B70">2019</xref>; <xref ref-type="bibr" rid="B11">Ding et al., 2017</xref>, <xref ref-type="bibr" rid="B12">2019</xref>, <xref ref-type="bibr" rid="B13">2020a</xref>, <xref ref-type="bibr" rid="B14">2020b</xref>; <xref ref-type="bibr" rid="B23">Jin Q. et al., 2019</xref>; <xref ref-type="bibr" rid="B24">Jin S. et al., 2019</xref>; <xref ref-type="bibr" rid="B33">Li J. et al., 2020</xref>; <xref ref-type="bibr" rid="B56">Su et al., 2020</xref>; <xref ref-type="bibr" rid="B61">Wang H. et al., 2020</xref>; <xref ref-type="bibr" rid="B84">Zeng et al., 2020c</xref>, <xref ref-type="bibr" rid="B86">d</xref>; <xref ref-type="bibr" rid="B87">Zhai et al., 2020</xref>; <xref ref-type="bibr" rid="B63">Wang et al., 2021b</xref>). This review provides meaningful perspectives for future drug&#x2013;target interaction prediction studies, especially the application of learning to rank to deal with multi-target drug problems.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>XR drafted the manuscript. LX and RS initiated the idea, conceived the whole process, and finalized the manuscript. All authors have read and approved the final manuscript.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor declared a past co-authorship with one of the authors, LX.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This work was supported by the natural science foundation of Guangdong province (grant No. 2018A0303130084) and the Grant of Shenzhen Polytechnic (No. 6021310015K).</p>
</fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bleakley</surname> <given-names>K.</given-names></name> <name><surname>Yamanishi</surname> <given-names>Y.</given-names></name></person-group> (<year>2009</year>). <article-title>Supervised prediction of drug&#x2013;target interactions using bipartite local models.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2397</fpage>&#x2013;<lpage>2403</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp433</pub-id> <pub-id pub-id-type="pmid">19605421</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname> <given-names>J.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>X.</given-names></name></person-group> (<year>2018</year>). <article-title>Identifying &#x201C;many-to-many&#x201D; relationships between gene-expression data and drug-response data via sparse binary matching.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <volume>17</volume> <fpage>165</fpage>&#x2013;<lpage>176</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Peng</surname> <given-names>H.</given-names></name> <name><surname>Han</surname> <given-names>G.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name> <name><surname>Cai</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>HOGMMNC: a higher order graph matching with multiple network constraints model for gene&#x2013;drug regulatory modules identification.</article-title> <source><italic>Bioinformatics</italic></source> <volume>35</volume> <fpage>602</fpage>&#x2013;<lpage>610</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty662</pub-id> <pub-id pub-id-type="pmid">30052773</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>F.</given-names></name> <name><surname>Leier</surname> <given-names>A.</given-names></name> <name><surname>Marquez-Lago</surname> <given-names>T. T.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>iFeature: a python package and web server for features extraction and selection from protein and peptide sequences.</article-title> <source><italic>Bioinformatics</italic></source> <volume>34</volume> <fpage>2499</fpage>&#x2013;<lpage>2502</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bty140</pub-id> <pub-id pub-id-type="pmid">29528364</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). <article-title>Computational and biological methods for gene therapy.</article-title> <source><italic>Curr. Gene Ther.</italic></source> <volume>19</volume> <fpage>210</fpage>&#x2013;<lpage>210</lpage>. <pub-id pub-id-type="doi">10.2174/156652321904191022113307</pub-id> <pub-id pub-id-type="pmid">31762421</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>Omics Data and Artificial Intelligence: New Challenges for Gene Therapy.</article-title> <source><italic>Curr. Gene Ther.</italic></source> <volume>20</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.2174/156652322001200604150041</pub-id> <pub-id pub-id-type="pmid">32603274</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>L.</given-names></name> <name><surname>Han</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>Z.</given-names></name> <name><surname>Qi</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>Functional alterations caused by mutations reflect evolutionary trends of SARS-CoV-2.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume> <fpage>1442</fpage>&#x2013;<lpage>1450</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbab042</pub-id> <pub-id pub-id-type="pmid">33580783</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>L.</given-names></name> <name><surname>Zhao</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Zhou</surname> <given-names>W.</given-names></name> <name><surname>Luo</surname> <given-names>M.</given-names></name> <name><surname>Li</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Computational Methods for Identifying Similar Diseases.</article-title> <source><italic>Mol. Ther. Nucleic Acids</italic></source> <volume>18</volume> <fpage>590</fpage>&#x2013;<lpage>604</lpage>. <pub-id pub-id-type="doi">10.1016/j.omtn.2019.09.019</pub-id> <pub-id pub-id-type="pmid">31678735</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consortium</surname> <given-names>U.</given-names></name></person-group> (<year>2015</year>). <article-title>UniProt: a hub for protein information.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>D204</fpage>&#x2013;<lpage>D212</lpage>.</citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>M. I.</given-names></name> <name><surname>Hunt</surname> <given-names>J. P.</given-names></name> <name><surname>Herrgard</surname> <given-names>S.</given-names></name> <name><surname>Ciceri</surname> <given-names>P.</given-names></name> <name><surname>Wodicka</surname> <given-names>L. M.</given-names></name> <name><surname>Pallares</surname> <given-names>G.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Comprehensive analysis of kinase inhibitor selectivity.</article-title> <source><italic>Nat. Biotechnol.</italic></source> <volume>29</volume> <fpage>1046</fpage>&#x2013;<lpage>1051</lpage>.</citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>Identification of drug-target interactions via multiple information integration.</article-title> <source><italic>Inform. Sci.</italic></source> <volume>418</volume> <fpage>546</fpage>&#x2013;<lpage>560</lpage>. <pub-id pub-id-type="doi">10.1016/j.ins.2017.08.045</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Identification of drug-side effect association via multiple information integration with centered kernel alignment.</article-title> <source><italic>Neurocomputing</italic></source> <volume>325</volume> <fpage>211</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2018.10.028</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020a</year>). <article-title>Identification of Drug-Target Interactions via Dual Laplacian Regularized Least Squares with Multiple Kernel Fusion.</article-title> <source><italic>Knowl. Based Syst.</italic></source> <volume>204</volume>:<fpage>106254</fpage>. <pub-id pub-id-type="doi">10.1016/j.knosys.2020.106254</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020b</year>). <article-title>Identification of drug-target interactions via fuzzy bipartite local model.</article-title> <source><italic>Neural Comput. Appli.</italic></source> <volume>23</volume> <fpage>10303</fpage>&#x2013;<lpage>10319</lpage>. <pub-id pub-id-type="doi">10.1007/s00521-019-04569-z</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname> <given-names>X.</given-names></name> <name><surname>Cai</surname> <given-names>L.</given-names></name> <name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Zou</surname> <given-names>Q. J. B.</given-names></name></person-group> (<year>2020</year>). <article-title>StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency.</article-title> <source><italic>Bioinformatics</italic></source> <volume>36</volume> <fpage>3028</fpage>&#x2013;<lpage>3034</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa131</pub-id> <pub-id pub-id-type="pmid">32105326</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gfeller</surname> <given-names>D.</given-names></name> <name><surname>Grosdidier</surname> <given-names>A.</given-names></name> <name><surname>Wirth</surname> <given-names>M.</given-names></name> <name><surname>Daina</surname> <given-names>A.</given-names></name> <name><surname>Michielin</surname> <given-names>O.</given-names></name> <name><surname>Zoete</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>SwissTargetPrediction: a web server for target prediction of bioactive small molecules.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>42</volume> <fpage>W32</fpage>&#x2013;<lpage>W38</lpage>.</citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goh</surname> <given-names>G. B.</given-names></name> <name><surname>Hodas</surname> <given-names>N. O.</given-names></name> <name><surname>Siegel</surname> <given-names>C.</given-names></name> <name><surname>Vishnu</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Smiles2vec: An interpretable general-purpose deep neural network for predicting chemical properties.</article-title> <source><italic>arXiv preprint arXiv</italic></source> <fpage>171202034</fpage>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Discrimination of Thermophilic Proteins and Non-thermophilic Proteins Using Feature Dimension Reduction.</article-title> <source><italic>Front. Bioeng. Biotechnol.</italic></source> <volume>8</volume>:<fpage>584807</fpage>. <pub-id pub-id-type="doi">10.3389/fbioe.2020.584807</pub-id> <pub-id pub-id-type="pmid">33195148</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hirohara</surname> <given-names>M.</given-names></name> <name><surname>Saito</surname> <given-names>Y.</given-names></name> <name><surname>Koda</surname> <given-names>Y.</given-names></name> <name><surname>Sato</surname> <given-names>K.</given-names></name> <name><surname>Sakakibara</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Convolutional neural network based on SMILES representation of compounds for detecting chemical motif.</article-title> <source><italic>BMC bioinformatics</italic></source> <volume>19</volume>:<fpage>526</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-018-2523-5</pub-id> <pub-id pub-id-type="pmid">30598075</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Yang</surname> <given-names>J.-Y.</given-names></name> <name><surname>Shen</surname> <given-names>H.-B.</given-names></name> <name><surname>Yu</surname> <given-names>D.-J.</given-names></name></person-group> (<year>2016</year>). <article-title>GPCR&#x2013;drug interactions prediction using random forest with drug-association-matrix-based post-processing procedure.</article-title> <source><italic>Comput. Biol. Chem.</italic></source> <volume>60</volume> <fpage>59</fpage>&#x2013;<lpage>71</lpage>. <pub-id pub-id-type="doi">10.1016/j.compbiolchem.2015.11.007</pub-id> <pub-id pub-id-type="pmid">26674225</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>P.-W.</given-names></name> <name><surname>Chan</surname> <given-names>K. C.</given-names></name> <name><surname>You</surname> <given-names>Z.-H.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Large-scale prediction of drug-target interactions from deep representations</article-title>,&#x201D; in <source><italic>2016 International Joint Conference on Neural Networks (IJCNN</italic></source> (<publisher-loc>Vancouver</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1236</fpage>&#x2013;<lpage>1243</lpage>.</citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>B.</given-names></name> <name><surname>Zhu</surname> <given-names>L.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name></person-group> (<year>2020</year>). <article-title>Evaluation of gene&#x2013;drug common module identification methods using pharmacogenomics data.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume>:<fpage>bbaa087</fpage>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>Q.</given-names></name> <name><surname>Meng</surname> <given-names>Z.</given-names></name> <name><surname>Tuan</surname> <given-names>D. P.</given-names></name> <name><surname>Chen</surname> <given-names>Q.</given-names></name> <name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Su</surname> <given-names>R.</given-names></name></person-group> (<year>2019</year>). <article-title>DUNet: A deformable network for retinal vessel segmentation.</article-title> <source><italic>Knowl. Based Syst.</italic></source> <volume>178</volume> <fpage>149</fpage>&#x2013;<lpage>162</lpage>. <pub-id pub-id-type="doi">10.1016/j.knosys.2019.04.025</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jin</surname> <given-names>S.</given-names></name> <name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Fang</surname> <given-names>J.</given-names></name> <name><surname>Lin</surname> <given-names>J.</given-names></name> <name><surname>Chan</surname> <given-names>S. Y.</given-names></name> <name><surname>Erzurum</surname> <given-names>S. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Cheng FJNsb, applications: A network-based approach to uncover microRNA-mediated disease comorbidities and potential pathobiological implications.</article-title> <source><italic>NPJ Syst. Biol. Appl.</italic></source> <volume>5</volume> <fpage>1</fpage>&#x2013;<lpage>11</lpage>.</citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>KEGG: kyoto encyclopedia of genes and genomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>28</volume> <fpage>27</fpage>&#x2013;<lpage>30</lpage>.</citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Cheng</surname> <given-names>T.</given-names></name> <name><surname>Gindulyte</surname> <given-names>A.</given-names></name> <name><surname>He</surname> <given-names>J.</given-names></name> <name><surname>He</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2021</year>). <article-title>PubChem in 2021: new data content and improved web interfaces.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>49</volume> <fpage>D1388</fpage>&#x2013;<lpage>D1395</lpage>.</citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>S.</given-names></name> <name><surname>Thiessen</surname> <given-names>P. A.</given-names></name> <name><surname>Bolton</surname> <given-names>E. E.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Fu</surname> <given-names>G.</given-names></name> <name><surname>Gindulyte</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>PubChem substance and compound databases.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>D1202</fpage>&#x2013;<lpage>D1213</lpage>.</citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Korshunova</surname> <given-names>M.</given-names></name> <name><surname>Ginsburg</surname> <given-names>B.</given-names></name> <name><surname>Tropsha</surname> <given-names>A.</given-names></name> <name><surname>Isayev</surname> <given-names>O.</given-names></name></person-group> (<year>2021</year>). <article-title>OpenChem: A Deep Learning Toolkit for Computational Chemistry and Drug Design.</article-title> <source><italic>J. Chem. Inform. Model.</italic></source> <volume>61</volume> <fpage>7</fpage>&#x2013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.0c00971</pub-id> <pub-id pub-id-type="pmid">33393291</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhn</surname> <given-names>M.</given-names></name> <name><surname>von Mering</surname> <given-names>C.</given-names></name> <name><surname>Campillos</surname> <given-names>M.</given-names></name> <name><surname>Jensen</surname> <given-names>L. J.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <article-title>STITCH: interaction networks of chemicals and proteins.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>36</volume> <fpage>D684</fpage>&#x2013;<lpage>D688</lpage>.</citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landrum</surname> <given-names>G.</given-names></name></person-group> (<year>2013</year>). <article-title>Rdkit documentation.</article-title> <source><italic>Release</italic></source> <volume>1</volume>:<fpage>4</fpage>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>I.</given-names></name> <name><surname>Keum</surname> <given-names>J.</given-names></name> <name><surname>Nam</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>15</volume>:<fpage>e1007129</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1007129</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Long</surname> <given-names>C.</given-names></name> <name><surname>Xiang</surname> <given-names>J.</given-names></name> <name><surname>Liang</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Zuo</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Dppa2/4 as a trigger of signaling pathways to promote zygote genome activation by binding to CG-rich region.</article-title> <source><italic>Brief. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1093/bib/bbaa342</pub-id> [Epub Online ahead of print].</citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Pu</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020</year>). <article-title>DeepATT: a hybrid category attention neural network for identifying functional effects of DNA sequences.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume>:<fpage>bbaa159</fpage>.</citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Han</surname> <given-names>P.</given-names></name> <name><surname>You</surname> <given-names>Z.-H.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Yu</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>7</volume> <fpage>1</fpage>&#x2013;<lpage>13</lpage>.</citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Lei</surname> <given-names>H.</given-names></name> <name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Shi</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Research on Gastric Cancer&#x2019;s Drug-resistant Gene Regulatory Network Model.</article-title> <source><italic>Curr. Bioinform.</italic></source> <volume>15</volume> <fpage>225</fpage>&#x2013;<lpage>234</lpage>. <pub-id pub-id-type="doi">10.2174/1574893614666190722102557</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>X.</given-names></name> <name><surname>Quan</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Huang</surname> <given-names>H.</given-names></name> <name><surname>Zeng</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). <article-title>A novel molecular representation with BiGRU neural networks for learning atom.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>21</volume> <fpage>2099</fpage>&#x2013;<lpage>2111</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbz125</pub-id> <pub-id pub-id-type="pmid">31729524</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>BioSeq-Analysis: a platform for DNA, RNA and protein sequence analysis based on machine learning approaches.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>20</volume> <fpage>1280</fpage>&#x2013;<lpage>1294</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbx165</pub-id> <pub-id pub-id-type="pmid">29272359</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>B.</given-names></name> <name><surname>Gao</surname> <given-names>X.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>47</volume>:<fpage>e127</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz740</pub-id> <pub-id pub-id-type="pmid">31504851</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>B.</given-names></name> <name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Fang</surname> <given-names>L.</given-names></name> <name><surname>Chou</surname> <given-names>K.-C.</given-names></name></person-group> (<year>2015</year>). <article-title>Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>43</volume> <fpage>W65</fpage>&#x2013;<lpage>W71</lpage>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>B.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Chou</surname> <given-names>K.-C.</given-names></name></person-group> (<year>2017</year>). <article-title>Pse-in-One 2.0: an improved package of web servers for generating various modes of pseudo components of DNA, RNA, and protein sequences.</article-title> <source><italic>Nat. Sci.</italic></source> <volume>9</volume>:<fpage>67</fpage>. <pub-id pub-id-type="doi">10.4236/ns.2017.94007</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Wen</surname> <given-names>X.</given-names></name> <name><surname>Jorissen</surname> <given-names>R. N.</given-names></name> <name><surname>Gilson</surname> <given-names>M. K.</given-names></name></person-group> (<year>2007</year>). <article-title>BindingDB: a web-accessible database of experimentally determined protein&#x2013;ligand binding affinities.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>35</volume> <fpage>D198</fpage>&#x2013;<lpage>D201</lpage>.</citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Hong</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Rodr&#x00ED;guez-Pat&#x00F3;n</surname> <given-names>A.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2020</year>). <article-title>Zeng XJBib: Computational methods for identifying the critical nodes in biological networks.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>21</volume> <fpage>486</fpage>&#x2013;<lpage>497</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbz011</pub-id> <pub-id pub-id-type="pmid">30753282</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lv</surname> <given-names>Z. B.</given-names></name> <name><surname>Ao</surname> <given-names>C. Y.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2019</year>). <article-title>Protein Function Prediction: From Traditional Classifier to Deep Learning.</article-title> <source><italic>Proteomics</italic></source> <volume>19</volume>:<fpage>2</fpage>.</citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mahmud</surname> <given-names>S. H.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name> <name><surname>Meng</surname> <given-names>H.</given-names></name> <name><surname>Jahan</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Hasan</surname> <given-names>S. M.</given-names></name></person-group> (<year>2020</year>). <article-title>Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting.</article-title> <source><italic>Anal. Biochem.</italic></source> <volume>589</volume>:<fpage>13507</fpage>.</citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nguyen</surname> <given-names>T.</given-names></name> <name><surname>Le</surname> <given-names>H.</given-names></name> <name><surname>Venkatesh</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>GraphDTA: prediction of drug&#x2013;target binding affinity using graph convolutional networks.</article-title> <source><italic>BioRxiv</italic></source> <pub-id pub-id-type="doi">10.1101/684662</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>&#x00D6;zt&#x00FC;rk</surname> <given-names>H.</given-names></name> <name><surname>&#x00D6;zg&#x00FC;r</surname> <given-names>A.</given-names></name> <name><surname>Ozkirimli</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>DeepDTA: deep drug&#x2013;target binding affinity prediction.</article-title> <source><italic>Bioinformatics</italic></source> <volume>34</volume> <fpage>i821</fpage>&#x2013;<lpage>i829</lpage>.</citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pang</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2020</year>). <article-title>SelfAT-Fold: Protein Fold Recognition Based on Residue-Based and Motif-Based Self-Attention Networks.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1109/TCBB.2020.3031888</pub-id> [Epub Online ahead of print]. <pub-id pub-id-type="pmid">33090951</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prado-Prado</surname> <given-names>F.</given-names></name> <name><surname>Garc&#x00ED;a-Mera</surname> <given-names>X.</given-names></name> <name><surname>Abeij&#x00F3;n</surname> <given-names>P.</given-names></name> <name><surname>Alonso</surname> <given-names>N.</given-names></name> <name><surname>Caama&#x00F1;o</surname> <given-names>O.</given-names></name> <name><surname>Y&#x00E1;&#x00F1;ez</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Using entropy of drug and protein graphs to predict FDA drug-target network: theoretic-experimental study of MAO inhibitors and hemoglobin peptides from Fasciola hepatica.</article-title> <source><italic>Eur. J. Med. Chem.</italic></source> <volume>46</volume> <fpage>1074</fpage>&#x2013;<lpage>1094</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmech.2011.01.023</pub-id> <pub-id pub-id-type="pmid">21315497</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ru</surname> <given-names>X.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Ding</surname> <given-names>H.</given-names></name> <name><surname>Ye</surname> <given-names>X.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2020</year>). <article-title>Exploration of the correlation between GPCRs and drugs based on a learning to rank algorithm.</article-title> <source><italic>Comput. Biol. Med.</italic></source> <volume>119</volume>:<fpage>103660</fpage>. <pub-id pub-id-type="doi">10.1016/j.compbiomed.2020.103660</pub-id> <pub-id pub-id-type="pmid">32090901</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ru</surname> <given-names>X.</given-names></name> <name><surname>Ye</surname> <given-names>X.</given-names></name> <name><surname>Sakurai</surname> <given-names>T.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2021</year>). <article-title>Application of learning to rank in bioinformatics tasks.</article-title> <source><italic>Brief. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1093/bib/bbaa1394</pub-id> [Epub Online ahead of print].</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shao</surname> <given-names>J.</given-names></name> <name><surname>Yan</surname> <given-names>K.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2021</year>). <article-title>FoldRec-C2C: protein fold recognition by combining cluster-to-cluster model and protein similarity network.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume>:<fpage>bbaa144</fpage>. <pub-id pub-id-type="doi">10.1093/bib/bbaa144</pub-id> <pub-id pub-id-type="pmid">32685972</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shar</surname> <given-names>P. A.</given-names></name> <name><surname>Tao</surname> <given-names>W.</given-names></name> <name><surname>Gao</surname> <given-names>S.</given-names></name> <name><surname>Huang</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>B.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Pred-binding: large-scale protein&#x2013;ligand binding affinity prediction.</article-title> <source><italic>J. Enzyme Inhib. Med. Chem.</italic></source> <volume>31</volume> <fpage>1443</fpage>&#x2013;<lpage>1450</lpage>. <pub-id pub-id-type="doi">10.3109/14756366.2016.1144594</pub-id> <pub-id pub-id-type="pmid">26888050</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Ma</surname> <given-names>Q.</given-names></name> <name><surname>Yu</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure.</article-title> <source><italic>Genomics</italic></source> <volume>111</volume> <fpage>1839</fpage>&#x2013;<lpage>1852</lpage>. <pub-id pub-id-type="doi">10.1016/j.ygeno.2018.12.007</pub-id> <pub-id pub-id-type="pmid">30550813</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Srivastava</surname> <given-names>N.</given-names></name> <name><surname>Mishra</surname> <given-names>B. N.</given-names></name> <name><surname>Srivastava</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>In-Silico Identification of Drug Lead Molecule Against Pesticide Exposed-neurodevelopmental Disorders Through Network-based Computational Model Approach.</article-title> <source><italic>Curr. Bioinform.</italic></source> <volume>14</volume> <fpage>460</fpage>&#x2013;<lpage>467</lpage>. <pub-id pub-id-type="doi">10.2174/1574893613666181112130346</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stephenson</surname> <given-names>N.</given-names></name> <name><surname>Shane</surname> <given-names>E.</given-names></name> <name><surname>Chase</surname> <given-names>J.</given-names></name> <name><surname>Rowland</surname> <given-names>J.</given-names></name> <name><surname>Ries</surname> <given-names>D.</given-names></name> <name><surname>Justice</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Survey of Machine Learning Techniques in Drug Discovery.</article-title> <source><italic>Curr. Drug Metab.</italic></source> <volume>20</volume> <fpage>185</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.2174/1389200219666180820112457</pub-id> <pub-id pub-id-type="pmid">30124147</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Su</surname> <given-names>R.</given-names></name> <name><surname>Hu</surname> <given-names>J.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name> <name><surname>Manavalan</surname> <given-names>B.</given-names></name> <name><surname>Wei</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>21</volume> <fpage>408</fpage>&#x2013;<lpage>420</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bby124</pub-id> <pub-id pub-id-type="pmid">30649170</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tabei</surname> <given-names>Y.</given-names></name> <name><surname>Pauwels</surname> <given-names>E.</given-names></name> <name><surname>Stoven</surname> <given-names>V.</given-names></name> <name><surname>Takemoto</surname> <given-names>K.</given-names></name> <name><surname>Yamanishi</surname> <given-names>Y.</given-names></name></person-group> (<year>2012</year>). <article-title>Identification of chemogenomic features from drug&#x2013;target interaction networks using interpretable classifiers.</article-title> <source><italic>Bioinformatics</italic></source> <volume>28</volume> <fpage>i487</fpage>&#x2013;<lpage>i494</lpage>.</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Szwajda</surname> <given-names>A.</given-names></name> <name><surname>Shakyawar</surname> <given-names>S.</given-names></name> <name><surname>Xu</surname> <given-names>T.</given-names></name> <name><surname>Hintsanen</surname> <given-names>P.</given-names></name> <name><surname>Wennerberg</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis.</article-title> <source><italic>J. Chem. Inform. Model.</italic></source> <volume>54</volume> <fpage>735</fpage>&#x2013;<lpage>743</lpage>. <pub-id pub-id-type="doi">10.1021/ci400709d</pub-id> <pub-id pub-id-type="pmid">24521231</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tang</surname> <given-names>Y.-J.</given-names></name> <name><surname>Pang</surname> <given-names>Y.-H.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2020</year>). <article-title>IDP-Seq2Seq: Identification of Intrinsically Disordered Regions based on Sequence to Sequence Learning.</article-title> <source><italic>Bioinformaitcs</italic></source> <volume>36</volume> <fpage>5177</fpage>&#x2013;<lpage>5186</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa667</pub-id> <pub-id pub-id-type="pmid">32702119</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tao</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Teng</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>A Method for Identifying Vesicle Transport Proteins Based on LibSVM and MRMD.</article-title> <source><italic>Comput. Math. Methods Med</italic></source> <volume>2020</volume>:<fpage>8926750</fpage>.</citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2020</year>). <article-title>Identification of membrane protein types via multivariate information fusion with Hilbert-Schmidt Independence Criterion.</article-title> <source><italic>Neurocomputing</italic></source> <volume>383</volume> <fpage>257</fpage>&#x2013;<lpage>269</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2019.11.103</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Liang</surname> <given-names>P.</given-names></name> <name><surname>Zheng</surname> <given-names>L.</given-names></name> <name><surname>Long</surname> <given-names>C.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Zuo</surname> <given-names>Y.</given-names></name></person-group> (<year>2021a</year>). <article-title>eHSCPr discriminating the cell identity involved in endothelial to hematopoietic transition.</article-title> <source><italic>Bioinformatics</italic></source></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2021b</year>). <article-title>Exploring associations of non-coding RNAs in human diseases via three-matrix factorization with hypergraph-regular terms on center kernel alignment.</article-title> <source><italic>Brief. Bioinform.</italic></source> <pub-id pub-id-type="doi">10.1093/bib/bbaa409</pub-id> [Epub Online ahead of print]. <pub-id pub-id-type="pmid">33443536</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Chang</surname> <given-names>H.</given-names></name></person-group> (<year>2020</year>). <article-title>Predicting drug-target interactions via FM-DNN learning.</article-title> <source><italic>Curr. Bioinform.</italic></source> <volume>15</volume> <fpage>68</fpage>&#x2013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.2174/1574893614666190227160538</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>You</surname> <given-names>Z.-H.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xia</surname> <given-names>S.-X.</given-names></name> <name><surname>Liu</surname> <given-names>F.</given-names></name> <name><surname>Yan</surname> <given-names>X.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>A computational-based method for predicting drug&#x2013;target interactions by using stacked autoencoder deep neural network.</article-title> <source><italic>J. Comput. Biol.</italic></source> <volume>25</volume> <fpage>361</fpage>&#x2013;<lpage>373</lpage>. <pub-id pub-id-type="doi">10.1089/cmb.2017.0135</pub-id> <pub-id pub-id-type="pmid">28891684</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.-C.</given-names></name> <name><surname>Yang</surname> <given-names>Z.-X.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Deng</surname> <given-names>N.-Y.</given-names></name></person-group> (<year>2010</year>). <article-title>Computationally probing drug-protein interactions via support vector machine.</article-title> <source><italic>Lett. Drug Des. Discov.</italic></source> <volume>7</volume> <fpage>370</fpage>&#x2013;<lpage>378</lpage>. <pub-id pub-id-type="doi">10.2174/157018010791163433</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Ding</surname> <given-names>Y.</given-names></name> <name><surname>Su</surname> <given-names>R.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2018</year>). <article-title>Prediction of human protein subcellular localization using deep learning.</article-title> <source><italic>J. Parallel Distrib. Comput.</italic></source> <volume>117</volume> <fpage>212</fpage>&#x2013;<lpage>217</lpage>.</citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Liao</surname> <given-names>M.</given-names></name> <name><surname>Gao</surname> <given-names>Y.</given-names></name> <name><surname>Ji</surname> <given-names>R.</given-names></name> <name><surname>He</surname> <given-names>Z.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2014</year>). <article-title>Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <volume>11</volume> <fpage>192</fpage>&#x2013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.1109/tcbb.2013.146</pub-id> <pub-id pub-id-type="pmid">26355518</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Wan</surname> <given-names>S.</given-names></name> <name><surname>Guo</surname> <given-names>J.</given-names></name> <name><surname>Wong</surname> <given-names>K. K. L.</given-names></name></person-group> (<year>2017a</year>). <article-title>A novel hierarchical selective ensemble classifier with bioinformatics application.</article-title> <source><italic>Artif. Intell. Med.</italic></source> <volume>83</volume> <fpage>82</fpage>&#x2013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1016/j.artmed.2017.02.005</pub-id> <pub-id pub-id-type="pmid">28245947</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Xing</surname> <given-names>P.</given-names></name> <name><surname>Shi</surname> <given-names>G.</given-names></name> <name><surname>Ji</surname> <given-names>Z.</given-names></name> <name><surname>Zou</surname> <given-names>Q.</given-names></name></person-group> (<year>2019</year>). <article-title>Fast Prediction of Protein Methylation Sites Using a Sequence-Based Feature Selection Technique.</article-title> <source><italic>IEEE-ACM Trans. Comput. Biol. Bioinform.</italic></source> <volume>16</volume> <fpage>1264</fpage>&#x2013;<lpage>1273</lpage>. <pub-id pub-id-type="doi">10.1109/tcbb.2017.2670558</pub-id> <pub-id pub-id-type="pmid">28222000</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Xing</surname> <given-names>P.</given-names></name> <name><surname>Zeng</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Su</surname> <given-names>R.</given-names></name> <name><surname>Guo</surname> <given-names>F.</given-names></name></person-group> (<year>2017b</year>). <article-title>Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier.</article-title> <source><italic>Artif. Intell. Med.</italic></source> <volume>83</volume> <fpage>67</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1016/j.artmed.2017.03.001</pub-id> <pub-id pub-id-type="pmid">28320624</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wishart</surname> <given-names>D. S.</given-names></name> <name><surname>Feunang</surname> <given-names>Y. D.</given-names></name> <name><surname>Guo</surname> <given-names>A. C.</given-names></name> <name><surname>Lo</surname> <given-names>E. J.</given-names></name> <name><surname>Marcu</surname> <given-names>A.</given-names></name> <name><surname>Grant</surname> <given-names>J. R.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>DrugBank 5.0: a major update to the DrugBank database for 2018.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>46</volume> <fpage>D1074</fpage>&#x2013;<lpage>D1082</lpage>.</citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wishart</surname> <given-names>D. S.</given-names></name> <name><surname>Knox</surname> <given-names>C.</given-names></name> <name><surname>Guo</surname> <given-names>A. C.</given-names></name> <name><surname>Cheng</surname> <given-names>D.</given-names></name> <name><surname>Shrivastava</surname> <given-names>S.</given-names></name> <name><surname>Tzur</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>DrugBank: a knowledgebase for drugs, drug actions and drug targets.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>36</volume> <fpage>D901</fpage>&#x2013;<lpage>D906</lpage>.</citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>X.</given-names></name> <name><surname>Min</surname> <given-names>J.-L.</given-names></name> <name><surname>Wang</surname> <given-names>P.</given-names></name> <name><surname>Chou</surname> <given-names>K.-C.</given-names></name></person-group> (<year>2013</year>). <article-title>iGPCR-Drug: A web server for predicting interaction between GPCRs and drugs in cellular networking.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<fpage>e72234</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0072234</pub-id> <pub-id pub-id-type="pmid">24015221</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Liang</surname> <given-names>G.</given-names></name> <name><surname>Shi</surname> <given-names>S.</given-names></name> <name><surname>Liao</surname> <given-names>C.</given-names></name></person-group> (<year>2018a</year>). <article-title>SeqSVM: A Sequence-Based Support Vector Machine Method for Identifying Antioxidant Proteins.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>19</volume>:<fpage>1773</fpage>. <pub-id pub-id-type="doi">10.3390/ijms19061773</pub-id> <pub-id pub-id-type="pmid">29914044</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Liang</surname> <given-names>G.</given-names></name> <name><surname>Liao</surname> <given-names>C.</given-names></name> <name><surname>Chen</surname> <given-names>G. D.</given-names></name> <name><surname>Chang</surname> <given-names>C. C.</given-names></name></person-group> (<year>2018b</year>). <article-title>An Efficient Classifier for Alzheimer&#x2019;s Disease Genes Identification.</article-title> <source><italic>Molecules</italic></source> <volume>23</volume>:<fpage>13</fpage>.</citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Liang</surname> <given-names>G.</given-names></name> <name><surname>Liao</surname> <given-names>C.</given-names></name> <name><surname>Chen</surname> <given-names>G. D.</given-names></name> <name><surname>Chang</surname> <given-names>C. C.</given-names></name></person-group> (<year>2019</year>). <article-title>k-Skip-n-Gram-RF: A Random Forest Based Method for Alzheimer&#x2019;s Disease Protein Identification.</article-title> <source><italic>Front. Genet.</italic></source> <volume>10</volume>:<fpage>7</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00033</pub-id> <pub-id pub-id-type="pmid">30809242</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamanishi</surname> <given-names>Y.</given-names></name> <name><surname>Araki</surname> <given-names>M.</given-names></name> <name><surname>Gutteridge</surname> <given-names>A.</given-names></name> <name><surname>Honda</surname> <given-names>W.</given-names></name> <name><surname>Kanehisa</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Prediction of drug&#x2013;target interaction networks from the integration of chemical and genomic spaces.</article-title> <source><italic>Bioinformatics</italic></source> <volume>24</volume> <fpage>i232</fpage>&#x2013;<lpage>i240</lpage>.</citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamanishi</surname> <given-names>Y.</given-names></name> <name><surname>Kotera</surname> <given-names>M.</given-names></name> <name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Goto</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>i246</fpage>&#x2013;<lpage>i254</lpage>.</citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>X.</given-names></name> <name><surname>Han</surname> <given-names>G.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Cai</surname> <given-names>H.</given-names></name></person-group> (<year>2018</year>). <article-title>Finding correlated patterns via high-order matching for multiple sourced biological data.</article-title> <source><italic>IEEE Trans. Biomed. Eng.</italic></source> <volume>66</volume> <fpage>1017</fpage>&#x2013;<lpage>1025</lpage>. <pub-id pub-id-type="doi">10.1109/tbme.2018.2866266</pub-id> <pub-id pub-id-type="pmid">30130172</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yuan</surname> <given-names>Q.</given-names></name> <name><surname>Gao</surname> <given-names>J.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Zhang</surname> <given-names>S.</given-names></name> <name><surname>Mamitsuka</surname> <given-names>H.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>DrugE-Rank: improving drug&#x2013;target interaction prediction of new candidate drugs or targets by ensemble learning to rank.</article-title> <source><italic>Bioinformatics</italic></source> <volume>32</volume> <fpage>i18</fpage>&#x2013;<lpage>i27</lpage>.</citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>He</surname> <given-names>Y.</given-names></name> <name><surname>Lv</surname> <given-names>L.</given-names></name> <name><surname>Min</surname> <given-names>X.</given-names></name></person-group> (<year>2020a</year>). <article-title>Deep collaborative filtering for prediction of disease genes.</article-title> <source><italic>IEEE/ACM Trans. Comput. Biol. Bioinform.</italic></source> <volume>17</volume> <fpage>1639</fpage>&#x2013;<lpage>1647</lpage>.</citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Song</surname> <given-names>X.</given-names></name> <name><surname>Ma</surname> <given-names>T.</given-names></name> <name><surname>Pan</surname> <given-names>X.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Hou</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2020b</year>). <article-title>Cheng FJJopr: Repurpose open data to discover therapeutics for COVID-19 using deep learning.</article-title> <source><italic>J. Proteome Res.</italic></source> <volume>19</volume> <fpage>4624</fpage>&#x2013;<lpage>4636</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jproteome.0c00316</pub-id> <pub-id pub-id-type="pmid">32654489</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name> <name><surname>Hou</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>P.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020c</year>). <article-title>Network-based prediction of drug&#x2013;target interactions using an arbitrary-order proximity embedded deep forest.</article-title> <source><italic>Bioinformatics</italic></source> <volume>36</volume> <fpage>2805</fpage>&#x2013;<lpage>2812</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa010</pub-id> <pub-id pub-id-type="pmid">31971579</pub-id></citation></ref>
<ref id="B85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Nussinov</surname> <given-names>R.</given-names></name> <name><surname>Cheng</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>deepDR: a network-based deep learning approach to in silico drug repositioning.</article-title> <source><italic>Bioinformatics</italic></source> <volume>35</volume> <fpage>5191</fpage>&#x2013;<lpage>5198</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz418</pub-id> <pub-id pub-id-type="pmid">31116390</pub-id></citation></ref>
<ref id="B86"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>X.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name> <name><surname>Lu</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Zhou</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2020d</year>). <article-title>Target identification among known drugs by deep learning from heterogeneous networks.</article-title> <source><italic>Chem. Sci.</italic></source> <volume>11</volume> <fpage>1775</fpage>&#x2013;<lpage>1797</lpage>. <pub-id pub-id-type="doi">10.1039/c9sc04336e</pub-id> <pub-id pub-id-type="pmid">34123272</pub-id></citation></ref>
<ref id="B87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhai</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Teng</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions.</article-title> <source><italic>Front. Cell Dev. Biol.</italic></source> <volume>8</volume>:<fpage>591487</fpage>. <pub-id pub-id-type="doi">10.3389/fcell.2020.591487</pub-id> <pub-id pub-id-type="pmid">33195258</pub-id></citation></ref>
<ref id="B88"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>G.</given-names></name> <name><surname>Yu</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name> <name><surname>Yan</surname> <given-names>C.</given-names></name></person-group> (<year>2020</year>). <article-title>Feature Selection Algorithm for High-dimensional Biomedical Data Using Information Gain and Improved Chemical Reaction Optimization.</article-title> <source><italic>Curr. Bioinform.</italic></source> <volume>15</volume> <fpage>912</fpage>&#x2013;<lpage>926</lpage>. <pub-id pub-id-type="doi">10.2174/1574893615666200204154358</pub-id></citation></ref>
<ref id="B89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Ji</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Zhu</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>When drug discovery meets web search: learning to rank for ligand-based virtual screening.</article-title> <source><italic>J. Cheminform.</italic></source> <volume>7</volume> <fpage>1</fpage>&#x2013;<lpage>13</lpage>.</citation></ref>
<ref id="B90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Yan</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Gong</surname> <given-names>M.</given-names></name> <name><surname>Gao</surname> <given-names>D.</given-names></name> <name><surname>Zhu</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Review of the Applications of Deep Learning in Bioinformatics.</article-title> <source><italic>Curr. Bioinform.</italic></source> <volume>15</volume> <fpage>898</fpage>&#x2013;<lpage>911</lpage>. <pub-id pub-id-type="doi">10.2174/1574893615999200711165743</pub-id></citation></ref>
<ref id="B91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>T.</given-names></name> <name><surname>Hu</surname> <given-names>Y.</given-names></name> <name><surname>Peng</surname> <given-names>J.</given-names></name> <name><surname>Cheng</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>DeepLGP: a novel deep learning method for prioritizing lncRNA target genes.</article-title> <source><italic>Bioinformatics</italic></source> <volume>36</volume> <fpage>4466</fpage>&#x2013;<lpage>4472</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa428</pub-id> <pub-id pub-id-type="pmid">32467970</pub-id></citation></ref>
<ref id="B92"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>X.</given-names></name> <name><surname>Jiao</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Huang</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>ECFS-DEA: an ensemble classifier-based feature selection for differential expression analysis on expression profiles.</article-title> <source><italic>BMC Bioinformatics</italic></source> <volume>21</volume>:<fpage>43</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-020-3388-y</pub-id> <pub-id pub-id-type="pmid">32024464</pub-id></citation></ref>
<ref id="B93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>L.</given-names></name> <name><surname>Huang</surname> <given-names>S.</given-names></name> <name><surname>Mu</surname> <given-names>N.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name> <name><surname>Chang</surname> <given-names>Y.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>RAACBook: a web server of reduced amino acid alphabet for sequence-dependent inference by using Chou&#x2019;s five-step rule.</article-title> <source><italic>Database</italic></source> <volume>2019</volume>:<fpage>baz131</fpage>.</citation></ref>
<ref id="B94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>D.</given-names></name> <name><surname>Yang</surname> <given-names>W.</given-names></name> <name><surname>Yang</surname> <given-names>L.</given-names></name> <name><surname>Zuo</surname> <given-names>Y.</given-names></name></person-group> (<year>2020</year>). <article-title>RaacLogo: a new sequence logo generator by using reduced amino acid clusters.</article-title> <source><italic>Brief. Bioinform.</italic></source> <volume>22</volume>:<fpage>bbaa096</fpage>.</citation></ref>
<ref id="B95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>Q.</given-names></name> <name><surname>Wan</surname> <given-names>S.</given-names></name> <name><surname>Ju</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>J.</given-names></name> <name><surname>Zeng</surname> <given-names>X.</given-names></name></person-group> (<year>2016a</year>). <article-title>Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy.</article-title> <source><italic>BMC Syst. Biol.</italic></source> <volume>10</volume>:<fpage>114</fpage>. <pub-id pub-id-type="doi">10.1186/s12918-016-0353-5</pub-id> <pub-id pub-id-type="pmid">28155714</pub-id></citation></ref>
<ref id="B96"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>Q.</given-names></name> <name><surname>Xing</surname> <given-names>P.</given-names></name> <name><surname>Wei</surname> <given-names>L.</given-names></name> <name><surname>Liu</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>Gene2vec: Gene Subsequence Embedding for Prediction of Mammalian N6-Methyladenosine Sites from mRNA.</article-title> <source><italic>RNA</italic></source> <volume>25</volume> <fpage>205</fpage>&#x2013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1261/rna.069112.118</pub-id> <pub-id pub-id-type="pmid">30425123</pub-id></citation></ref>
<ref id="B97"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zou</surname> <given-names>Q.</given-names></name> <name><surname>Zeng</surname> <given-names>J.</given-names></name> <name><surname>Cao</surname> <given-names>L.</given-names></name> <name><surname>Ji</surname> <given-names>R.</given-names></name></person-group> (<year>2016b</year>). <article-title>A novel features ranking metric with application to scalable visual and bioinformatics data classification.</article-title> <source><italic>Neurocomputing</italic></source> <volume>173</volume> <fpage>346</fpage>&#x2013;<lpage>354</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2014.12.123</pub-id></citation></ref>
<ref id="B98"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zuo</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>G.</given-names></name> <name><surname>Yan</surname> <given-names>Z.</given-names></name> <name><surname>Yang</surname> <given-names>L.</given-names></name></person-group> (<year>2017</year>). <article-title>PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition.</article-title> <source><italic>Bioinformatics</italic></source> <volume>33</volume> <fpage>122</fpage>&#x2013;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw564</pub-id> <pub-id pub-id-type="pmid">27565583</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.uniprot.org/">https://www.uniprot.org/</ext-link></p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://pubchem.ncbi.nlm.nih.gov/">https://pubchem.ncbi.nlm.nih.gov/</ext-link></p></fn>
<fn id="footnote3">
<label>3</label>
<p><ext-link ext-link-type="uri" xlink:href="https://go.drugbank.com/">https://go.drugbank.com/</ext-link></p></fn>
<fn id="footnote4">
<label>4</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.genome.jp/kegg/">https://www.genome.jp/kegg/</ext-link></p></fn>
<fn id="footnote5">
<label>5</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.bindingdb.org/bind/index.jsp">https://www.bindingdb.org/bind/index.jsp</ext-link></p></fn>
<fn id="footnote6">
<label>6</label>
<p><ext-link ext-link-type="uri" xlink:href="http://stitch.embl.de/">http://stitch.embl.de/</ext-link></p></fn>
<fn id="footnote7">
<label>7</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.swisstargetprediction.ch/">http://www.swisstargetprediction.ch/</ext-link></p></fn>
<fn id="footnote8">
<label>8</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.rdkit.org/">https://www.rdkit.org/</ext-link></p></fn>
<fn id="footnote9">
<label>9</label>
<p><ext-link ext-link-type="uri" xlink:href="https://mariewelt.github.io/OpenChem/html/index.html">https://mariewelt.github.io/OpenChem/html/index.html</ext-link></p></fn>
<fn id="footnote10">
<label>10</label>
<p><ext-link ext-link-type="uri" xlink:href="https://ifeature.erc.monash.edu/">https://ifeature.erc.monash.edu/</ext-link></p></fn>
<fn id="footnote11">
<label>11</label>
<p><ext-link ext-link-type="uri" xlink:href="http://bioinformatics.hitsz.edu.cn/Pse-in-One/">http://bioinformatics.hitsz.edu.cn/Pse-in-One/</ext-link></p></fn>
</fn-group>
</back>
</article>