Front. Pharmacol.

Frontiers in Pharmacology

1663-9812

Frontiers Media S.A.

1375522

10.3389/fphar.2024.1375522

Pharmacology

Review

A comprehensive review of the recent advances on predicting drug-target affinity based on deep learning

Zeng et al.

Xin Zeng ¹, Shu-Juan Li ², Shuang-Qing Lv ³, Meng-Liang Wen ⁴ and Yi Li ¹ *

¹ College of Mathematics and Computer Science, Dali University, Dali, China
² Yunnan Institute of Endemic Diseases Control and Prevention, Dali, China
³ Institute of Surveying and Information Engineering, West Yunnan University of Applied Science, Dali, China
⁴ State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China

Edited by: Heike Wulff, University of California, Davis, United States

Reviewed by: Karim Abbasi, Sharif University of Technology, Iran

Zhiqin Zhu, Chongqing University of Posts and Telecommunications, China

*Correspondence: Yi Li, yili@dali.edu.cn

Received: 24 January 2024; Accepted: 21 March 2024; Published: 02 April 2024

Copyright © 2024 Zeng, Li, Lv, Wen and Li

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

Accurate calculation of drug-target affinity (DTA) is crucial for various applications in the pharmaceutical industry, including drug screening, design, and repurposing. However, traditional machine learning methods for calculating DTA often lack accuracy, which remains a significant challenge. Fortunately, deep learning has emerged as a promising approach in computational biology, leading to the development of various deep learning-based methods for DTA prediction. To support researchers in developing novel, high-precision methods, we provide a comprehensive review of recent advances in predicting DTA using deep learning. We first conducted a statistical analysis of commonly used public datasets, providing essential information and describing how these datasets are used. We further explored the common representations of the sequences and structures of drugs and targets. These analyses served as the foundation for constructing deep learning-based DTA prediction methods. Next, we explained how deep learning models, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Transformers, and Graph Neural Networks (GNNs), were employed in specific DTA prediction methods, and highlighted the unique advantages and applications of these models in this context. Finally, we conducted a performance analysis of multiple state-of-the-art deep learning-based methods for predicting DTA. This review aims to help researchers understand the shortcomings and advantages of existing methods and to develop high-precision DTA prediction tools that promote drug discovery.

Keywords: deep learning, drug-target affinity, dataset, representation, methods

Section: Experimental Pharmacology and Drug Discovery

1 Introduction

Drug-target affinity (DTA) is a critical metric at the core of drug discovery. Although wet-lab experiments measure DTA with high accuracy, they are time-consuming and laborious and can no longer meet the demands of modern drug screening, especially given the massive number of drug-target pairs. Fortunately, computational methods for predicting DTA have accelerated the drug screening process, helping to shorten the drug development cycle and reduce costs (Kairys et al., 2019; Abbasi et al., 2021; Xu et al., 2021; Zhang et al., 2023a).

At present, non-machine learning methods are available for computing DTA, such as FEP (Free-Energy Perturbation) (Jorgensen and Thomas, 2008) and MM/GBSA (or MM/PBSA) (Çınaroğlu and Timuçin, 2020), which can effectively estimate the binding free energy or affinity of a drug-target pair. However, these methods demand a significant amount of computing resources and are slow when dealing with a large number of drug-target pairs. In contrast, data-driven machine learning methods offer fast processing speeds and high computational accuracy. Machine learning-based methods for predicting DTA can be classified into two categories: traditional machine learning methods and deep learning methods. Traditional machine learning methods employ linear regression, random forest regression, nearest neighbor regression, and support vector machine regression (Ballester and Mitchell, 2010; Li et al., 2015; Shar et al., 2016) to predict DTA. Although these methods perform well, they cannot automatically extract high-level hidden features from drugs and targets. With the emergence of deep learning models, deep learning-based DTA prediction methods (Öztürk et al., 2018; Wang et al., 2021a; Rube et al., 2022) can automatically extract high-level hidden features from the sequences and structures of drugs and targets, resulting in improved performance compared to traditional machine learning methods.
A few methods use deep learning to extract high-level features from target-target and drug-drug interaction networks (Dehghan et al., 2023; Rafiei et al., 2023). Apart from these, deep learning-based DTA prediction methods can be broadly divided into three categories according to how they combine the multimodal features of drugs and targets, such as fingerprints, SMILES, two-dimensional molecular topology graphs, three-dimensional spatial structures, physicochemical properties, sequences, and contact maps: sequence-based, hybrid-based, and structure-based methods.

Sequence-based methods (Öztürk et al., 2018; Karimi et al., 2019; Wang et al., 2021a; Li et al., 2022b; Ghimire et al., 2022; Zhao et al., 2022; Gim et al., 2023; Jin et al., 2023; Kalemati et al., 2023; Ru et al., 2023; Zhou et al., 2024) aim to extract implicit sequence features from drug SMILES (Simplified Molecular Input Line Entry System) (Weininger, 1988) and target sequences using deep learning models. These methods leverage various sequence deep learning models such as Convolutional Neural Networks (CNNs) (LeCun et al., 2015), Recurrent Neural Networks (RNNs) (Zaremba et al., 2015), and Transformers (Vaswani et al., 2017). In the current sequence-based methods, 1D-CNN, RNNs, and Transformers are commonly used to extract high-level sequence features. On the other hand, 2D-CNN is employed to extract sequence features from a two-dimensional matrix composed of drugs or targets. For instance, DeepDTA (Öztürk et al., 2018) utilized a CNN module with three consecutive 1D convolutional layers to extract sequence features from drug SMILES and target sequences, respectively. SimCNN-DTA (Shim et al., 2021), on the other hand, employed 2D-CNN to predict DTA by utilizing the outer product between column vectors of two similar matrices representing drugs and targets. While CNNs effectively capture the local features from drug SMILES and target sequences, they may overlook long-range dependencies between atoms or amino acids. To address this issue, RNNs with memory functions can be utilized to extract long-range dependent features, as demonstrated in DeepAffinity (Karimi et al., 2019) and DeepCDA (Abbasi et al., 2020). However, CNNs and RNNs may not focus on the key features influencing drug-target interaction or provide interpretability for the model’s effectiveness. 
Attention mechanisms (Vaswani et al., 2017) are therefore employed to capture the key features (Zeng et al., 2021; Chen et al., 2022; Ghimire et al., 2022; Monteiro et al., 2022; Zhang et al., 2022; Zhao et al., 2022). For example, AttentionDTA (Zhao et al., 2022) utilized attention mechanisms to focus on subsequences within drug SMILES and target sequences that played a crucial role in affinity prediction. MRBDTA (Zhang et al., 2022) incorporated multi-head attention mechanisms, effectively capturing drug-target interaction sites and providing interpretational analysis for its effectiveness. CAPLS (Jin et al., 2023) employed the cross-attention mechanism to capture the mutual effect of the protein-binding pocket and the ligand. MT-DTA (Zhu et al., 2023b) built a variational autoencoder system with a cascade structure of attention models and CNNs to extract the implied high-level interactive features from target sequences and drug SMILES. Sequence-based methods have the advantage that target sequences and drug SMILES are easy to obtain. These methods process sequence data swiftly without demanding substantial computing resources, and perform well in predicting DTA based on the extracted high-level sequence features. Nevertheless, they overlook additional multimodal information related to targets and drugs, such as topology graphs and 3D structures. Structural information harbors crucial features that significantly influence DTA prediction, and disregarding it may limit the accuracy, depth, and interpretability of DTA prediction. However, using target structures to enhance DTA prediction was limited in the early stages, as only a small portion of target sequences had known structures. Consequently, hybrid-based methods emerged, incorporating the structural features of drugs into sequence-based approaches.

Hybrid-based methods (Karimi et al., 2021; Wang et al., 2021b; Zhang et al., 2021; Cheng et al., 2022; Li et al., 2022a; Lin et al., 2022a; Tian et al., 2022; Yang et al., 2022; Jiang et al., 2023; Pan et al., 2023; Wang et al., 2023a; Wang and Li, 2023; Xia et al., 2023; Yang et al., 2023; Zeng et al., 2023; Zhang et al., 2023b; Zhang et al., 2023a; Zhu et al., 2023a; Zhu et al., 2023c; Nguyen et al., 2022.) leverage deep learning models to extract sequence features from drug SMILES and target sequences, as well as the structural features from two-dimensional molecular topology graphs and three-dimensional structures of drug small molecules. These methods focus on integrating the structural features of drugs into sequence-based approaches. For the structures of drugs, tools like RDKit (Landrum, 2013) are commonly used to convert drug SMILES into the molecular graphs. GNN models (Xu et al., 2019) are employed for capturing the structural features of drugs. For instance, GraphDTA (Nguyen et al., 2021) utilized GCN and CNN to extract the structural features from drug molecular graphs and sequence features from target sequences, respectively. These extracted features were then combined as inputs and passed through fully connected layers to predict DTA. SAG-DTA (Zhang et al., 2021) incorporated a GCN with multiple self-attention graph pooling layers to extract the hidden features from drug molecular graphs. CNN was directly applied to the target sequences for learning high-level features. TDGraphDTA (Zhu et al., 2023c) introduced the transformer and diffusion to predict drug-target interactions using multi-scale information interaction and graph optimization. Hybrid-based methods combine the structural features of drugs with sequence-based approaches, enriching the features of drugs. Typically, GNN are employed to extract the drug structural features from molecular graphs converted from drug SMILES. 
These molecular graphs are relatively small and have minimal impact on the computational speed of the model. However, the three-dimensional structural features of drugs are underutilized in hybrid-based methods. Furthermore, these methods completely overlook the structural features of targets, which makes it difficult to provide explanatory analysis of the model’s effectiveness and leaves ample opportunity for performance enhancement. With the advent of AlphaFold (Jumper et al., 2021) and ColabFold (Kim et al., 2023), two target structure prediction tools, obtaining target structures has become less challenging. Consequently, there is growing interest in structure-based methods, which use the structures of both drugs and targets to predict DTA.

Structure-based methods (Gomes et al., 2017; Stepniewska-Dziubinska et al., 2018; Zhang et al., 2019; Jiang et al., 2020; Seo et al., 2021; Shen et al., 2021; Lin et al., 2022b; Ma et al., 2022; Lu et al., 2023; Wu et al., 2024) employ deep learning models like GNN and 3D Convolutional Neural Network (3D-CNN) to extract implicit structural features from the molecular graphs of drugs and targets or the 3D structures of drug-target complexes. Using GNNs (Li et al., 2021; Yuan et al., 2021; Chu et al., 2022; Jiang et al., 2022; Liao et al., 2022; Pandey et al., 2022; Bi et al., 2023; Wang et al., 2023b; Zhang et al., 2023d; Zhang et al., 2023c; Ma et al., 2023; Mekni et al., 2023; Tsui et al., 2023; Tian et al., 2024), the molecular graphs of drugs and targets are fed into GNN to obtain the structural features. For example, PSG-BAR (Pandey et al., 2022) served as an example where a contact map was generated based on the 3D structure of target. Target graph was then constructed using the contact map, and the structural features were extracted using RGAT. For drug, the graph was generated based on its structural file, and RGAT was also employed to extract high-level features. AttentionMGT-DTA (Wu et al., 2024) represented drugs and targets by a molecular graph and binding pocket graph, respectively. Graph transformer module was utilized to extract the structural features of drugs and binding pockets. WGNN-DTA (Jiang et al., 2022) constructed protein and molecular graphs through sequence and SMILES that can effectively reflect their structures. Weighted graph neural networks were used to extract the structural features of molecules and proteins for predicting DTA. On the other hand, 3D-CNN-based methods (Zheng et al., 2019; Kwon et al., 2020; Liu et al., 2021; Wang et al., 2022) directly take the 3D structure of drug-target complex as input and use the extracted spatial features of complex as input for the FC network to predict DTA. 
For example, AK-Score (Kwon et al., 2020) employed an ensemble of multiple independently trained networks, each consisting of multiple channels of 3D-CNN layers, to predict the binding affinity of a complex. Sfcnn (Wang et al., 2022) converted the drug-target complex into 3D grids for CNN training to extract structural features. Structure-based methods make effective use of the structural features of drugs and targets and yield impressive performance. They are especially valuable for providing explanatory analyses that shed light on the model’s effectiveness, thereby facilitating research into DTA prediction methods and promoting wider application of these models. However, they have limitations. One is the reliance on tools like AlphaFold to obtain target structures: while AlphaFold predicts the structures of monomeric proteins with high accuracy, its performance on other proteins still requires optimization. Additionally, extracting structural features from protein structure graphs can be computationally demanding and results in slower processing speeds.

In this review, we aimed to highlight the crucial significance of precise DTA prediction, followed by a comprehensive overview of the universal datasets and widely used representation methods for the sequences, structures, and complexes of drugs and targets. We then focused on the widespread application of popular deep learning techniques in DTA prediction. Our goal was to provide a comprehensive overview of datasets, representation, methods, and deep learning techniques for predicting DTA. By doing so, we intend to empower researchers to effectively utilize these resources in developing innovative DTA prediction methods, thereby providing essential support for drug discovery, design, and repurposing endeavors. The main contributions of this review can be summarized as follows:

(1) A comprehensive statistical analysis has been carried out on datasets, representations, model architectures, and performance evaluation of state-of-the-art methods based on deep learning for predicting DTA.

(2) The process of extracting crucial implicit features from diverse modalities of drugs and targets with cutting-edge deep learning technologies such as CNN, RNN, GNN, and Transformer is described in detail.

(3) An in-depth analysis of the strengths and limitations of advanced deep learning methods for predicting DTA is conducted from three perspectives: sequence, hybrid, and structure. This analysis serves as a foundation for researchers to develop novel and more accurate tools for DTA prediction.

2 Dataset

A high-quality dataset of drug-target binding affinity serves as the fundamental basis for the development of computational methods that leverage deep learning for predicting DTA. Currently, the most widely used datasets for DTA prediction include PDBbind (multiple versions) (Wang et al., 2005), Davis (Davis et al., 2011), KIBA (Tang et al., 2014), BindingDB (Liu et al., 2007), and Metz (Metz et al., 2011). To supplement these universal affinity datasets, UniRef (Suzek et al., 2015), UniProt (The et al., 2021), Protein Data Bank (PDB) (Berman, 2000), STITCH (Kuhn et al., 2007), and ZINC (Irwin and Shoichet, 2006) can provide additional sequences and structures for drugs and targets that may be missing.

2.1 Statistical analysis of commonly used datasets for DTA prediction

We performed a statistical analysis of the datasets used in the nearly 80 references on drug-target affinity cited in this review to assess their usage. The results, depicted in Figure 1, revealed that PDBbind, Davis, KIBA, BindingDB, and Metz were the five most frequently employed datasets. Among these, PDBbind and BindingDB were primarily used by hybrid- or structure-based deep learning methods, since these two datasets offer comprehensive sequence and structural data for drugs and targets. On the other hand, Davis, KIBA, and Metz were predominantly used by sequence-based deep learning methods, although some hybrid- or structure-based methods also used them. It is worth mentioning that the structures of targets in Davis, KIBA, and Metz were sourced from the PDB database.

FIGURE 1

Statistics on the usage of the datasets for predicting DTA based on deep learning.

2.2 Introduction to commonly used datasets for DTA prediction

In this section, we provide a detailed introduction to the five most frequently used datasets: PDBbind, Davis, KIBA, BindingDB, and Metz. In addition, we will also introduce the ToxCast dataset (Feng et al., 2019), which is commonly used in multi-task prediction methods.

2.2.1 PDBbind

The PDBbind dataset has four commonly used versions: 2013, 2016, 2018, and 2020. Each version consists of two distinct parts: the general set and the refined set (Table 1). For example, PDBbind version 2016 includes a total of 13,283 samples: 9,226 in the general set and 4,057 in the refined set. The refined set is selected according to quality protocols, including measured resolution and experimental precision. This filtering excludes complexes with resolutions above 2.5 Å or R factors exceeding 0.25, covalently bound ligands, ternary complexes, complexes with steric hindrance, and complexes whose binding affinity is not reported as Kd (dissociation constant) or Ki (inhibition constant) or falls outside the required range (for example, Kd < 1 pM). Each sample represents a drug-target pair labeled with an affinity value, such as the dissociation constant (Kd). Notably, each sample provides the drug SMILES and target sequence, as well as the 3D structure of the target and pocket information related to drug-target binding. Thanks to the extensive sequence and structural information available for drugs, targets, and pockets, PDBbind has become widely recognized as a universal dataset for predicting DTA in sequence-, hybrid-, and structure-based deep learning methods. Furthermore, the CASF series datasets used to test model performance in certain studies (Stepniewska-Dziubinska et al., 2018; Wang et al., 2022), such as CASF-2013 (Li et al., 2014) and CASF-2016 (Su et al., 2019), are the core sets derived from the corresponding refined sets of PDBbind.

TABLE 1

Statistics of the commonly used versions of the PDBbind dataset.

Version	Total number of samples	General set	Refined set
2013	11,184	8,225	2,959
2016	13,283	9,226	4,057
2018	16,126	11,663	4,463
2020	19,443	14,127	5,316

2.2.2 Davis and KIBA

The Davis dataset (Table 2) comprises 68 compounds and 442 proteins, giving a total of 30,056 compound-protein affinity samples, each labeled with the dissociation constant (Kd). All drug-target pairs whose bioactivity could not be measured experimentally are assigned a bioactivity value of 10 μM (corresponding to a pKd of 5) in the Davis dataset, and the number of such data points is very large. Consequently, some methods remove the data points with a bioactivity value of 10 μM, creating what is known as the Filtered Davis dataset (Rifaioglu et al., 2021). The KIBA dataset (Table 2) includes 246,088 interaction samples derived from 467 proteins and 52,498 compounds. Notably, KIBA contains three types of labels: half-maximal inhibitory concentration (IC50), dissociation constant (Kd), and inhibition constant (Ki). Because they provide only the sequences of drugs and targets, Davis and KIBA are predominantly used by sequence-based deep learning methods. Nonetheless, a few hybrid- or structure-based approaches have also successfully used these datasets. Although Davis and KIBA do not include the 3D structures of targets and drugs, these can be downloaded from the PDB and ZINC databases, respectively.
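The correspondence between a bioactivity of 10 μM and a pKd of 5 follows from pKd = -log10(Kd in mol/L). The sketch below illustrates this conversion and the kind of filtering that yields a Filtered Davis-style subset. It assumes Kd values are supplied in nanomolar; the function names and the toy records are hypothetical and are not taken from any published pipeline.

```python
import math

def kd_nm_to_pkd(kd_nm: float) -> float:
    """Convert a Kd given in nanomolar (assumed unit) to pKd = -log10(Kd in mol/L)."""
    return -math.log10(kd_nm * 1e-9)

def filter_unmeasured(samples, placeholder_pkd=5.0, tol=1e-9):
    """Drop samples whose pKd equals the placeholder value (10 uM, i.e. pKd 5)."""
    return [s for s in samples if abs(s["pkd"] - placeholder_pkd) > tol]

# Hypothetical toy records: (drug id, target id, Kd in nM).
raw = [("d1", "t1", 10000.0), ("d1", "t2", 12.0), ("d2", "t1", 10000.0), ("d2", "t2", 350.0)]
toy_samples = [{"drug": d, "target": t, "pkd": kd_nm_to_pkd(kd)} for d, t, kd in raw]
filtered = filter_unmeasured(toy_samples)
```

Working in pKd rather than raw Kd compresses a range spanning several orders of magnitude into a scale better suited to regression.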

TABLE 2

Detailed information on datasets Davis, Filtered Davis, and KIBA.

Dataset	Compounds	Proteins	Total number of samples
Davis	68	442	30,056
Filtered Davis	68	379	9,125
KIBA	52,498	467	246,088

2.2.3 BindingDB

The BindingDB dataset is primarily composed of drug-target pair samples collected from the scientific literature, encompassing four types of affinity labels: IC50, Kd, Ki, and EC50 (median effective concentration). Table 3 displays the number of drugs, targets, and drug-target interaction pairs in each label category. Notably, BindingDB provides drug SMILES and target sequences, while the structures of targets and drugs can be obtained from the PDB and ZINC databases, respectively. This comprehensive information enables BindingDB to be widely used in sequence-, hybrid-, and structure-based methods, typically to evaluate the generalization performance of DTA prediction methods.

TABLE 3

Details of BindingDB dataset.

Dataset	Label	Drugs	Targets	Total number of samples
BindingDB	IC50	265,627	2,793	376,751
	Kd	5,895	812	12,589
	Ki	93,437	1,619	144,525
	EC50	31,970	513	37,896

2.2.4 Metz

The Metz dataset comprises 1,423 drugs and 170 targets, resulting in a total of 35,259 drug-target pairs. Each pair is labeled with an affinity value represented by Ki (in the form of a pKi value). Furthermore, the relationship between drugs and targets can be accessed from the STITCH database, which consolidates diverse chemical and protein networks.

2.2.5 ToxCast

ToxCast is a toxicology research dataset derived from high-throughput in vitro screening of chemicals, primarily measuring AC50, the concentration at half of the maximum activity. The dataset is large, covers different types of proteins, and contains qualitative results from more than 600 experiments involving over 8,000 compounds. With around 530,000 observations of drug-target pairs and over 600 labels, it is well-suited for multi-task prediction. Its subsets are frequently used for case studies or for testing the generalization performance of DTA methods.

2.3 Introduction to supplementary databases for DTA prediction

2.3.1 UniProt

The UniProt database (The et al., 2021) is a protein sequence database that contains approximately 220 million protein sequences together with annotation information on the biological functions of proteins. It continues to add new protein entries and to supplement and update publicly available annotation information, and is widely regarded as the protein database with the most extensive collection and the most comprehensive annotation.

2.3.2 PDB

PDB database (Berman, 2000) is the premier collection of 3D structures for biological macromolecules, such as proteins, nucleic acids, etc., which contains the 3D structures of all resolved proteins. In addition to annotating the 3D structural information of proteins, PDB also provides various file types for downloading and visualizing the 3D structures of proteins.

2.3.3 STITCH

The STITCH database (Kuhn et al., 2007) is a valuable resource that includes information on interactions between 43,000 compounds and 9,643,763 proteins from 2,031 species. It shares protein interaction data with the STRING database (https://cn.string-db.org/), making it an important database for studying compound sequences. Each interaction in STITCH is assigned a score, which represents the affinity or probability of the interaction between a compound and a protein. STITCH also provides information on compounds that are similar to the drug of interest, along with their similarity scores.

2.3.4 ZINC

ZINC (Irwin and Shoichet, 2006) is a free database of commercially available compounds for virtual screening, which provides access to the 3D structures of over 230 million molecules. It offers multiple docking program interfaces, user-defined molecular operations, and web-based database search and browsing capabilities.

3 Representation

3.1 Sequence representation

Drug SMILES and target sequences are strings of characters. They are therefore commonly encoded using one-hot encoding or label encoding in sequence- and hybrid-based methods, and their sequence features are extracted using CNN, RNN, or Transformer. In structure-based methods, the features extracted from sequences are used as node features in the graphs of drugs and targets. In addition, traditional sequence features such as molecular fingerprints, position-specific scoring matrices (PSSM) (Altschul, 1997), and hidden Markov model (HMM) profiles (Remmert et al., 2012) are also widely employed in DTA prediction.
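The two encodings can be illustrated with a minimal sketch. It assumes character-level tokenization (so a two-letter element such as Cl would be split into two symbols), reserves index 0 for padding, and maps unseen characters to that same index; the function names are hypothetical and individual methods may use different vocabularies and maximum lengths.

```python
def build_vocab(strings):
    """Map every character seen in the corpus to an integer label; 0 is reserved for padding."""
    chars = sorted({ch for s in strings for ch in s})
    return {ch: i + 1 for i, ch in enumerate(chars)}

def label_encode(s, vocab, max_len):
    """Label encoding: one integer per character, truncated or zero-padded to max_len.
    Characters missing from the vocabulary also map to 0 (a simplification)."""
    ids = [vocab.get(ch, 0) for ch in s[:max_len]]
    return ids + [0] * (max_len - len(ids))

def one_hot_encode(s, vocab, max_len):
    """One-hot encoding: one row per position, one column per vocabulary entry.
    Padding positions are left as all-zero rows."""
    size = len(vocab) + 1
    rows = []
    for idx in label_encode(s, vocab, max_len):
        row = [0] * size
        if idx:
            row[idx] = 1
        rows.append(row)
    return rows

vocab = build_vocab(["CCO", "C=O"])       # toy SMILES corpus
labels = label_encode("CCO", vocab, 5)    # fixed-length integer vector
matrix = one_hot_encode("CCO", vocab, 5)  # 5 x (len(vocab) + 1) binary matrix
```

Label encoding is usually followed by a learned embedding layer, whereas a one-hot matrix can be fed to a convolutional layer directly.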

3.2 Structure representation

For drugs, the structure is usually represented as a graph. One common type of drug graph is derived from the drug SMILES, which can be converted using the RDKit tool. Another type is built from the 3D structure file, where atoms serve as vertices and bonds act as edges. Node features in the drug graph can be derived from the physicochemical properties of atoms or extracted from drug SMILES using deep learning techniques.

For targets, secondary structure information can be obtained directly from the relevant target file and is widely employed in traditional machine learning and deep learning methods. The tertiary structure graph of a target falls roughly into two types: contact map and spatial topology graph. A contact map is created from the sequence or tertiary structure of the target and records the interactions between amino acids. Structural features of the target can be extracted directly from the contact map using CNN models. Alternatively, the contact map can be converted into a target graph, allowing GNNs to extract structural features. The spatial topology graph of a target is constructed from the 3D structure file. Nodes in the graph represent amino acids, typically by their Cα atoms, and edges are formed based on distance thresholds, such as the Euclidean distance between Cα atoms.
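As a minimal sketch of the structure-derived case, the following code builds a binary contact map from one Cα coordinate per residue and converts it into an undirected edge list suitable for a GNN. The 8 Å cutoff, the function names, and the toy coordinates are assumptions made for illustration; individual methods choose their own cutoffs and may predict contacts from sequence instead.

```python
import math

def contact_map(ca_coords, threshold=8.0):
    """Binary residue-residue contact map from C-alpha coordinates (in angstroms).
    Two different residues are in contact if their distance is <= threshold."""
    n = len(ca_coords)
    return [
        [1 if i != j and math.dist(ca_coords[i], ca_coords[j]) <= threshold else 0
         for j in range(n)]
        for i in range(n)
    ]

def contact_map_to_edges(cmap):
    """Convert a symmetric contact map into an undirected edge list (i < j)."""
    n = len(cmap)
    return [(i, j) for i in range(n) for j in range(i + 1, n) if cmap[i][j]]

# Hypothetical toy chain: three residues close together and one far away.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (20.0, 0.0, 0.0)]
cmap = contact_map(coords)
edges = contact_map_to_edges(cmap)
```

The matrix form suits 2D-CNN models, while the edge list is the form typically consumed by graph libraries.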

3.3 Interaction network graph representation of drug-target complex

In DTA prediction, the interaction between a drug and its target is often represented as a graph, from which interaction features are extracted using a GNN. The interaction network graph is constructed from the 3D structure of the drug-target complex. The atoms of the drug and one representative carbon atom per amino acid of the target (typically the Cα atom) are selected as the vertices of the graph. The Euclidean distance between each drug atom and each representative carbon atom is then calculated. If the distance is less than or equal to a specified threshold (usually set to 8 Å or 10 Å), an edge is created to connect the corresponding atom to the amino acid. Any atoms and amino acids that do not participate in an edge are excluded. The resulting interaction network graph represents the drug-target complex and can be used for the analysis and prediction of DTA.
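The construction described above can be sketched as follows. The function name, the index-based node identifiers, and the toy coordinates are hypothetical; real pipelines would read coordinates from a complex structure file and attach node and edge features.

```python
import math

def interaction_graph(ligand_atoms, residue_ca, threshold=8.0):
    """Build a drug-target interaction graph from 3D coordinates (in angstroms).

    ligand_atoms: one (x, y, z) tuple per drug atom
    residue_ca:   one (x, y, z) tuple per residue (its C-alpha atom)
    Returns the retained atom indices, the retained residue indices, and the edges.
    """
    edges = [
        (a, r)
        for a, pa in enumerate(ligand_atoms)
        for r, pr in enumerate(residue_ca)
        if math.dist(pa, pr) <= threshold
    ]
    # Atoms and residues that take part in no edge are excluded from the graph.
    atoms = sorted({a for a, _ in edges})
    residues = sorted({r for _, r in edges})
    return atoms, residues, edges

# Hypothetical toy complex: two ligand atoms and three residues.
ligand = [(0.0, 0.0, 0.0), (50.0, 0.0, 0.0)]
residues_xyz = [(5.0, 0.0, 0.0), (0.0, 9.0, 0.0), (100.0, 0.0, 0.0)]
graph_8 = interaction_graph(ligand, residues_xyz, threshold=8.0)
graph_10 = interaction_graph(ligand, residues_xyz, threshold=10.0)
```

Raising the threshold from 8 Å to 10 Å admits more residues into the graph, which trades a richer interaction context for a larger graph.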

3.4 3D structural spatial grid representation of drug-target complex

While the interaction network graph of drug-target complex can provide valuable information about the structural features, some atoms and amino acids are ignored. As a result, deep learning methods that utilize a complete 3D structural spatial grid representation of drug-target complex are widely used. The 3D structural spatial grid representation of the complex is composed of the spatial coordinates of all atoms, and 3D-CNN is used to extract the spatial structural features from the complex’s 3D structure.
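A minimal sketch of such a grid is given below. It assumes coordinates already centred on the binding site, one occupancy channel per element type, and a deliberately coarse box so that the example stays small; the function name, the channel set, and the box parameters are illustrative assumptions rather than the settings of any specific method.

```python
import math

def voxelize(atoms, channels=("C", "N", "O"), box_size=8.0, resolution=2.0):
    """Map atoms onto a 3D occupancy grid indexed as grid[channel][x][y][z].

    atoms: list of (element, x, y, z) with coordinates in angstroms,
           centred on the region of interest.
    Atoms outside the box, or of an element without a channel, are skipped.
    """
    n = int(round(box_size / resolution))
    half = box_size / 2.0
    grid = [[[[0.0 for _ in range(n)] for _ in range(n)] for _ in range(n)]
            for _ in channels]
    for element, x, y, z in atoms:
        if element not in channels:
            continue
        idx = [math.floor((c + half) / resolution) for c in (x, y, z)]
        if all(0 <= i < n for i in idx):
            grid[channels.index(element)][idx[0]][idx[1]][idx[2]] += 1.0
    return grid

# Hypothetical toy complex: two atoms inside the box, one outside, one unsupported element.
toy_atoms = [("C", 0.0, 0.0, 0.0), ("O", -3.9, 3.9, 0.1), ("N", 10.0, 0.0, 0.0), ("S", 0.0, 0.0, 0.0)]
grid = voxelize(toy_atoms)
```

The resulting four-dimensional array (channels by three spatial axes) is the input shape expected by a 3D-CNN.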

4 Drug-target affinity prediction methods based on deep learning

Currently, computational methods for predicting DTA using deep learning can be broadly categorized into three groups based on the progression from sequence to structure: sequence-based, hybrid-based, and structure-based methods. In the following sections, we provide a comprehensive overview of the feature extraction process for each category.

4.1 Sequence-based deep learning methods

Sequence-based deep learning methods (Figure 2) utilize drug SMILES and target sequences as input. These methods employ various deep learning techniques, including CNN, RNN, Transformer, and attention mechanisms, to extract essential features from the input sequences. In the following sections, we will provide an overview of some classic sequence-based methods.

FIGURE 2

Overview of the architecture of sequence-based deep learning methods for predicting DTA.

4.1.1 DeepDTA

In DeepDTA (Öztürk et al., 2018), drug SMILES and target sequences were label-encoded and used as inputs. Sequence features were extracted by two independent CNN blocks, each comprising three 1D convolutional layers. Drug SMILES and target sequences were separately processed through embedding layers and passed into their respective CNN blocks, which extracted high-level sequence features from drugs and targets. Subsequently, the extracted sequence features were concatenated and fed into a three-layer FC network to predict DTA. DeepDTA not only showed superior performance compared to traditional machine learning methods, but also enabled automatic extraction of sequence features and end-to-end DTA prediction. This contributed to the transition from traditional machine learning methods to deep learning methods in the field of DTA prediction.
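The data flow of this kind of two-branch encoder can be sketched in plain Python. This is an illustration under stated assumptions, not the DeepDTA implementation: the weights are random and untrained, bias terms and the FC head are omitted, global max pooling after the last convolution is assumed, and the dimensions and toy label sequences are arbitrary.

```python
import random

def conv1d(seq, kernels):
    """Valid 1D convolution followed by ReLU. seq: [L][C_in]; kernels: [C_out][K][C_in]."""
    k, c_in = len(kernels[0]), len(seq[0])
    out = []
    for t in range(len(seq) - k + 1):
        out.append([
            max(0.0, sum(kern[i][c] * seq[t + i][c] for i in range(k) for c in range(c_in)))
            for kern in kernels
        ])
    return out

def global_max_pool(seq):
    """Collapse the sequence axis by taking the maximum of each channel."""
    return [max(col) for col in zip(*seq)]

def random_kernels(rng, c_out, k, c_in):
    return [[[rng.uniform(-0.5, 0.5) for _ in range(c_in)] for _ in range(k)] for _ in range(c_out)]

def encode(label_ids, embedding, layers):
    """Embedding lookup, then stacked 1D convolutions, then global max pooling."""
    x = [embedding[i] for i in label_ids]
    for kernels in layers:
        x = conv1d(x, kernels)
    return global_max_pool(x)

def make_branch(rng, vocab_size, emb_dim=4, n_filters=3, k=2, depth=3):
    """One independent branch: an embedding table plus `depth` convolutional layers."""
    embedding = [[rng.uniform(-1.0, 1.0) for _ in range(emb_dim)] for _ in range(vocab_size)]
    layers = [random_kernels(rng, n_filters, k, emb_dim)]
    layers += [random_kernels(rng, n_filters, k, n_filters) for _ in range(depth - 1)]
    return embedding, layers

rng = random.Random(0)
drug_emb, drug_layers = make_branch(rng, vocab_size=6)
prot_emb, prot_layers = make_branch(rng, vocab_size=5)
drug_vec = encode([1, 1, 2, 3, 1, 4, 5, 2], drug_emb, drug_layers)    # toy label-encoded SMILES
prot_vec = encode([1, 2, 3, 4, 1, 2, 3, 4, 2], prot_emb, prot_layers)  # toy label-encoded protein
joint = drug_vec + prot_vec  # concatenated features that an FC network would consume
```

Keeping the two branches independent lets each learn filters suited to its own alphabet before the representations are merged.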

4.1.2 DeepCDA

In DeepCDA (Abbasi et al., 2020), drug SMILES and target sequences were used as inputs. Both were first encoded by coding layers, and the encoded representations were then separately fed into identical feature extraction networks. Each feature extraction network consisted of two components: a CNN block and an LSTM block. The CNN block comprised three convolutional layers, responsible for extracting short-distance features from the sequences. These short-distance features were subsequently fed into a multi-layer LSTM block to capture long-distance dependencies. By combining CNN and LSTM, DeepCDA accounted for both local and long-range features of the sequence. To further extract the information that most influences drug-target interaction, a bidirectional attention mechanism was employed to fuse the extracted sequence features, so that the fused representation reflects the interaction between drugs and targets. Finally, the fused features were fed into an FC layer to predict DTA.

4.1.3 AttentionDTA

AttentionDTA (Zhao et al., 2022) took drug SMILES and target sequences as input, which were encoded using label encoding. A character embedding layer was inserted between the label encoding layer and the feature extraction block to convert drug SMILES and target sequences into embedding matrices. These matrices were then passed through a CNN block consisting of multiple 1D-CNN layers to extract implicit sequence features. To capture the non-covalent interactions between the atoms of the drug and the amino acids of the target, AttentionDTA incorporated a bilateral multi-head attention mechanism. This mechanism took the features extracted by the CNN block as input, allowing it to capture the interaction information that affects drug-target binding. The resulting key interaction information was subsequently fed into a multi-layer perceptron (MLP) for DTA prediction.
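The idea of attending in both directions between drug and target features can be sketched as follows. This is a simplified, single-head scaled dot-product attention without learned projections, written only to illustrate the principle; it is not the formulation used in AttentionDTA, and the feature vectors are made-up values.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross_attend(queries, keys):
    """For each query vector, return a softmax-weighted sum of the key vectors."""
    dim = len(keys[0])
    attended = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(dim) for k in keys])
        attended.append([sum(w * k[d] for w, k in zip(weights, keys))
                         for d in range(dim)])
    return attended


# Hypothetical CNN outputs: 3 drug positions and 2 target positions, 2 features each.
drug_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
target_feats = [[0.5, 0.5], [2.0, 0.0]]

drug_ctx = cross_attend(drug_feats, target_feats)    # drug positions attend to the target
target_ctx = cross_attend(target_feats, drug_feats)  # target positions attend to the drug
```

Each row of `drug_ctx` summarizes the target from the viewpoint of one drug position, and vice versa for `target_ctx`; a downstream MLP would operate on pooled versions of these.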

4.2 Hybrid-based deep learning methods

Hybrid-based deep learning methods (Figure 3) have been at the forefront of utilizing the structural features of drugs. The process begins by obtaining the graph representation of a drug directly from its SMILES using the RDKit toolkit. Subsequently, a GNN is employed to learn implicit high-level structural features from the graph. Finally, the extracted structural features are combined with the sequence features of the target to predict DTA. These methods integrate sequence and structural information to enhance prediction performance.

FIGURE 3

The overview architecture of hybrid-based deep learning methods for predicting DTA.

4.2.1 GraphDTA

GraphDTA (Nguyen et al., 2021) was a representative hybrid-based deep learning method for predicting DTA. It leveraged the structural features of drugs and the sequence features of targets. First, the drug SMILES was converted into a molecular graph using the RDKit toolkit, and a three-layer GNN was employed to extract structural features from it. The target sequence passed through label encoding and embedding layers before being fed into a convolutional block comprising three 1D-CNN layers to learn sequence features. Finally, the structural features of the drug and the sequence features of the target were combined and fed into a multi-layer FC network to estimate DTA.
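A minimal sketch of the graph view of a drug is given below. In practice the atoms and bonds would be derived from SMILES with RDKit; here they are written by hand for ethanol (SMILES `CCO`) so that the example needs no external library. The one-hot atom features and the unweighted mean aggregation are simplifying assumptions and do not reproduce the GNN layers used in GraphDTA.

```python
# Hand-written heavy-atom graph for ethanol (SMILES "CCO").
atoms = ["C", "C", "O"]
bonds = [(0, 1), (1, 2)]


def one_hot(symbol, alphabet=("C", "N", "O")):
    """Encode an element symbol as a one-hot feature vector."""
    return [1.0 if symbol == a else 0.0 for a in alphabet]


def adjacency_list(n_atoms, bonds):
    """Neighbour lists with self-loops, so each atom keeps part of its own features."""
    neighbours = {i: [i] for i in range(n_atoms)}
    for a, b in bonds:
        neighbours[a].append(b)
        neighbours[b].append(a)
    return neighbours


def mean_aggregate(features, neighbours):
    """One message-passing step: average the features of an atom and its neighbours."""
    updated = []
    for i in range(len(features)):
        group = [features[j] for j in neighbours[i]]
        updated.append([sum(col) / len(group) for col in zip(*group)])
    return updated


def mean_pool(features):
    """Collapse per-atom features into a single graph-level vector."""
    return [sum(col) / len(features) for col in zip(*features)]


node_feats = [one_hot(a) for a in atoms]
neigh = adjacency_list(len(atoms), bonds)
hidden = mean_aggregate(node_feats, neigh)
graph_vector = mean_pool(hidden)
```

A trained GNN would additionally apply learned weight matrices and nonlinearities at each step; the graph-level vector plays the role of the drug representation that is concatenated with the target's sequence features.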

4.2.2 MGraphDTA

MGraphDTA (Yang et al., 2022) took a similar approach, leveraging drug structures and target sequences. However, it enhanced global structural feature extraction by employing a deeper multi-scale GNN (MGNN), which captured the global relationships between the atoms of a drug as well as features at different scales within its structure. Simultaneously, a multi-scale CNN (MCNN) was applied to extract multi-scale features from target sequences. The multi-scale features of the drug structure and of the target sequence were then fused separately, and the two fused feature vectors were concatenated to form a combined representation of the drug-target pair. Finally, the combined representation was fed into an MLP to predict DTA.

4.2.3 ColdDTA

Deep learning methods have exhibited promising performance on randomly split public datasets, but their performance tends to decrease significantly in practical scenarios. To address this issue, ColdDTA (Fang et al., 2023) used the structural knowledge of drugs and target sequence information, and improved the model’s generalization performance through data augmentation and attention-based feature fusion. ColdDTA was constructed as follows. First, a new drug-target pair was generated by removing a subgraph from the original drug graph. Next, the structural features of the drug and the sequence features of the target were extracted using GNN and CNN, respectively. These features were then fused via an attention-based fusion block to better capture the interaction mechanism between drug and target. Finally, the fused features were fed into an MLP to predict DTA.
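The subgraph-removal augmentation can be sketched as below. The removal rule (a connected set of atoms grown by breadth-first search from a random start) and the removal fraction are assumptions made for illustration; the exact procedure and ratio used by ColdDTA are not specified in this review.

```python
import random


def remove_subgraph(n_atoms, bonds, fraction=0.2, seed=0):
    """Drop a connected set of atoms and re-index the remaining graph.

    Returns the kept original atom indices and the bonds among them,
    expressed in the new (compacted) indexing.
    """
    rng = random.Random(seed)
    neighbours = {i: set() for i in range(n_atoms)}
    for a, b in bonds:
        neighbours[a].add(b)
        neighbours[b].add(a)

    n_remove = max(1, int(n_atoms * fraction))
    start = rng.randrange(n_atoms)
    removed, frontier = [start], [start]
    # Grow the removed set outward from the start atom until it is large enough.
    while frontier and len(removed) < n_remove:
        current = frontier.pop(0)
        for nb in sorted(neighbours[current]):
            if nb not in removed and len(removed) < n_remove:
                removed.append(nb)
                frontier.append(nb)

    kept = [i for i in range(n_atoms) if i not in removed]
    new_index = {old: new for new, old in enumerate(kept)}
    new_bonds = [(new_index[a], new_index[b]) for a, b in bonds
                 if a in new_index and b in new_index]
    return kept, new_bonds


# Hypothetical 6-atom chain; about one third of the atoms is removed.
chain_bonds = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
kept, new_bonds = remove_subgraph(6, chain_bonds, fraction=0.34, seed=0)
```

The augmented graph is then paired with the same target to form an additional training sample.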

4.3 Structure-based deep learning methods

Currently, structure-based deep learning methods for predicting DTA can be broadly divided into two types. The first type extracts structural features from the molecular graphs of drugs and targets using GNN, and then fuses the extracted features to predict DTA with an FC network (Figure 4A). The second type is based on the 3D structures of drug-target complexes, from which high-level structural features are extracted using 3D-CNN to predict DTA (Figure 4B). With the emergence of AlphaFold and ColabFold, obtaining the structures of targets has become more feasible. Furthermore, the rapid development of GNN and 3D-CNN has provided critical support for extracting structural features. As a result, structure-based methods have attracted increasing attention from researchers.

FIGURE 4

The overview architecture of structure-based deep learning methods. (A) The extraction of structural features from molecular graphs of drugs and targets using Graph Neural Networks (GNN), and (B) the extraction of structural features of drug-target complexes from their 3D structures using 3D Convolutional Neural Networks (3D-CNN).

4.3.1 GSAML-DTA

GSAML-DTA (Liao et al., 2022) employed a hybrid network model combining GNN and GAT to extract structural features from drugs and targets. First, drug SMILES and target sequences were converted into drug molecular graphs and contact maps, respectively, using different tools. The molecular graphs and contact maps were then separately fed into the GNN-GAT hybrid network with an attention mechanism to extract the structural features of drugs and targets. The extracted features were concatenated and further optimized through an interactive information module. Finally, the optimized features were fed into an FC network to predict DTA. By combining the GNN-GAT hybrid network, the attention mechanism, and the interactive module, GSAML-DTA aimed to enhance the accuracy of DTA prediction.

4.3.2 HGRL-DTA

HGRL-DTA (Chu et al., 2022) utilized a hierarchical graph representation learning model for predicting DTA. The model established a hierarchical graph framework in which the drug-target binding affinity data were represented as an affinity graph, with drugs and targets serving as vertices, while each drug and each target was itself represented as a molecular graph. First, a GNN was employed to learn the global-level affinity relationships within the affinity graph. GNNs were also used to separately capture the local chemical structural features of drugs and targets. Through a message propagation mechanism, the learned hierarchical graph information was integrated, and the structural features of drugs and targets were refined using GCN. Finally, the extracted structural features of drugs and targets were combined and fed into an FC network to predict DTA. By leveraging the hierarchical graph setup, GNN-based representation learning, and the message propagation mechanism, HGRL-DTA aimed to improve the accuracy of DTA prediction.

4.3.3 MSGNN-DTA

MSGNN-DTA (Wang et al., 2023b) employed a multi-scale graph construction approach to capture the structural features of drugs and targets from multiple perspectives. For drugs, two types of graphs were constructed. First, an atom-level graph was generated from the drug SMILES using the RDKit toolkit. In this graph, atoms were represented as vertices, chemical bonds between atoms as edges, and the topology as a two-dimensional matrix. Second, a motif-level graph was created by treating certain motifs (e.g., benzene rings) as vertices, with edges indicating chemical bond connections between motifs. For targets, the sequence was converted into a contact map using ESM-1b (Rives et al., 2021). Additionally, a weight map was constructed following WGNN-DTA (Jiang et al., 2022). In the weight map, residues served as vertices, interactions between residues served as edges, and the edge weights were probability values. To obtain multi-scale topological feature representations, GNN was used to extract high-level structural features from the atom-level and motif-level graphs of drugs, as well as from the weight maps of targets. Subsequently, an attention mechanism was employed to fuse the multi-scale structural features and generate a joint feature representation. The joint feature representation was then fed into a multi-layer FC network for DTA prediction.
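The weight map described above can be illustrated with a small sketch that turns a matrix of predicted residue-residue contact probabilities into weighted edges. The probability matrix is invented for the example, and the threshold of 0.5 is an assumption; the review does not state which cutoff, if any, MSGNN-DTA or WGNN-DTA apply.

```python
def weighted_edges(contact_probs, threshold=0.5):
    """Return (i, j, weight) for residue pairs whose contact probability meets the threshold."""
    n = len(contact_probs)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):  # upper triangle only: the map is symmetric
            p = contact_probs[i][j]
            if p >= threshold:
                edges.append((i, j, p))
    return edges


# Hypothetical contact probabilities for a 4-residue fragment.
contact_probs = [
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.8, 0.3],
    [0.2, 0.8, 1.0, 0.7],
    [0.1, 0.3, 0.7, 1.0],
]
edges = weighted_edges(contact_probs)
```

Residues become vertices, and each retained pair becomes an edge whose weight is the predicted contact probability, which a GNN can use to scale the messages passed between residues.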

4.3.4 Sfcnn

Aside from using GNN to extract structural features from the molecular graphs of drugs and targets, some methods use 3D-CNN to extract structural features from drug-target complexes. One such method was Sfcnn, which employed 3D-CNN to build a scoring function for DTA prediction. First, the drug-target complex was transformed into a 3D grid representation. This grid served as the input to the 3D-CNN, which learned high-level structural features. Finally, multiple dense layers were applied to the extracted features for DTA prediction.
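How a complex becomes a 3D grid can be sketched as follows. The grid size, the 1 Å resolution, the three element channels, and the atom coordinates are all illustrative assumptions; the featurization actually used by Sfcnn is not detailed in this review.

```python
def voxelize(atoms, grid_size=4, resolution=1.0, channels=("C", "N", "O")):
    """Count atoms per element channel in each voxel of a cubic grid centred on the origin.

    `atoms` is a list of (element, x, y, z) tuples; atoms outside the grid or with
    an element that has no channel are skipped.
    """
    half = grid_size * resolution / 2.0
    grid = [[[[0.0] * len(channels) for _ in range(grid_size)]
             for _ in range(grid_size)] for _ in range(grid_size)]
    for element, x, y, z in atoms:
        if element not in channels:
            continue
        idx = [int((c + half) // resolution) for c in (x, y, z)]
        if all(0 <= i < grid_size for i in idx):
            grid[idx[0]][idx[1]][idx[2]][channels.index(element)] += 1.0
    return grid


# Hypothetical atoms of a complex, already centred on the binding site.
complex_atoms = [("C", 0.2, 0.1, -0.3), ("O", 1.4, 0.0, 0.0), ("N", 9.0, 0.0, 0.0)]
grid = voxelize(complex_atoms)
occupied = sum(v for plane in grid for row in plane for cell in row for v in cell)
```

The resulting four-dimensional array (three spatial axes plus one channel axis) has the shape a 3D-CNN expects as input; the nitrogen atom in the example lies outside the grid and is therefore ignored.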

5 Performance analysis of multiple state-of-the-art methods based on deep learning

5.1 Common performance evaluation metrics

In this review, DTA prediction is treated as a regression task. Commonly used performance evaluation metrics include Mean Absolute Error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), Pearson Correlation Coefficient (PCC), Spearman's rank correlation coefficient ($\rho$), Concordance Index (CI), and $R^2$.

MAE (Eq. 1) is used to measure the mean absolute error between the predicted and actual values. It reflects the size of the actual prediction error.

$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|y_i - \hat{y}_i\right|, \quad \mathrm{MAE} \in [0, +\infty) \tag{1}$$

MSE (Eq. 2) and RMSE (Eq. 3) are often used to measure the deviation between the predicted and actual values. They are used to compare the prediction errors of different regression models on a given dataset. For MAE, MSE, and RMSE, smaller values indicate a better model.

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2, \quad \mathrm{MSE} \in [0, +\infty) \tag{2}$$

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}, \quad \mathrm{RMSE} \in [0, +\infty) \tag{3}$$

$R^2$ (Eq. 4) measures how well the predicted values fit the actual values. A model that predicts every value exactly attains the maximum $R^2$ of 1. If $R^2$ is 0, the model performs no better than a baseline model that always predicts the mean $\bar{y}$ of the actual values. If $R^2$ is less than 0, the model is worse than this baseline.

$$R^2 = 1 - \frac{\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2}{\sum_{i=1}^{n}\left(y_i - \bar{y}\right)^2}, \quad R^2 \in (-\infty, 1] \tag{4}$$

In Eqs 1–4, $n$ is the number of samples, $y_i$ is the actual value of sample $i$, $\hat{y}_i$ is the corresponding predicted value, and $\bar{y}$ is the mean of all actual values $y_i$ ($1 \le i \le n$).
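Eqs 1–4 can be computed directly from paired lists of actual and predicted affinities. The following is a minimal sketch; the function name and the example values are illustrative and do not come from any benchmark dataset.

```python
import math


def regression_metrics(y_true, y_pred):
    """Compute MAE, MSE, RMSE and R^2 for paired actual and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                 # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot                          # undefined if all y_true are equal
    return {"MAE": mae, "MSE": mse, "RMSE": rmse, "R2": r2}


# Made-up affinities for four drug-target pairs.
y_true = [5.0, 6.0, 7.0, 8.0]
y_pred = [5.5, 6.0, 6.5, 9.0]
metrics = regression_metrics(y_true, y_pred)
```

For these values the absolute errors are 0.5, 0, 0.5 and 1, giving MAE = 0.5, MSE = 0.375 and $R^2$ = 0.7.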

PCC (Eq. 5) measures the strength of the linear correlation between two variables $X$ and $Y$, and its range is [−1, 1]. $\mathrm{cov}(X, Y)$ is the covariance of $X$ and $Y$, and $\sigma_X$ and $\sigma_Y$ are the standard deviations of $X$ and $Y$. If $\rho_{X,Y} > 0$, $X$ and $Y$ are positively correlated; if $\rho_{X,Y} < 0$, they are negatively correlated; if $\rho_{X,Y} = 0$, they are not linearly correlated.

$$\rho_{X,Y} = \frac{\mathrm{cov}(X, Y)}{\sigma_X \sigma_Y}, \quad \rho_{X,Y} \in [-1, 1] \tag{5}$$

Spearman's rank correlation coefficient (Eq. 6) is a nonparametric measure of the dependence between two variables. Here $n$ is the number of samples and $d_i$ ($1 \le i \le n$) is the difference between the rank of the predicted value and the rank of the actual value of sample $i$. The closer $\rho$ is to +1 or −1, the stronger the correlation between the two variables.

$$\rho = 1 - \frac{6\sum_{i=1}^{n} d_i^2}{n\left(n^2 - 1\right)} \tag{6}$$
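Both correlation coefficients can be sketched in a few lines. The Spearman implementation below uses the rank-difference form of Eq. 6, which is exact only when there are no tied values; the helper names and example values are illustrative.

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)  # the 1/n factors cancel


def ranks(values):
    """Rank from 1 (smallest) to n; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def spearman(x, y):
    """Spearman correlation from squared rank differences (no-ties formula)."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1.0 - 6.0 * d_squared / (n * (n * n - 1))


y_true = [5.0, 6.0, 7.0, 8.0]
y_pred = [5.5, 6.0, 6.5, 9.0]
pcc = pearson(y_true, y_pred)
rho = spearman(y_true, y_pred)
```

In this example the predictions preserve the ordering of the actual values, so the Spearman coefficient is 1 even though the Pearson coefficient is below 1.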

CI (Eq. 7) evaluates how well the model ranks drug-target pairs: it measures whether the predicted values of two pairs with different affinities are in the same order as their actual affinities. In Eq. 7, $b_i$ is the predicted value for the larger affinity $\delta_i$, $b_j$ is the predicted value for the smaller affinity $\delta_j$, and $Z$ is a normalization constant. The function $\varphi(x)$ is 1 if $x$ is greater than 0, 0.5 if $x$ is equal to 0, and 0 if $x$ is less than 0.

$$\mathrm{CI} = \frac{1}{Z}\sum_{\delta_i > \delta_j} \varphi\left(b_i - b_j\right), \quad 1 \le i, j \le n \tag{7}$$
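A direct, quadratic-time sketch of Eq. 7 is shown below. It takes $Z$ to be the number of pairs with different actual affinities, which is the usual choice but is an assumption here since the review does not define $Z$ further; the example values are made up.

```python
def concordance_index(y_true, y_pred):
    """Fraction of pairs with different actual affinities that the predictions order correctly.

    Ties in the predictions count as half a correct pair. Raises ZeroDivisionError
    if all actual values are equal, because no comparable pair exists.
    """
    score, pairs = 0.0, 0
    n = len(y_true)
    for i in range(n):
        for j in range(n):
            if y_true[i] > y_true[j]:
                pairs += 1
                diff = y_pred[i] - y_pred[j]
                score += 1.0 if diff > 0 else 0.5 if diff == 0 else 0.0
    return score / pairs


ci = concordance_index([5.0, 6.0, 7.0, 8.0], [5.5, 6.0, 6.0, 9.0])
```

Of the six comparable pairs in this example, five are ordered correctly and one is tied in the predictions, so CI = 5.5 / 6.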

5.2 Performance analysis of multiple state-of-the-art methods based on PDBbind, KIBA, and Davis datasets

Figure 1 highlights PDBbind, KIBA, and Davis as commonly used datasets for predicting DTA with deep learning. We summarized the performance evaluation metric values of several state-of-the-art methods on the PDBbind, KIBA, and Davis datasets, as reported in recently published literature (Wang et al., 2023a; Zhu et al., 2023a; Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024), without accounting for differences in how each method partitioned the corresponding dataset. Although the results (Tables 4, 5; Figures 5–7) show that these methods achieve good DTA prediction performance on commonly used benchmark datasets, further improvement remains challenging. Researchers are actively working on extracting high-level implicit features from the sequences, structures, or complexes of drugs and targets, with the aim of developing novel methods with even better performance for predicting DTA.

TABLE 4

Performance comparison of multiple state-of-the-art methods based on PDBbind dataset.

Methods	MSE	RMSE	CI
Pafnucy	1.418	1.129	0.789
DeepDTA	1.443	1.148	0.771
DeepDTAF	1.355	1.073	0.799
FusionDTA	1.504	1.2	0.766
DataDTA	1.274	1.012	0.806
GraphDTA	1.579	1.193	0.66
WideDTA	1.633	1.295	0.638
DeepGS	1.385	1.096	0.784
DeepFusionDTA	1.235	1.203	0.774
AffinityVAE	1.398	1.102	0.792

TABLE 5

Performance comparison of multiple state-of-the-art methods based on KIBA and Davis datasets.

Dataset	Methods	MSE	CI	Methods	MSE	CI
KIBA	GanDTI	0.469	0.878	DGraphDTA	0.127	0.902
	GraphDTA	0.441	0.881	HiSIF-DTA	0.12	0.904
	WGNNDTA	0.43	0.886	GTAMP-DTA	0.123	0.917
	MGraphDTA	0.427	0.889	TransVAE-DTA	0.2536	0.8221
	UCMCDTA	0.421	0.891	AttentionMGT-DTA	0.14	0.893
Davis	GanDTI	0.236	0.885	DGraphDTA	0.202	0.905
	GraphDTA	0.225	0.895	HiSIF-DTA	0.191	0.907
	WGNNDTA	0.211	0.898	GTAMP-DTA	0.177	0.923
	MGraphDTA	0.205	0.899	TransVAE-DTA	0.3229	0.8596
	UCMCDTA	0.203	0.9	AttentionMGT-DTA	0.193	0.891

FIGURE 5

Performance analysis of multiple state-of-the-art methods based on PDBbind dataset. The general set and refined set are used as the training dataset, while the core set serves as the test dataset. The evaluation metric values of these methods in the figure are sourced from References (Wang et al., 2023a; Zhu et al., 2023a).

FIGURE 6

Performance analysis of multiple state-of-the-art methods based on KIBA dataset. The evaluation metric values of these methods in the figure are sourced from References (Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024).

FIGURE 7

Performance analysis of multiple state-of-the-art methods based on Davis dataset. The evaluation metric values of these methods in the figure are sourced from References (Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024).

6 Conclusion

Deep learning-based computational methods for DTA prediction have become a crucial component of drug discovery in the pharmaceutical industry. Despite the significant progress achieved by these methods, there is still a gap between their current prediction accuracy and the expectations of researchers. Therefore, to further facilitate the development of novel and high-precision computational methods for DTA prediction, this review provides detailed statistics, summaries, and elaboration on commonly used datasets, the sequence and structural representations of drugs and targets, as well as representative deep learning methods.

From the comprehensive overview of advanced methods for predicting DTA based on deep learning, three key points stand out:

1. It is essential to thoughtfully combine deep learning models like CNN, RNN, and GNN to extract crucial implicit features influencing DTA prediction from the sequences, structures, and other data related to drugs and targets.

2. Deep learning models are employed to extract numerous features from diverse modalities of drugs and targets. Further refinement and effective fusion of these features are vital to obtain comprehensive deep features.

3. Most DTA prediction methods using deep learning lack explanations of their effectiveness. This absence hinders researchers from enhancing current methods.

In the future, it is imperative to delve into DTA prediction methods based on deep learning from three key perspectives:

1. Alongside commonly used deep learning models like CNN, RNN, and GNN, it is essential to incorporate unsupervised learning models such as contrastive learning to comprehensively capture the pivotal features influencing DTA prediction.

2. Tools such as AlphaFold have made the structures of targets much easier to obtain, and these structures play a crucial role in determining molecular function. Hence, delving deeper into the three-dimensional spatial structural features of drugs and targets will help enhance the performance of DTA prediction.

3. While some deep learning-based methods for DTA prediction have shown promising results on standard datasets, their generalization performance is not satisfactory. Therefore, focusing on selecting specific datasets within particular fields and constructing deep learning models for DTA prediction that directly cater to practical application requirements will emerge as a prominent area of research interest.

Author contributions

XZ: Writing–original draft. S-JL: Investigation, Supervision, Writing–review and editing. S-QL: Visualization, Writing–review and editing. M-LW: Project administration, Writing–review and editing. YL: Writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Natural Science Foundation of China (No. 62366002), Yunnan Fundamental Research Projects (No. 202101BA070001-227), Yunnan Young and Middle-aged Academic and Technical Leaders Reserve Talent Project in China (No. 202405AC350023), and a grant (No. 2023KF005) from State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abbasi, Razzaghi, Poso, Amanlou, Ghasemi, J. B., Masoudi-Nejad (2020). DeepCDA: deep cross-domain compound–protein affinity prediction through LSTM and convolutional neural networks. Bioinformatics 36, 4633–4642. doi:10.1093/bioinformatics/btaa544

Abbasi, Razzaghi, Poso, Ghanbari-Ara, Masoudi-Nejad (2021). Deep learning in drug target interaction prediction: current and future perspectives. Curr. Med. Chem. 28, 2100–2113. doi:10.2174/0929867327666200907141016

Altschul, Madden, T. L., Schäffer, A. A., Zhang, Miller (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. doi:10.1093/nar/25.17.3389

Ballester, P. J., Mitchell, J. B. O. (2010). A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinformatics 26, 1169–1175. doi:10.1093/bioinformatics/btq112

Berman, H. M., Westbrook, Feng, Gilliland, Bhat, T. N., Weissig (2000). The Protein Data Bank. Nucleic Acids Res. 28, 235–242. doi:10.1093/nar/28.1.235

Bi, Zhang, Jiang, Wei (2023). HiSIF-DTA: a hierarchical semantic information fusion framework for drug-target affinity prediction. IEEE J. Biomed. Health Inf., 1–12. doi:10.1109/JBHI.2023.3334239

Chen, Liao, Wei (2022). MultiscaleDTA: a multiscale-based method with a self-attention mechanism for drug-target binding affinity prediction. Methods 207, 103–109. doi:10.1016/j.ymeth.2022.09.006

Cheng, Yan, F.-X., Wang (2022). Drug-target interaction prediction using multi-head self-attention and graph attention network. IEEE/ACM Trans. Comput. Biol. Bioinform. 19, 2208–2218. doi:10.1109/TCBB.2021.3077905

Chu, Huang, Quan, Zhou, Liu (2022). Hierarchical graph representation learning for the prediction of drug-target binding affinity. Inf. Sci. 613, 507–523. doi:10.1016/j.ins.2022.09.043

Çınaroğlu, S. S., Timuçin (2020). Comprehensive evaluation of the MM-GBSA method on bromodomain-inhibitor sets. Brief. Bioinform. 21, 2112–2125. doi:10.1093/bib/bbz143

Davis, M. I., Hunt, J. P., Herrgard, Ciceri, Wodicka, L. M., Pallares (2011). Comprehensive analysis of kinase inhibitor selectivity. Nat. Biotechnol. 29, 1046–1051. doi:10.1038/nbt.1990

Dehghan, Razzaghi, Abbasi, Gharaghani (2023). TripletMultiDTI: multimodal representation learning in drug-target interaction prediction with triplet loss function. Expert Syst. Appl. 232, 120754. doi:10.1016/j.eswa.2023.120754

Fang, Zhang (2023). ColdDTA: utilizing data augmentation and attention-based feature fusion for drug-target binding affinity prediction. Comput. Biol. Med. 164, 107372. doi:10.1016/j.compbiomed.2023.107372

Feng, Dueva, Cherkasov, Ester (2019). PADME: a deep learning-based framework for drug-target interaction prediction. Available at: http://arxiv.org/abs/1807.09741 (Accessed March 16, 2024).

Ghimire, Tayara, Xuan, Chong, K. T. (2022). CSatDTA: prediction of drug–target binding affinity using convolution model with self-attention. Int. J. Mol. Sci. 23, 8453. doi:10.3390/ijms23158453

Gim, Choe, Baek, Park, Lee (2023). ArkDTA: attention regularization guided by non-covalent interactions for explainable drug–target binding affinity prediction. Bioinformatics 39, i448–i457. doi:10.1093/bioinformatics/btad207

Gomes, Ramsundar, Feinberg, E. N., Pande, V. S. (2017). Atomic convolutional networks for predicting protein-ligand binding affinity. Available at: http://arxiv.org/abs/1703.10603 (Accessed January 4, 2023).

Irwin, J. J., Shoichet, B. K. (2006). ZINC – a free database of commercially available compounds for virtual screening.

Jiang, Zhang, Wang, Yuan (2020). Drug–target affinity prediction using graph neural network and contact maps. RSC Adv. 10, 20701–20712. doi:10.1039/D0RA02297G

Jiang, Shao, Zhang, Zhou, Pang (2023). A deep learning method for drug-target affinity prediction based on sequence interaction information mining. PeerJ 11, e16625. doi:10.7717/peerj.16625

Jiang, Wang, Zhang, Zhou, Zhang (2022). Sequence-based drug-target affinity prediction using weighted graph neural networks. BMC Genomics 23, 449. doi:10.1186/s12864-022-08648-9

Jin, Chen, Pan, Wang, Xie (2023). CAPLA: improved prediction of protein–ligand binding affinity by a deep learning approach based on a cross-attention mechanism. Bioinformatics 39, btad049. doi:10.1093/bioinformatics/btad049

Jorgensen, W. L., Thomas, L. L. (2008). Perspective on free-energy perturbation calculations for chemical equilibria. J. Chem. Theory Comput. 4, 869–876. doi:10.1021/ct800011m

Jumper, Evans, Pritzel, Green, Figurnov, Ronneberger (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2

Kairys, Baranauskiene, Kazlauskiene, Matulis, Kazlauskas (2019). Binding affinity in drug design: experimental and computational techniques. Expert Opin. Drug Discov. 14, 755–768. doi:10.1080/17460441.2019.1623202

Kalemati, Zamani Emani, Koohi (2023). BiComp-DTA: drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach. PLOS Comput. Biol. 19, e1011036. doi:10.1371/journal.pcbi.1011036

Karimi, Wang, Shen (2019). DeepAffinity: interpretable deep learning of compound–protein affinity through unified recurrent and convolutional neural networks. Bioinformatics 35, 3329–3338. doi:10.1093/bioinformatics/btz111

Karimi, Wang, Shen (2021). Explainable deep relational networks for predicting compound–protein affinities and contacts. J. Chem. Inf. Model. 61, 46–66. doi:10.1021/acs.jcim.0c00866

Kim, Lee, Karin, E. L., Kim, Moriwaki, Ovchinnikov (2023). Easy and accurate protein structure prediction using ColabFold. Protoc. Exch. doi:10.21203/rs.3.pex-2490/v1

Kuhn, von Mering, Campillos, Jensen, L. J., Bork (2007). STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 36, D684–D688. doi:10.1093/nar/gkm795

Kwon, Shin, W.-H., Lee (2020). AK-score: accurate protein-ligand binding affinity prediction using an ensemble of 3D-convolutional neural networks. Int. J. Mol. Sci. 21, 8424. doi:10.3390/ijms21228424

Landrum (2013). RDKit: a software suite for cheminformatics, computational chemistry, and predictive modeling. Available at: https://www.rdkit.org/RDKit_Overview.pdf.

LeCun, Bengio, Hinton (2015). Deep learning. Nature 521, 436–444. doi:10.1038/nature14539

Li, Leung, K.-S., Wong, M.-H., Ballester (2015). Low-quality structural and interaction data improves binding affinity prediction via random forest. Molecules 20, 10947–10962. doi:10.3390/molecules200610947

Li, Zhang, Wang (2022a). PLA-MoRe: a protein–ligand binding affinity prediction model via comprehensive molecular representations. J. Chem. Inf. Model. 62, 4380–4390. doi:10.1021/acs.jcim.2c00960

Li, Zhou, Huang, Wang, Xiong (2021). “Structure-aware interactive graph neural networks for the prediction of protein-ligand binding affinity,” in Proceedings of the 27th ACM SIGKDD conference on knowledge discovery and data mining (USA: Virtual Event Singapore: ACM), 975–985. doi:10.1145/3447548.3467311

Li, Zhao, X.-M. (2022b). Co-VAE: drug-target binding affinity prediction by Co-regularized variational autoencoders. IEEE Trans. Pattern Anal. Mach. Intell. 44, 8861–8873. doi:10.1109/TPAMI.2021.3120428

Li, Han, Liu, Wang (2014). Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results. J. Chem. Inf. Model. 54, 1717–1736. doi:10.1021/ci500081m

Liao, Chen, Wei (2022). GSAML-DTA: an interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information. Comput. Biol. Med. 150, 106145. doi:10.1016/j.compbiomed.2022.106145

Lin, Shi, Chen (2022a). GeneralizedDTA: combining pre-training and multi-task learning to predict drug-target binding affinity for unknown drug discovery. BMC Bioinforma. 23, 367. doi:10.1186/s12859-022-04905-6

Lin, Liu, Zhang (2022b). Detecting drug–target interactions with feature similarity fusion and molecular graphs. Biology 11, 967. doi:10.3390/biology11070967

Liu, Wang, P.-S., Zhu, Gaines, B. B., Zhu (2021). OctSurf: efficient hierarchical voxel-based molecular surface representation for protein-ligand affinity prediction. J. Mol. Graph. Model. 105, 107865. doi:10.1016/j.jmgm.2021.107865

Liu, Lin, Wen, Jorissen, R. N., Gilson, M. K. (2007). BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 35, D198–D201. doi:10.1093/nar/gkl999

Lu, Wang, Tan, Pan (2023). Improving drug-target affinity prediction via feature fusion and knowledge distillation. Brief. Bioinform. 24, bbad145. doi:10.1093/bib/bbad145

Ma, Chen (2022). Drug-target binding affinity prediction method based on a deep graph neural network. Math. Biosci. Eng. 20, 269–282. doi:10.3934/mbe.2023012

Ma, Zhang, Jiang, Wang, Guo (2023). Predicting drug-target affinity by learning protein knowledge from biological networks. IEEE J. Biomed. Health Inf. 27, 2128–2137. doi:10.1109/JBHI.2023.3240305

Mekni, Fooladi, Perricone, Langer (2023). Encoding protein-ligand interactions: binding affinity prediction with multigraph-based modeling and graph convolutional network. Chemistry. doi:10.26434/chemrxiv-2023-bvps7-v2

Metz, J. T., Johnson, E. F., Soni, N. B., Merta, P. J., Kifle, Hajduk, P. J. (2011). Navigating the kinome. Nat. Chem. Biol. 7, 200–202. doi:10.1038/nchembio.530

Monteiro, N. R. C., Oliveira, J. L., Arrais, J. P. (2022). DTITR: end-to-end drug–target binding affinity prediction with transformers. Comput. Biol. Med. 147, 105772. doi:10.1016/j.compbiomed.2022.105772

Nguyen, Quinn, T. P., Venkatesh (2022). GraphDTA: predicting drug–target binding affinity with graph neural networks. PLoS One 17, e0278387. doi:10.1371/journal.pone.0278387

Nguyen, Quinn, T. P., Nguyen, T. D., Venkatesh (2021). GraphDTA: predicting drug-target binding affinity with graph neural networks. Bioinformatics 37, 1140–1147. doi:10.1093/bioinformatics/btaa921

Öztürk, Özgür, Ozkirimli (2018). DeepDTA: deep drug–target binding affinity prediction. Bioinformatics 34, i821–i829. doi:10.1093/bioinformatics/bty593

Pan, Xia (2023). SubMDTA: drug target affinity prediction based on substructure extraction and multi-scale features. BMC Bioinforma. 24, 334. doi:10.1186/s12859-023-05460-4

Pandey, Radaeva, Mslati, Garland, Fernandez, Ester (2022). Ligand binding prediction using protein structure graphs and residual graph attention networks. Molecules 27, 5114. doi:10.3390/molecules27165114

Rafiei, Zeraati, Abbasi, Ghasemi, J. B., Parsaeian, Masoudi-Nejad (2023). DeepTraSynergy: drug combinations using multimodal deep learning with transformers. Bioinformatics 39, btad438. doi:10.1093/bioinformatics/btad438

Remmert, Biegert, Hauser, Söding (2012). HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9, 173–175. doi:10.1038/nmeth.1818

Rifaioglu, A. S., Cetin Atalay, Cansen Kahraman, Doğan, Martin, Atalay (2021). MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery. Bioinformatics 37, 693–704. doi:10.1093/bioinformatics/btaa858

Rives, Meier, Sercu, Goyal, Lin, Liu (2021). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. 118, e2016239118. doi:10.1073/pnas.2016239118

Ru, Zou, Lin (2023). Optimization of drug–target affinity prediction methods through feature processing schemes. Bioinformatics 39, btad615. doi:10.1093/bioinformatics/btad615

Rube, H. T., Rastogi, Feng, Kribelbauer, J. F., Becerra (2022). Prediction of protein–ligand binding affinity from sequencing data with interpretable machine learning. Nat. Biotechnol. 40, 1520–1527. doi:10.1038/s41587-022-01307-0

Seo, Choi, Park, Ahn (2021). Binding affinity prediction for protein–ligand complex using deep attention mechanism based on intermolecular interactions. BMC Bioinforma. 22, 542. doi:10.1186/s12859-021-04466-0

Shar, P. A., Tao, Gao, Huang, Zhang (2016). Pred-binding: large-scale protein–ligand binding affinity prediction. J. Enzyme Inhib. Med. Chem. 31, 1443–1450. doi:10.3109/14756366.2016.1144594

Shen, Zhang, Zheng, Wang, Chen (2021). A cascade graph convolutional network for predicting protein–ligand binding affinity. Int. J. Mol. Sci. 22, 4023. doi:10.3390/ijms22084023

Shim, Hong, Z.-Y., Sohn, Hwang (2021). Prediction of drug–target binding affinity using similarity-based convolutional neural network. Sci. Rep. 11, 4416. doi:10.1038/s41598-021-83679-y

Stepniewska-Dziubinska, M. M., Zielenkiewicz, Siedlecki (2018). Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics 34, 3666–3674. doi:10.1093/bioinformatics/bty374

Su, Yang, Feng, Liu (2019). Comparative assessment of scoring functions: the CASF-2016 update. J. Chem. Inf. Model. 59, 895–913. doi:10.1021/acs.jcim.8b00545

Suzek, B. E., Wang, Huang, McGarvey, P. B., C. H., the UniProt Consortium (2015). UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 31, 926–932. doi:10.1093/bioinformatics/btu739

Tang, Szwajda, Shakyawar, Hintsanen, Wennerberg (2014). Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis. J. Chem. Inf. Model. 54, 735–743. doi:10.1021/ci400709d

The, U. P. C., Bateman, Martin, M.-J., Orchard, Magrane, Agivetova (2021). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489. doi:10.1093/nar/gkaa1100

Tian, Wang, Cui (2024). GTAMP-DTA: graph transformer combined with attention mechanism for drug-target binding affinity prediction. Comput. Biol. Chem. 108, 107982. doi:10.1016/j.compbiolchem.2023.107982

Tian, Ding, Yang, Yue, Zhong (2022). Predicting drug-target affinity based on recurrent neural networks and graph convolutional neural networks. Comb. Chem. High. Throughput Screen. 25, 634–641. doi:10.2174/1386207324666210215101825

Tsui, L.-I., Hsu, T.-C., Lin (2023). “NG-DTA: drug-target affinity prediction with n-gram molecular graphs,” in 2023 45th annual international conference of the IEEE engineering in medicine and biology society (EMBC) (Sydney, Australia: IEEE), 1–4. doi:10.1109/EMBC40787.2023.10339968

Vaswani

Shazeer

Parmar

Uszkoreit

Jones

Gomez

A. N.

(2017). Attention is all you need. Available at: http://arxiv.org/abs/1706.03762 (Accessed May 18, 2023). Wang

(2023). Fusion-based deep learning architecture for detecting drug-target binding affinity using target and drug sequence and structure. IEEE J. Biomed. Health Inf. 27, 6112–6120. 10.1109/JBHI.2023.3315073 Wang

Zhou

(2021a). DeepDTAF: a deep learning method to predict protein–ligand binding affinity. Brief. Bioinform. 22, bbab072. 10.1093/bib/bbab072 Wang

Luo

Han

Wang

(2023a). AffinityVAE: a multi-objective model for protein-ligand affinity prediction and drug design. Comput. Biol. Chem. 107, 107971. 10.1016/j.compbiolchem.2023.107971 Wang

Fang

Yang

C.-Y.

Wang

(2005). The PDBbind database: methodologies and updates. J. Med. Chem. 48, 4111–4119. 10.1021/jm048957q Wang

Shan

Zhao

Zuo

(2021b). GanDTI: a multi-task neural network for drug-target interaction prediction. Comput. Biol. Chem. 92, 107476. 10.1016/j.compbiolchem.2021.107476 Wang

Song

Zhang

Liu

Ren

(2023b). MSGNN-DTA: multi-scale topological feature fusion based on graph neural networks for drug–target binding affinity prediction. Int. J. Mol. Sci. 24, 8326. 10.3390/ijms24098326 Wang

Wei

(2022). Sfcnn: a novel scoring function based on 3D convolutional neural network for accurate and stable protein–ligand affinity prediction. BMC Bioinforma. 23, 222. 10.1186/s12859-022-04762-3 Weininger

(1988). SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Model. 28, 31–36. 10.1021/ci00057a005 Wu

Liu

Jiang

Zou

Cui

(2024). AttentionMGT-DTA: a multi-modal drug-target affinity prediction using graph transformer and attention mechanism. Neural Netw. 169, 623–636. 10.1016/j.neunet.2023.11.018 Xia

Pan

Niu

Zhang

(2023). Drug-target binding affinity prediction using message passing neural network and self supervised learning. BMC Genomics 24, 557. 10.1186/s12864-023-09664-z Xu

Leskovec

Jegelka

(2019). HOW POWERFUL ARE GRAPH NEURAL NETWORKS? Int. Conf. Learn. Represent. 10.48550/arXiv.1810.00826 Xu

Song

(2021). Application of machine learning for drug–target interaction prediction. Front. Genet. 12, 680117. 10.3389/fgene.2021.680117 Yang

Yang

Chu

(2023). GraphCL-DTA: a graph contrastive learning with molecular semantics for drug-target binding affinity prediction. Available at: http://arxiv.org/abs/2307.08989 (Accessed January 11, 2024). Yang

Zhong

Zhao

Yu-Chian Chen

(2022). MGraphDTA: deep multiscale graph neural network for explainable drug–target binding affinity prediction. Chem. Sci. 13, 816–833. 10.1039/D1SC05180F Yuan

Huang

(2021). Protein-ligand binding affinity prediction model based on graph attention network. Math. Biosci. Eng. 18, 9148–9162. 10.3934/mbe.2021451 Zaremba

Sutskever

Vinyals

(2015). Recurrent neural network regularization. Available at: http://arxiv.org/abs/1409.2329 (Accessed May 18, 2023). Zeng

Zhong

K.-Y.

Jiang

(2023). Fusing sequence and structural knowledge by heterogeneous models to accurately and interpretively predict drug–target affinity. Molecules 28, 8005. 10.3390/molecules28248005 Zeng

Chen

Luo

Peng

(2021). Deep drug-target binding affinity prediction with multiple attention blocks. Brief. Bioinform. 22, bbab117. 10.1093/bib/bbab117 Zhang

Liao

Saravanan

K. M.

Yin

Wei

(2019). DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity. PeerJ 7, e7362. 10.7717/peerj.7362 Zhang

Ouyang

Liu

Liao

Gao

(2023a). Multimodal contrastive representation learning for drug-target binding affinity prediction. Methods 220, 126–133. 10.1016/j.ymeth.2023.11.005 Zhang

Wang

C.-C.

Chen

(2022). Predicting drug–target binding affinity through molecule representation block based on multi-head attention and skip connection. Brief. Bioinform. 23, bbac468. 10.1093/bib/bbac468 Zhang

Wang

C.-C.

Zhang

Chen

(2023b). GPCNDTA: prediction of drug-target binding affinity through cross-attention networks augmented with graph features and pharmacophores. Comput. Biol. Med. 166, 107512. 10.1016/j.compbiomed.2023.107512 Zhang

Jiang

Wang

Wei

(2021). SAG-DTA: prediction of drug–target affinity using self-attention graph network. Int. J. Mol. Sci. 22, 8993. 10.3390/ijms22168993 Zhang

Jin

Liu

Wang

Zhang

Zhao

(2023c). SS-GNN: a simple-structured graph neural network for affinity prediction. ACS Omega 8, 22496–22507. 10.1021/acsomega.3c00085 Zhang

Gao

Wang

Chen

Zhang

Chen

(2023d). PLANET: a multi-objective graph neural network model for protein–ligand binding affinity prediction. J. Chem. Inf. Model., Acs.jcim., 3c00253. 10.1021/acs.jcim.3c00253 Zhang

Han

Yang

Liu

Cai

(2023e). A survey of drug-target interaction and affinity prediction methods via graph neural networks. Comput. Biol. Med. 163, 107136. 10.1016/j.compbiomed.2023.107136 Zhao

Duan

Yang

Cheng

Wang

(2022). AttentionDTA: drug-target binding affinity prediction by sequence-based deep learning with attention mechanism. IEEE/ACM Trans. Comput. Biol. Bioinform. 20, 852–863. 10.1109/TCBB.2022.3170365 Zheng

Fan

(2019). OnionNet: a multiple-layer intermolecular-contact-based convolutional neural network for protein–ligand binding affinity prediction. ACS Omega 4, 15956–15965. 10.1021/acsomega.9b01997 Zhou

Song

Xiang

(2024). TransVAE-DTA: transformer and variational autoencoder network for drug-target binding affinity prediction. Comput. Methods Programs Biomed. 244, 108003. 10.1016/j.cmpb.2023.108003 Zhu

Zhao

Wen

Wang

(2023a). DataDTA: a multi-feature and dual-interaction aggregation framework for drug–target binding affinity prediction. Bioinformatics 39, btad560. 10.1093/bioinformatics/btad560 Zhu

Yao

Mazur

Yang

Cong

(2023b). Associative learning mechanism for drug‐target interaction prediction. CAAI Trans. Intell. Technol. 8, 1558–1577. 10.1049/cit2.12194 Zhu

Yao

Zheng

Mazur

(2023c). Drug–target affinity prediction method based on multi-scale information interaction and graph optimization. Comput. Biol. Med. 167, 107621. 10.1016/j.compbiomed.2023.107621