š„° This repository contains the HeteroDTA deep learning model, which is designed for predicting drug-target affinity in drug discovery tasks. The model is implemented in Python and utilizes various deep learning techniques for accurate affinity predictions.
:construction: Note: There are some dependencies that are not listed, please install them independently according to the feedback from the console
All publicly accessible datasets used can be accessed here:
Dataset Name | Link |
---|---|
Davis, KIBA | https://github.com/hkmztrk/DeepDTA/tree/master/data |
Human and C.elegans | https://github.com/masashitsubaki/CPI_prediction |
:construction: Note: It is necessary to download the corresponding data set and then place it in the corresponding directory named "data" on your own machine for subsequent data preprocessing
All publicly accessible models used can be accessed here:
Model Name | Link |
---|---|
GEM | https://github.com/PaddlePaddle/PaddleHelix |
ESM | https://github.com/facebookresearch/esm |
:construction: Note: It is necessary to read the corresponding GitHub guidelines and use pre-trained models on your own machine to preprocess your chosen datasets or your own private datasets
:construction: Notes:
- Before training or inference, the datasets used must be preprocessed
- Please refer to the individual script files for more detailed instructions and options.
Clone the repository
git clone https://github.com/daydayupzzl/HeteroDTA.git
cd your-repository
Install the required dependencies
pip install SomePackage # Please follow comsole's feedback and install the missing packages until there is no lack of dependencies
Make sure you have the necessary Python libraries installed, including PyTorch.
Open a terminal (Command Prompt on Windows, or Terminal on Mac/Linux).
Navigate to the directory containing the file.
Run the following command:
python training.py <dataset_index> <cuda_index> <dataset_type_index>
Replace
For example:
python your_script_name.py 0 0 0 v1
This will run the script, using the "davis" dataset, utilizing the first GPU, and employing the original dataset type.
Run the following command:
python inference.py <dataset_index> <cuda_index> <dataset_type_index>
Replace
For example:
python inference.py 0 0 0
This will execute the script, using the "davis" dataset, utilizing the first GPU, and employing the original dataset type.
If you use this code or related methods in your research, please consider citing HeteroDTA paper (We will add the literature address in the future)
@article {,
author = {Zhang, Zuolong and He, Xin and Long, Dazhi and Luo, Gang and Chen, Shengbo},
title = {Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models},
year={2024},
publisher = {Oxford University Press (OUP)},
journal = {Bioinformatics}
}