Activark is a data-driven, ML-based approach to predict the functional consequence of genetic changes in protein kinases. Activark was trained on a curated dataset of activating (i.e. constitutive-activation or increase in kinase activity), deactivating (i.e. loss or decrease in Kinase activity), and drug-resistance protein variants in human kinases and using sequence and structural features.
Briefly, we applied a random forest algorithm to develop 3 contrasting predictors based on seven types of sequence and structural features:
To access the Activark webservice, go to here To know more about Activark, visit here
If you wish run Activark locally on your system, follow the steps below:
Create the environment (activark) from the environment.yml file
conda env create -f environment.yml
Activate the environment
conda activate activark
Move to the predictor directory (required)
cd ML/
Read the help section
./prepareTestData.py -h
Example of input:
./prepareTestData.py sample_mutations.txt
10-fold stratified CV results of all the 3 predictors
Gurdeep Singh: gurdeep.singh[at]bioquant[dot]uni-heidelberg[dot]de
Torsten Schmenger: torsten.schmenger[at]bioquant[dot]uni-heidelberg[dot]de