manisa / ClassifyTE

MIT License
12 stars 2 forks source link

ClassifyTE: Stacking-based Machine Learning Framework for Hierarchical Classification of transposable

Table of Content

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Prerequisites

You would need to install the following software before replicating this framework in your local or server machine.

Java JDK
Python version 3.5+
Aanaconda version 3+

Download and install code

Download Models

ClassifyTE/
    models/
        ClassifyTE_combined.pkl
        ClassifyTE_repbase.pkl
        ClassifyTE_pgsb.pkl

Demo

To run the program on test TE sequence:

ClassifyTE/
    data/
        demo.fasta

Parameters

For generate_feature_file.py, the user has to provide two parameters:

python evaluate.py -f demo_features.csv -n node.txt -d demo_features -m ClassifyTE_combined.pkl -a lcpnb

Parameters

For evaluate.py, the user has to provide following parameters:

Deployment

To run the program on new TE sequence:

ClassifyTE/
    data/
        [your_fasta_file]

Parameters

For generate_feature_file.py, the user has to provide two parameters:

python evaluate.py -f your_feature_file_name -d your_feature_directory -n node_file -m model_name

Parameters

For evaluate.py, the user has to provide following parameters:

Download datasets

ClassifyTE/
    data/
        pgsb_feature_file.csv
        repbase_feature_file.csv
        combined.csv

Training

Parameters

For train.py, the user has to provide following parameters:

nodes

Authors

Manisha Panta, Avdesh Mishra, Md Tamjidul Hoque, Joel Atallah

License

This project is licensed under the MIT License - see the LICENSE.md file for details

References

[1] Nakano, F.K., et al. Top-down Strategies for Hierarchical Classification of Tranposable Elements with Neural Networks. In, IEEE. 2017.