Arvados sequence uploader and analyzer for MRSA project
To get started, install the uploader first and then run the main.py script in the uploader directory.
git clone https://github.com/bio-ontology-research-group/mrsa-sequences.git
Make sure your system is ready to install Python tools that use pycurl and pyopenssl. On Ubuntu 18.04, you can run:
sudo apt update
sudo apt install -y virtualenv git libcurl4-openssl-dev build-essential python3-dev libssl-dev libxml2 libxslt1-dev
virtualenv --python python3 venv
. venv/bin/activate
Note that you will need to repeat the . venv/bin/activate step from this directory to enter your virtualenv whenever you want to use the installed tool.
pip install -r requirements.txt
python uploader/main.py --help
export ARVADOS_API_TOKEN=2jv9346o396exampledonotuseexampledonotuseexes7j1ld
You can find your Arvados token under the "Current token" link in the user profile menu of the Arvados web portal.
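Because the token is read from the environment, a forgotten export is an easy mistake to make. The following is a minimal sketch of a pre-flight check, not part of the uploader itself; the helper name get_arvados_token is hypothetical, and only the ARVADOS_API_TOKEN variable name comes from the instructions above.

```python
import os


def get_arvados_token(env=None):
    """Return the Arvados API token from the environment, or raise if it is unset.

    `env` defaults to os.environ; passing a plain dict makes the check easy to test.
    This helper is illustrative and is not provided by the uploader.
    """
    env = os.environ if env is None else env
    token = env.get("ARVADOS_API_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "ARVADOS_API_TOKEN is not set; export it before running the uploader"
        )
    return token
```

Calling get_arvados_token() before starting an upload fails early with a readable message instead of an authentication error later on.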
Run the uploader with gzipped FASTA or FASTQ read files and an accompanying metadata file in YAML:
python uploader/main.py reads1.fastq.gz reads2.fastq.gz metadata.yaml
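Before uploading, it can help to confirm that each input really is a gzipped FASTA or FASTQ file. The sketch below is an assumption about how one might do that locally and does not describe how the uploader validates its inputs; the function name sniff_sequence_format is hypothetical. It relies only on the standard conventions that FASTA records start with ">" and FASTQ records start with "@".

```python
import gzip


def sniff_sequence_format(path):
    """Guess 'fasta' or 'fastq' from the first non-empty line of a gzipped file.

    Returns None when the first non-empty line matches neither convention
    or the file is empty. Illustrative helper, not part of the uploader.
    """
    with gzip.open(path, "rt") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip leading blank lines
            if line.startswith(">"):
                return "fasta"
            if line.startswith("@"):
                return "fastq"
            break  # first real line is neither header type
    return None
```

For example, sniff_sequence_format("reads1.fastq.gz") should return "fastq" for a well-formed reads file; a file that is not gzip-compressed raises an error from the gzip module when it is read.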
You can find the example files on the MRSA web uploader. Here are the links to example files:
Once the sequence is uploaded, you can see the status of the job in the state.json file.
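To inspect the job status from Python rather than opening the file by hand, something like the following may be enough. This is a sketch under assumptions: the structure of state.json is not documented here, so the code makes no assumptions about its keys, and the default location (the current working directory) and the helper name load_state are guesses for illustration.

```python
import json
from pathlib import Path


def load_state(path="state.json"):
    """Return the parsed contents of the state file, or None if it does not exist yet.

    The default path is an assumption; pass the real location if it differs.
    """
    state_path = Path(path)
    if not state_path.exists():
        return None
    with state_path.open() as handle:
        return json.load(handle)
```

Printing json.dumps(load_state(), indent=2, sort_keys=True) gives a readable view of whatever the file currently contains.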