hallamlab / MetaPathways

A modular pipeline for constructing Pathway/Genome Databases from environmental sequence information
http://hallam.microbiology.ubc.ca/MetaPathways
12 stars 7 forks source link

Which files from the KEGG database are needed? #34

Closed nielshanson closed 10 years ago

nielshanson commented 10 years ago

For KEGG you should put in the raw fasta file to create the database. I know they have a reduced set of proteins as well as a full set. There is a kegg.reduced.fasta.tar.bz2 or fasta.tar.bz2. Both of these work with the pipeline, they just have to be unzipped and placed in the blastDB folder and put in the parameter file (e.g., template_param.txt) before running.

Hope this helps.

Niels