samyeaman / orthagogue

Automatically exported from code.google.com/p/orthagogue
Other
1 stars 1 forks source link

Orhagogue blastp command #9

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Dear Ole,

Thank you for introducing “OrthAgogue” software for biologist like us. In 
my project, I am trying to identify ortholog sequences in twelve fish species, 
however I have no so much experience in finding ortholog protein sequence for 
multiple species. To our knowledge on this subject, only four software were 
developed for such a purpose (for multiple species); OrthoMCL,Multiparanoid, 
Proteinortho and OrthoAgogue. Due to its efficiency and applicability in large 
dataset, I decided to use OrthoAgogue. In our project, a protein fasta file of 
a fish species (obtained from de novo transcriptome assembly) will be searched 
against to 11 fish protein fast files (obtained from Ensemble non-redundant 
protein fast files) to find ortholog protein sequences.

As you well know, we must do a blast analysis (first I create a blast database 
using makeblastdb option and then, by using blastp option with suitable 
parameters, program start to matches the sequences).

My question are;

1) Which should protein fast file be database ? Can I select randomly ? for 
instance, my fast file.

2) In your manual, there is an command line like; “orthAgogue -i myblast.out 
-e 6 -O myoutdir”. My question is regarding the “my blast.out”;

Do I obtained this “my blast.out” after blastp analyses of each fish 
species (pairwise like search,for example, my fish (anchovy, database) vs 
zebrafish) or obtained via comparing all fishes against the database (for 
example, database (my fishes) vs zebra fish, puffer fish, etc…).

If you help me about this issue and provide a blastp command for multiple 
species, I will greatly appreciate.

Best Regards,

Vahap ELDEM 
Department of Biology, Istanbul University, TURKEY

Original issue reported on code.google.com by eldemva...@gmail.com on 13 Jul 2015 at 2:48