Closed yjk-bertrand closed 1 year ago
Hello
Yes, the sequence type should be determined automatically judging from the content. Even more, you might have both protein and nucleotide queries in the same file and it should run two processed: blastn
and tblastn
. However, the type detection might be not so perfect.
Will it be possible for you to attach your graph and input fasta file?
Hi Anton, After experimenting with other queries, I am starting to believe that the issue could have something to do with different blast settings between GUI and command line. I have attached the test files. I would also appreciate some advice on the correct string format for the blastp parameter. I am confused about the file name required for the query: I assumed it should be the same as the one inputted for the main program but looking at the string example from the GUI seem to indicate that it should be 'queries.fasta'. Thanks for your help, Yann test_data.zip
Hello, Sorry for insisting. Did you have a change to look at the graph and aa sequence? Thanks,
Sorry, was AFK – will check it out
Ok, so indeed there is the difference in the behavior and it is related to GFA paths.
On CLI we never include GFA paths to the BLAST db
Also, it seems that the filtering of path hits was not implemented, so all paths that you're seeing in the GUI are unfiltered path hits.
I am going to fix the confusion:
So, besides this paths hits were quite buggy. #141 introduced the necessary fixes. Now we correctly report zero hits as we enabled proper filtering :)
Thanks for all your work on BandageNG! Running a blast graph search with a protein query recovers plenty of hits on my graph. Attempting to do the same on the command line ("BandageNG-9eb84c2-x86_64.AppImage querypaths graph.gfa protein.fasta prefix") returns an empty file. My understanding is that the protein alphabet should be automatically recognized, but is it the case? Attempting to force tblastn with the --balstp flag (Bandage querypaths graph.gfa protein.fasta prefix --blastp "tblastn -query protein.fasta -db all_nodes.fasta -outfmt 6 ") did not help and produced an error: Error: Too many positional arguments (1), the offending value: tblastn Error: (CArgException::eSynopsis) Too many positional arguments (1), the offending value: tblastn I must be doing something wrong, but what? \Yann