Closed rujinlong closed 2 years ago
Dear @rujinlong, Thanks for using the pipeline and reporting this issue in such an informative manner 😄 That is a nice spotting!
Before I go into coding, I think it is nice to first brainstorm the best solution. Do you have any idea of nice and clean approach to do in such cases?
Maybe I can just change the way I check for gbk files with CDS sequences. Maybe with grep -q "CDS"
plus checking for this Bakta comment line?
My issue on this one is assuming that the amount of whitespaces will never vary.
Other possibility may be to make it run, but add something in the comment that triggers islandpath to ignore it's error.
My issue on this one is also ignoring
TRUE
errors and missing relevant logs.
I was able to think in something like this:
( sed '/CDS.*::.*0/d' test.gbk | grep -q CDS ) && echo yes || echo no
I will commit it to the branch and invite you to test it.
I've just commited to the new branch for this hotfix. Can you give it a try by appending:
-r 62-islandpath-failed-when-there-is-no-cds-in-bakta-annotated-gbk-file -latest
To your command line?
I've just commited to the new branch for this hotfix. Can you give it a try by appending:
-r 62-islandpath-failed-when-there-is-no-cds-in-bakta-annotated-gbk-file -latest
To your command line?
Great. This works 👍
Okidokie. I’ll make a hotfix release out of it 😄
https://github.com/fmalmeida/bacannot/blob/7fb675b1688d3d2af85f7cf206ce0fc6a1e82858/modules/MGEs/islandpath.nf#L22
Line 22 of
islandpath.nf
detect if there is CDS in gbk file. However, when annotation using Bakta, gbk file will always have a line with "CDS" in the COMMENT section, as show in line 21 below,This will make ISLANDPATH fail when there is no true CDS in the sequence.