fmalmeida / bacannot

Generic but comprehensive pipeline for prokaryotic genome annotation and interrogation with interactive reports and shiny app.
https://bacannot.readthedocs.io/en/latest/
GNU General Public License v3.0
96 stars 9 forks source link

Add bakta #16

Closed fmalmeida closed 1 year ago

fmalmeida commented 3 years ago

Study the best way to implement Bakta in the pipeline.

It will be nice to provide the users with the option to choose the base annotation with Prokka or Bakta, depending on their needs.

Check if it will be possible to add it.

fmalmeida commented 2 years ago

Bakta outputs are extremely similar to Prokka, however, their annotation is more reliable. Therefore, the addition seems to be very straightforward:

One thing to think is:

Recapitulating:

To add bakta it would be necessary to:

fmalmeida commented 2 years ago

Now that pipeline has been restructured, this issue can become a reality.

Since bakta database is huge, instead of downloading and formatting with the pipeline users will have to download themselves as each system or institute will have a way to handle such massive download.

Thus, if users want to annotate and trigger bakta, they will have to simply:

  1. Download the database
  2. Set path to bakta database with --bakta_db

When using this parameter, the pipeline should automatically trigger bakta instead of prokka.

fmalmeida commented 2 years ago

Finally, after very much time, workflow is now properly running from top to bottom when using bakta. For release, it is now required to:

fmalmeida commented 1 year ago

Almost ready.

fmalmeida commented 1 year ago

try to roll it up in the next 3 days

fmalmeida commented 1 year ago

Something is wrong with bakta docker image. When running it, it is complaining about diamond. With some -9 exit code.

fmalmeida commented 1 year ago

Execution tests were finished. Now building new docker images, to check whether scripts and reports are properly updated so release can be made.

fmalmeida commented 1 year ago

Finally done 🥳