klebgenomics / Kaptive

GNU General Public License v3.0
44 stars 21 forks source link

Creating Locus References Database #25

Closed ne1al closed 6 months ago

ne1al commented 2 years ago

Hi, I am new to Kaptive. I wonder if there is an argument to help me create locus references database through Kaptive or if that you can add this input file into the Kaptive repo for the users to download. Thank you

katholt commented 2 years ago

Hi there, do you mean you want to create a Kaptive database of loci from a new organism (ie other than Klebsiella pneumoniae or Acineobacter baumanii)?

This takes a fair bit of work, including manual curation, and is not something can be done within Kaptive.

Rafal Mastowy has shared code for automating the process of extracting cps & kps loci from genomes, which could be used as a starting point to generating a new database for use with Kaptive: https://github.com/rmostowy/fastKaptive

Once created and curated, the new database can be passed to Kaptive using -k or --k_refs. See details in the wiki for format requirements for K reference databases: https://github.com/katholt/Kaptive/wiki/How-to-run#locus-references

If you do create a new database, please consider sharing it in a GitHub repository for others to use, and we can link to it from the Kaptive repo (and, following testing, could also include it in Kaptive-Web as we have done with the third-party V. parahaemolyticus database).

ne1al commented 2 years ago

I am actually working on Klebsiella pneumoniae, Acinetobacter baumanii, and Pseudomonas aeruginosa. I will let you know if I manage to create one for Pseudomonas aeruginosa. Thank you.

tomdstanton commented 6 months ago

Closing this issue as it's a question relating to database curation which is outlined in the docs.