bacpop / PopPUNK

PopPUNK 👨‍🎤 (POPulation Partitioning Using Nucleotide Kmers)
https://www.bacpop.org/poppunk
Apache License 2.0
86 stars 17 forks source link

[database] A double-checked database of Neisseria meningitidis #267

Closed tanzhizhou closed 4 months ago

tanzhizhou commented 1 year ago

Hi johnlees,

Please specify:

If the latter:

I know that there is a database of Neisseria meningitidis already in your website, however, I found that the genome quality of Neisseria meningitidis in PubMLST is not good enough, and some isolates were even not belong to the species of Neisseria meningitidis.

For that reason, in this version of database, all the genomes of isolates were validated by species identification function (taxonomy_wf) in Genome Database Taxonomy toolkit (GTDB-Tk). Quality of the genomes were assessed using CheckM software. Of the isolates, only those with a completeness rate of ≥90% and a contamination rate of ≤5% were considered suitable for Poppunk model fitting and clustering.

Could I submit this database to your website?

Zhizhou Tan

Chinese CDC

johnlees commented 1 year ago

Thanks for this contribution, we would be very happy to make this the updated version on the website as it sounds like you've done a much more thorough QC of the data. I might suggest that we use your sketches with the existing model fit, to maintain as much backwards compatibility as possible.

It may be convenient to upload your data here: https://imperialcollegelondon.app.box.com/f/3c4781a2f5ad4215b12a62099ed5cc63 Then I can run through with the previous model and update the website.

Please also let me know if you have any preferred citation or attribtion for this that you'd like me to include on the page.

tanzhizhou commented 9 months ago

Thanks for this contribution, we would be very happy to make this the updated version on the website as it sounds like you've done a much more thorough QC of the data. I might suggest that we use your sketches with the existing model fit, to maintain as much backwards compatibility as possible.

It may be convenient to upload your data here: https://imperialcollegelondon.app.box.com/f/3c4781a2f5ad4215b12a62099ed5cc63 Then I can run through with the previous model and update the website.

Please also let me know if you have any preferred citation or attribtion for this that you'd like me to include on the page.

Hi John,

Many thanks for you detailed suggestion. I am really sorry for the late reply because our project have been delay for several months. Could I still upload the our database to your website?

https://imperialcollegelondon.app.box.com/f/3c4781a2f5ad4215b12a62099ed5cc63

The Neisseria_meningitidis database is 24GB in size, I think I can complete the submission this week

tanzhizhou commented 9 months ago

Hi John,

I have complete the submission of Neisseria meningitidis database, named as Neisseria_meningitidis_26801_refs.tar.bz2 to your link https://imperialcollegelondon.app.box.com/f/3c4781a2f5ad4215b12a62099ed5cc63

What should I do next?

All the best, Zhizhou China CDC

johnlees commented 9 months ago

Apologies for the slow reply – I am currently away and will respond in due course

tanzhizhou commented 9 months ago

Hi John, thanks for your email. No problem, looking forward to your reply.

All the best, Zhizhou

John Lees @.***> 于2023年9月26日周二 21:50写道:

Apologies for the slow reply – I am currently away and will respond in due course

— Reply to this email directly, view it on GitHub https://github.com/bacpop/PopPUNK/issues/267#issuecomment-1735584309, or unsubscribe https://github.com/notifications/unsubscribe-auth/ANMVSKQXX3NJK64WBUBDP63X4LMSZANCNFSM6AAAAAAZAABJCM . You are receiving this because you authored the thread.Message ID: @.***>

johnlees commented 4 months ago

Zhizhou,

Please accept my sincere apologies for the delay, I have to admit I forgot about your submission! I have now added your database contribution to our ftp and website (will appear ~tomorrow). Thank you very much for your contribution, which is greatly appreciated.

tanzhizhou commented 4 months ago

Hi John,

Many thanks for your email and help!

It is great news that this database is available on your website. The paper that describes this database is still under review. We will still need your help to update the information of this database on the website once the paper has been published.

Thanks again for your help!

All the best, Zhizhou

John Lees @.***> 于2024年2月27日周二 01:13写道:

Closed #267 https://github.com/bacpop/PopPUNK/issues/267 as completed.

— Reply to this email directly, view it on GitHub https://github.com/bacpop/PopPUNK/issues/267#event-11924661760, or unsubscribe https://github.com/notifications/unsubscribe-auth/ANMVSKXZG6GFWUZ25R77TZTYVS7EHAVCNFSM6AAAAAAZAABJCOVHI2DSMVQWIX3LMV45UABCJFZXG5LFIV3GK3TUJZXXI2LGNFRWC5DJN5XDWMJRHEZDINRWGE3TMMA . You are receiving this because you authored the thread.Message ID: @.***>

johnlees commented 4 months ago

Great, do let us know when your paper has been published and we'll be happy to update the reference