CDCgov / phoenix

🔥🐦🔥PHoeNIx: A short-read pipeline for healthcare-associated and antimicrobial resistant pathogens
Apache License 2.0

NCBI_Assembly_stats_20230504.txt clarification #119

Closed by cimendes 11 months ago

cimendes commented 11 months ago

Greetings!!

I've been researching how phoenix calculates the median and std when outputting the assembly ratio assessment. I came across NCBI_Assembly_stats_20230504.txt, which is used by https://github.com/CDCgov/phoenix/blob/main/bin/calculate_assembly_ratio.sh, but I'm having a hard time understanding what the columns of this file mean and how the file was obtained.

Could you please clarify?

Thank you! Inês

jvhagey commented 11 months ago

@cimendes please see the wiki documentation; I just added information on the headers. Are you just curious, or is there an issue related to this?

cimendes commented 11 months ago

Just curious! Is it possible to share the script used to compute those statistics? :)

jvhagey commented 11 months ago

Unfortunately, we are not able to provide additional scripts. Thanks for the question.