ctb / 2022-sourmash-sens-spec

Playing around with sens/spec measurements for simulated stuff
BSD 3-Clause "New" or "Revised" License
3 stars 0 forks source link

exact k-mer results #6

Open ctb opened 1 year ago

ctb commented 1 year ago

For s__Bosea sp902506165 genomes

% ../gemella/compare-scaled-1.py *.fna.gz *.fna.gz
Removing query 'GCF_902498905.1_BOS5A_assembly_genomic.fna.gz' from against list.
WARNING: scaled value should be >= 100. Continuing anyway.
consuming 'GCF_902498905.1_BOS5A_assembly_genomic.fna.gz'
6317621
consuming 'GCF_902506135.1_BOSE125_assembly_genomic.fna.gz'
6286192 35128
consuming 'GCF_902506125.1_BOSE62_assembly_genomic.fna.gz'
6316542 706
consuming 'GCF_904067565.1_ASM90406756v1_genomic.fna.gz'
6316930 502
consuming 'GCF_902506185.1_BOSE127_assembly_genomic.fna.gz'
6316932 442
consuming 'GCF_902506165.1_BOSE29B_assembly_genomic.fna.gz'
6316207 402
consuming 'GCF_904067575.1_ASM90406757v1_genomic.fna.gz'
6316918 372
consuming 'GCF_904067625.1_ASM90406762v1_genomic.fna.gz'
6316220 286
286

meaning there's fewer than 286 k-mers that are unique to GCF_902498905.1_BOS5A_assembly_genomic.fna.gz within this GTDB species.

ctb commented 1 year ago

results for s__Gemella sp004570485

% ../../scripts/compare-scaled-1.py *.fna.gz                                  
WARNING: scaled value should be >= 100. Continuing anyway.
consuming 'GCF_004570485.1_ASM457048v1_genomic.fna.gz'
1493803
consuming 'GCF_015234945.1_ASM1523494v1_genomic.fna.gz'
1493675 128
consuming 'GCF_013415365.1_ASM1341536v1_genomic.fna.gz'
1290174 128
consuming 'GCF_015234655.1_ASM1523465v1_genomic.fna.gz'
1290174 128
128
ctb commented 1 year ago

results for g__Murdochiella

../../scripts/compare-scaled-1.py *.fna.gz
WARNING: scaled value should be >= 100. Continuing anyway.
consuming 'GCF_900059565.1_PRJEB12696_genomic.fna.gz'
1635530
consuming 'GCF_902375925.1_MGYG-HGUT-01541_genomic.fna.gz'
1635530 0
consuming 'GCF_900119705.1_PRJEB14245_genomic.fna.gz'
1658527 0
consuming 'GCF_902381775.1_UHGG_MGYG-HGUT-01563_genomic.fna.gz'
1658527 0
0