nhoffman / ya16sdb

A curated subset of 16S rRNA sequences from NCBI
2 stars 3 forks source link

Use the labmed classifier to illustrate species uncertainty in Dash app #49

Closed crosenth closed 2 years ago

crosenth commented 2 years ago

https://github.com/crosenth/moose/compare/0.8..5a854cf472b

% time classify -vv --out vsearch_old.csv --lineages taxonomy.csv --seq-info seq_info.csv vsearch.tsv
INFO classifier loading alignments output/20211207/dedup/1200bp/named/vsearch.tsv
INFO classifier reading output/20211207/dedup/1200bp/named/filtered/trusted/types/seq_info.csv
INFO classifier joining
INFO classifier reading output/20211207/dedup/1200bp/named/filtered/trusted/types/taxonomy.csv
INFO classifier joining with alignments
INFO classifier selecting best alignments for classification
INFO classifier 1464015 alignments selected for assignment
INFO classifier condensing group tax_ids to size 3
INFO classifier creating compound assignments
INFO classifier summarizing output
classify -vv --out vsearch_old.csv --lineages  --seq-info    7645.31s user 99.61s system 100% cpu 2:09:00.00 total
% time classify -vv --out vsearch_new.csv --lineages taxonomy.csv --seq-info seq_info.csv vsearch.tsv
INFO classifier loading alignments output/20211207/dedup/1200bp/named/vsearch.tsv
INFO classifier reading output/20211207/dedup/1200bp/named/filtered/trusted/types/seq_info.csv
INFO classifier joining
INFO classifier reading output/20211207/dedup/1200bp/named/filtered/trusted/types/taxonomy.csv
INFO classifier joining with alignments
INFO classifier selecting best alignments for classification
INFO classifier 1464015 alignments selected for assignment
INFO classifier condensing group tax_ids to size 3
INFO classifier creating compound assignments
INFO classifier summarizing output
classify -vv --out vsearch_new.csv --lineages  --seq-info    2771.73s user 62.48s system 100% cpu 47:10.64 total
% md5sum vsearch_new.csv vsearch_old.csv   
0754e5ec98ea3a5399bce7d07c5be74f  vsearch_new.csv
0754e5ec98ea3a5399bce7d07c5be74f  vsearch_old.csv
crosenth commented 2 years ago

Done