lczech / gappa

A toolkit for analyzing and visualizing phylogenetic (placement) data
GNU General Public License v3.0
Query sequences missing in the per_query.tsv file #25

Closed miferg closed 1 month ago

miferg commented 1 year ago

Hi,

I used epa-ng to place some read sequences on a reference tree and then ran gappa examine assign with the --per-query-results flag to get the per_query.tsv taxonomic classification table. When checking the results, I found that some sequences are missing from the table but are present in the jplace file produced by epa-ng. Is this expected, and if not, what could be causing it?

Thanks in advance, Miguel

lczech commented 1 year ago

Hi @miferg,

It could be that those queries are being filtered out for some reason. Unfortunately, I did not write that particular command in gappa; maybe @pierrebarbera is willing to jump in?

Either way, could you please provide the exact command line that causes the issue and, if possible, a minimal working example of a jplace file where this error occurs? Otherwise it's hard to track this down.
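For anyone trying to build such a minimal example, here is a rough sketch (not part of gappa) that lists the query names present in the jplace file but absent from the table, and writes a reduced jplace file containing only those queries. It relies on the jplace format storing names under "n" or "nm" in each entry of "placements". It assumes that the first column of per_query.tsv holds the query name and that the first row is a header. The function names and the output file name are made up for illustration.

```python
import csv
import json


def _pquery_names(pquery):
    """Return the set of query names attached to one jplace pquery."""
    plain = pquery.get("n", [])
    if isinstance(plain, str):  # tolerate a single name given as a string
        plain = [plain]
    # "nm" holds [name, multiplicity] pairs; only the name is needed here.
    return set(plain) | {entry[0] for entry in pquery.get("nm", [])}


def jplace_query_names(jplace):
    """Collect all query names from a parsed jplace document."""
    names = set()
    for pquery in jplace.get("placements", []):
        names |= _pquery_names(pquery)
    return names


def table_query_names(tsv_path, name_column=0):
    """Collect query names from a tab-separated table.

    Assumption: the first row is a header and `name_column` holds the name.
    A query may span several rows, so the names are returned as a set.
    """
    with open(tsv_path, newline="") as handle:
        reader = csv.reader(handle, delimiter="\t")
        next(reader, None)  # skip the header row
        return {row[name_column] for row in reader if row}


def report_missing(jplace_path, tsv_path, reduced_path):
    """Write a jplace file with only the pqueries missing from the table.

    Returns the sorted list of missing query names.
    """
    with open(jplace_path) as handle:
        jplace = json.load(handle)

    missing = jplace_query_names(jplace) - table_query_names(tsv_path)

    # Keep tree, fields, and metadata as they are; only filter the pqueries.
    reduced = dict(jplace)
    reduced["placements"] = [
        pquery
        for pquery in jplace.get("placements", [])
        if _pquery_names(pquery) & missing
    ]
    with open(reduced_path, "w") as handle:
        json.dump(reduced, handle)

    return sorted(missing)
```

With the paths from this thread, a call might look like `report_missing("epang_out/epa_result.jplace", "tax-classification/per_query.tsv", "missing_queries.jplace")`, assuming the table was written to the given output directory under that name. Running gappa examine assign on the reduced file should then show whether the problem reproduces on just those queries.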

Thanks and so long Lucas

miferg commented 1 year ago

Hi Lucas,

This is the command line:

gappa examine assign --jplace-path epang_out/epa_result.jplace --taxon-file deftree.taxstrings.tsv --out-dir tax-classification --per-query-results --allow-file-overwriting

I'm working on the minimal example.

Thanks Miguel

lczech commented 10 months ago

Hi @miferg, any update on this?

lczech commented 1 month ago

Hi @miferg,

I'll close the issue for now, but if you have an update, feel free to re-open or start a new one :-)

Cheers Lucas