Closed pbraileyjones closed 10 months ago
Hi @pbraileyjones ,
The formatting script splits the header at |
, and takes the 5th element for taxonomy. You should just need to add two extra |
, like in this header:
>Entoloma_flavidum|JQ281481|SH1510137.08FU|refs|k__Fungi;p__Basidiomycota;c__Agaricomycetes;o__Agaricales;f__Entolomataceae;g__Entoloma;s__Entoloma_flavidum
For example:
>Candida_glabrata_Asco|AY198398.1|XXX|reps|k__Fungi;p__Ascomycota;c__Saccharomycetes;o__Saccharomycetales;f__Saccharomycetaceae;g__Candida;s__glabrata
I hope that works,
Julian
Thanks! That was it!
Best, Phil
Hi- I'm trying to classify my sequences using a custom LSU database but I am running into some errors that I'm not sure how to solve.
I've ostensibly formatted my database headers in the UNITE format eg.:
This is the code I am running:
And this is the output I am getting
Do you have any idea why I might be coming up with this error?
Thanks! Phil