Rensing-Lab / Genome-Zoo

Other
0 stars 0 forks source link

Minor FASTA Fixes #5

Open shiltemann opened 5 months ago

shiltemann commented 5 months ago

@Deeptivarshney

Ok, looks like there are only a few remaining minor inconsistencies, perhaps this is intentional/expected/ok, in which case please ignore and close this issue again, but otherwise maybe something to look into:

FASTA files without standard header naming scheme (not beginning with LETTERCODE_)

For optimal functionality from TAPscan website point of view, naming scheme should be

>LETTERCODE_organelle_proteinID

Where the proteinID (or gene ID) is used to search on PLAZA, and the organelle suffix is optional.

It's not a priority, but in theory would be nice to have everything standardized this way

Deeptivarshney commented 5 months ago

Hi @shiltemann, these are the FASTA files for brown algae that Romy added to the repo. Recently, I added a 5-letter code to the FASTA headers in a commit change on my branch and submitted a pull request (though we are awaiting confirmation from Romy). You can merge it to the main and hopefully everything will be fine then.

shiltemann commented 5 months ago

Aha! then ignore everything I said ;p