ScilifelabDataCentre / genome-portal

This is the repository for the Swedish Reference Genome Portal, a service facilitating access and discovery of genome data of non-model eukaryotic species studied in Sweden
https://genomes.scilifelab.se/
MIT License
0 stars 0 forks source link

Support zipped GFF files #28

Closed kwentine closed 3 months ago

kwentine commented 3 months ago

This PR adds support for remotely hosted GFF files with the .zip extension (an example of which can be found in Linum tenue)

The solution is ad-hoc: to create genes.gff.bgz look at any file in DOWNLOAD_TARGETS that matches genes.gff; that can turn out to be genes.gff.zip or genes.gff.gz.

It works partly because zcat handles both .gz and .zip identically.

I have some ideas to push this work further, so that we can support any compression format handled by zcat for all remote files (not only GFF). But that will be for another PR.

Note: this PR will be merged in main after #27 . The target is currently set to #27 to make the diff more apparent.