biostars / biostar-handbook

Issue tracker for the Biostar Handbook
57 stars 12 forks source link

gunzip hg38.knownGene.gtf.gz not in gzip format #267

Open nush320 opened 1 year ago

nush320 commented 1 year ago

Hi, I am on Chapter VI-Data Sources - Downloading complete genomes. The following command from "Get the GTF file" is not working: gunzip hg38.knownGene.gtf.gz

It says its not in gzip format.

When I check the file type its showing "ASCII text".

The command after that works: cat hg38.knownGene.gtf | wc -l if I replace hg38.knownGene.gtf as hg38.knownGene.gtf.gz but it prints out 3626358 instead of 3091269.

Thanks.

ialbert commented 1 year ago

strangely enough, and for reasons I can't explain the file hg38.knownGene.gtf.gz is not a gzip file, it is already unpacked. I'm quite certain it did not use to be like that and that it used to be a gzipped file ...

I think the server automatically unpacks that file for us upon request.

I would have to investigate what happens there, evidently not the correct behavior there ... oh well ... bioinformatics