mdozmorov / genome_runner

Academic Free License v3.0

dbcreator_encode: double downloading of DNase imputed gapped/narrow #87

Closed mdozmorov closed 9 years ago

mdozmorov commented 9 years ago

Some files are downloaded twice, e.g.:

    2015-07-22 12:18:21,142 INFO Downloading E001-DNase.imputed.gappedPeak.bed
    2015-07-22 12:18:22,705 INFO Converting into proper bed format: E001-DNase.imputed.gappedPeak.bed
    2015-07-22 12:18:23,595 INFO Downloading E001-DNase.imputed.gappedPeak.bed.gPk.gz
    2015-07-22 12:18:25,053 INFO already exists, skipping extraction

As a result, two files exist in the 'downloads' folder:

    E001-DNase.imputed.gappedPeak.bed.gPk.gz - apparently, the zipped file from the first download
    E001-DNase.imputed.gappedPeak.bed - the same file, unzipped, apparently left over after extraction was skipped

This happens only with files like:

    E129-DNase.imputed.gappedPeak.bed
    E129-DNase.imputed.narrowPeak.bed

There are 253 such orphan files - why are they double downloaded?
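
To list the affected pairs, something like the following could be run against the 'downloads' folder. It is only a sketch, not part of dbcreator_encode: the function name `find_double_downloads` is made up here, and it assumes that a compressed copy is always named as the plain file name plus an extra dotted suffix ending in `.gz` (as in the `.bed` / `.bed.gPk.gz` pair above).

```python
import os


def find_double_downloads(folder):
    """Return (plain_name, gz_name) pairs where both copies exist in `folder`.

    Assumption (not taken from dbcreator_encode): the compressed copy is named
    "<plain_name>.<something>.gz", e.g. "X.bed" and "X.bed.gPk.gz".
    """
    names = sorted(os.listdir(folder))
    plain = [n for n in names if not n.endswith(".gz")]
    gz = [n for n in names if n.endswith(".gz")]
    pairs = []
    for p in plain:
        for g in gz:
            # Require the dot so "X.bed" does not match "X.bedfoo.gz".
            if g.startswith(p + "."):
                pairs.append((p, g))
    return pairs


if __name__ == "__main__":
    # "downloads" is the folder name used in this issue; adjust the path as needed.
    for plain_name, gz_name in find_double_downloads("downloads"):
        print(plain_name, gz_name)
```

If the naming assumption holds, the number of pairs printed can be compared against the 253 orphan files.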