Closed nickmalleson closed 2 years ago
I suspect downloading or untarring the massive QUANT file failed partially last time. Per a not very visible part of the docs, you might have to remove the directory and/or tar.gz file to get it to try again.
From the logs, `data/raw_data/nationaldata/QUANT_RAMP/ already exists, not untarring data/raw_data/nationaldata/QUANT_RAMP_spc.tar.gz`. For me, `ls -l data/raw_data/nationaldata/QUANT_RAMP_spc.tar.gz` is about 2.3GB. Could you check if you have the full file?
I've thought about making the download/extract process smarter by comparing the downloaded file's size against the expected size. Maybe it's time to do that...
That file is only 10M for me. Almost certainly my fault that it was only partially downloaded; I think I killed it in a previous attempt. It's re-downloading now. I've mentioned the error explicitly in the docs in https://github.com/alan-turing-institute/uatk-spc/pull/28 (https://github.com/alan-turing-institute/uatk-spc/pull/28/commits/7c5ba7b90965aeefc7b7ad3e198e4e25867d5a6d)
Filed #30 to later explore more automated recovery. The tool could detect partly downloaded files and try again.
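As a rough idea of what that detection could look like, here's a minimal Python sketch. It is not how the tool currently works: the function names and the `expected_bytes` parameter are made up for illustration, and the exact expected size would have to be published somewhere trustworthy (the "about 2.3GB" above is only approximate). It checks the archive's size, checks that the gzip stream reads all the way through, and if either check fails removes both the archive and the extracted directory so the next run starts fresh.

```python
import gzip
import shutil
import zlib
from pathlib import Path


def looks_complete(archive: Path, expected_bytes: int) -> bool:
    """True if the archive exists and has exactly the expected size."""
    return archive.is_file() and archive.stat().st_size == expected_bytes


def gzip_is_readable(archive: Path, chunk_size: int = 1 << 20) -> bool:
    """True if the whole gzip stream decompresses without error.

    A download that was killed part-way normally fails here with EOFError.
    """
    try:
        with gzip.open(archive, "rb") as stream:
            while stream.read(chunk_size):
                pass
        return True
    except (OSError, EOFError, zlib.error):
        return False


def reset_if_partial(archive: Path, extracted_dir: Path, expected_bytes: int) -> bool:
    """Delete a partial archive and its extracted directory.

    Returns True if something looked wrong and the files were removed,
    so the caller knows to download and untar again.
    """
    if looks_complete(archive, expected_bytes) and gzip_is_readable(archive):
        return False
    archive.unlink(missing_ok=True)  # hypothetical policy: always start over
    shutil.rmtree(extracted_dir, ignore_errors=True)
    return True
```

The size check is cheap and catches the common case of a killed download. The full read is slower on a multi-GB file, but it also catches the case where the expected size isn't known exactly.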
I'm running the program on berkshire, but getting this error:
`Error: failed to open file data/raw_data/nationaldata/QUANT_RAMP/retailpointsZones.csv`
And indeed that file hasn't been created:
Here's the full output: