As a PM, I want to fix the scripts for core datasets whose bugs have proven more troublesome to fix
Analysis
cofog
The source URL in process.py (line 20), access_db_zip_url = 'http://unstats.un.org/unsd/cr/registry/regdntransfer.asp?f=186', is broken. Moving this dataset to broken sources.
corruption-perceptions-index
Many problems: libraries need to be installed, the source is sometimes downloaded but not unzipped, and some sources are unavailable. I suggest a complete refactoring of this script.
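For the "downloaded but not unzipped" case, a refactored script could validate and unpack the payload in one step. This is a minimal sketch, not the existing script's code: the function name `extract_archive` and the target directory are hypothetical, and it assumes the source is delivered as a zip archive.

```python
import io
import zipfile
from pathlib import Path
from typing import List


def extract_archive(payload: bytes, target_dir: str) -> List[str]:
    """Unpack a downloaded zip payload into target_dir and return member names.

    Hypothetical helper: assumes the source is a zip archive already read
    into memory as bytes.
    """
    # Fail loudly if the download is not a zip (e.g. an HTML error page).
    if not zipfile.is_zipfile(io.BytesIO(payload)):
        raise ValueError("downloaded payload is not a zip archive")
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(io.BytesIO(payload)) as archive:
        archive.extractall(target_dir)
        return archive.namelist()
```

Raising on a non-zip payload makes an unavailable source fail at download time rather than later in processing.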
gdp-us
The problem was a slight change in the source URL, which was an easy fix. It might be useful to refactor the script to use Python 3.
geo-admin1-us
This uses a tuttle file, so further analysis is needed.
geo-countries
This uses a tuttle file, so further analysis is needed.
geo-ne-admin1
This uses a tuttle file, so further analysis is needed.
house-prices-global
The format of the source CSV file changed, so the script was slightly modified to work with it.
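The exact format change is not recorded here. If it was a column being added or reordered, selecting columns by header name instead of by position would make the script less fragile. A minimal sketch under that assumption follows; `select_columns` and the column names in the usage note are hypothetical.

```python
import csv
import io
from typing import Dict, Iterable, List


def select_columns(csv_text: str, wanted: Iterable[str]) -> List[Dict[str, str]]:
    """Return only the wanted columns from CSV text, looked up by header name."""
    wanted = list(wanted)
    reader = csv.DictReader(io.StringIO(csv_text))
    # Report a changed source format explicitly instead of failing mid-run.
    missing = [name for name in wanted if name not in (reader.fieldnames or [])]
    if missing:
        raise KeyError(f"source is missing expected columns: {missing}")
    return [{name: row[name] for name in wanted} for row in reader]
```

For example, `select_columns(text, ["date", "price"])` returns the same rows whether or not the source has gained extra columns.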
house-prices-uk
The script is working properly. The problem was in the README instructions for running the script, which are now fixed.
imf-weo
Unable to make the script work; there is some problem with locales.
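The specific locale failure is not recorded here. One common cause in scripts like this is parsing formatted numbers with locale-dependent functions, which breaks when the expected locale is not installed on the machine. A minimal sketch of a locale-independent alternative follows. It assumes values use a comma as the thousands separator and a period as the decimal point, and that `n/a` and `--` mark missing data. The function name `parse_number` is hypothetical.

```python
from typing import Optional


def parse_number(text: str) -> Optional[float]:
    """Parse a formatted number such as '1,234.5' without using the locale module.

    Assumption: ',' is a thousands separator and '.' is the decimal point.
    """
    cleaned = text.strip().replace(",", "")
    # Assumed missing-value markers; adjust to whatever the source uses.
    if cleaned in ("", "n/a", "--"):
        return None
    return float(cleaned)
```

Because this never calls `locale.setlocale`, it behaves the same regardless of which locales are installed.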
interest-rates-gb
The script works properly, but the source is outdated and the script should be refactored.
language-codes
Unable to get PHP working; further analysis is needed.
pharmaceutical-drug-spending
Fixed the library imports in population.py.
s-and-p-500
In progress as a separate issue
s-and-p-500-companies
In progress as a separate issue
un-locode
Unable to get PHP working; further analysis is needed.
world-cities
This uses a tuttle file, so further analysis is needed.
Acceptance criteria
[ ] The scripts with complex bugs are working properly. Those that require further analysis are labeled as such.
Tasks
[x] Determine and fix the bugs (if possible) or move the repo to the [broken source issue](https://github.com/datahq/pm/issues/291) for the following list of datasets
Some datasets are fixed, but a few complex ones that use PHP and tuttle will require further analysis and are part of a separate issue:
https://github.com/datahq/pm/issues/292