everypolitician-scrapers / spain_congreso_es

Details of members of the Spanish Congress from the official website congreso.es
https://morph.io/everypolitician-scrapers/spain_congreso_es
1 stars 2 forks source link

Mixed data? #5

Open tmtmtmtm opened 8 years ago

tmtmtmtm commented 8 years ago

Does the output here include data from an earlier run, before some of the scraper changes?

e.g. some people have names in different formats from others; faction information seems inconsistent; about a third of the entries don't have an iddiputado set, etc.

Not sure if this is just because the database needs to be emptied before a clean run, or whether there's a deeper problem.

struan commented 8 years ago

I am reasonably sure I emptied the database before I committed the most recent version.

However, yes, it does look like there is data from a previous version in there :(

I will delete the data and see if that improves matters.