OCHA-DAP / Data-Team

A place for tracking data team issues

An error-filled SW file, please #18

Closed cjhendrix closed 10 years ago

cjhendrix commented 10 years ago

We discussed this a while back, but it would be good for us to have a ScraperWiki CSV zip file filled with the kinds of errors we would like to be able to catch during validation.

I'm not sure what the best way is to build something like that, but for sure we would want a checklist of errors that we should catch so we know when we are successfully catching them all.
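As a rough illustration of the checklist idea, here is a minimal sketch in Python. It builds a small zip containing one CSV with deliberately injected errors, then runs a named checklist of checks against it so it is clear which errors are caught. Everything in it is an assumption made for illustration: the file name `value.csv`, the column names, the sample rows, and the three error types are hypothetical and are not taken from the real SW files or from our validation code.

```python
import csv
import io
import zipfile

# Hypothetical column layout; the real SW CSV files may differ.
FIELDS = ["region", "indicator", "period", "value"]

# Made-up rows with deliberately injected errors.
BAD_ROWS = [
    {"region": "KEN", "indicator": "POP", "period": "2012", "value": "abc"},  # non-numeric value
    {"region": "", "indicator": "POP", "period": "2012", "value": "10"},      # missing region
    {"region": "UGA", "indicator": "POP", "period": "2012", "value": "5"},
    {"region": "UGA", "indicator": "POP", "period": "2012", "value": "5"},    # duplicate row
]


def build_zip(rows):
    """Return the bytes of a zip archive holding a single CSV, value.csv."""
    text = io.StringIO()
    writer = csv.DictWriter(text, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    archive = io.BytesIO()
    with zipfile.ZipFile(archive, "w") as zf:
        zf.writestr("value.csv", text.getvalue())
    return archive.getvalue()


def read_rows(zip_bytes):
    """Read value.csv back out of the archive as a list of dicts."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as zf:
        with zf.open("value.csv") as fh:
            wrapper = io.TextIOWrapper(fh, encoding="utf-8", newline="")
            return list(csv.DictReader(wrapper))


def is_number(text):
    try:
        float(text)
        return True
    except ValueError:
        return False


def has_non_numeric_value(rows):
    return any(not is_number(r["value"]) for r in rows)


def has_missing_region(rows):
    return any(not r["region"].strip() for r in rows)


def has_duplicate_rows(rows):
    keys = [tuple(r[f] for f in FIELDS) for r in rows]
    return len(keys) != len(set(keys))


# The checklist: one entry per error we want validation to catch.
CHECKLIST = {
    "non-numeric value": has_non_numeric_value,
    "missing region": has_missing_region,
    "duplicate row": has_duplicate_rows,
}


def run_checklist(zip_bytes):
    """Return {error name: True if the check flagged the file}."""
    rows = read_rows(zip_bytes)
    return {name: check(rows) for name, check in CHECKLIST.items()}


if __name__ == "__main__":
    for name, caught in run_checklist(build_zip(BAD_ROWS)).items():
        print(("caught" if caught else "MISSED") + ": " + name)
```

Any entry reported as missed would point at an error type that validation does not yet detect. A real version would call the actual validation code instead of these stand-in checks.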

rosnfeld commented 10 years ago

I think this could be built manually, perhaps by modifying an existing SW file.

A few ideas to start with:

I realize only the last example is something we talked about explicitly during our validation talk a month ago, as that's more the flavor of things I had been looking at.

cjhendrix commented 10 years ago

Yes, that's the sort of stuff I was thinking of. I think at the moment that we would do pretty well against this list except for the 3rd item.

rosnfeld commented 10 years ago

(The 3rd item unfortunately continues to happen - the most recent SW data seems to have only a small portion of regions covered by World Bank data)
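A coverage check for this kind of problem could look roughly like the sketch below. It is only an illustration: the `source` and `region` column names, the sample rows, the `"world-bank"` label, and the 0.8 threshold are all assumptions, not values from the real SW data or our validation code.

```python
def region_coverage(rows, source, expected_regions):
    """Fraction of expected regions that have at least one row from `source`."""
    expected = set(expected_regions)
    if not expected:
        return 0.0
    seen = {r["region"] for r in rows if r["source"] == source}
    return len(seen & expected) / len(expected)


def coverage_too_low(rows, source, expected_regions, threshold=0.8):
    """True if the source covers less than `threshold` of the expected regions.

    The default threshold is an arbitrary placeholder.
    """
    return region_coverage(rows, source, expected_regions) < threshold


if __name__ == "__main__":
    # Made-up sample: the hypothetical source covers only one of four regions.
    rows = [
        {"source": "world-bank", "region": "KEN", "value": "1"},
        {"source": "other", "region": "UGA", "value": "2"},
    ]
    expected = ["KEN", "UGA", "TZA", "RWA"]
    print(region_coverage(rows, "world-bank", expected))
    print(coverage_too_low(rows, "world-bank", expected))
```

The expected region list would need to come from somewhere trusted, such as a previous good load, since the check is only as good as that baseline.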

cjhendrix commented 10 years ago

Yes, but at least in that case, the server is alerting us that something went wrong with the scraper. @JavierTeran, I wonder if we shouldn't ask Dragon about that one. It seems to have persisted for several days.

JavierTeran commented 10 years ago

Sure, I am writing to him now. Thanks

JavierTeran commented 10 years ago

As for the dummy file, I started preparing one some time ago, as we discussed. I will have it ready by the end of next week.