Closed Irio closed 7 years ago
When we first generated it, the companies.xz
file already had geolocation (using src/geocode_addresses.py
). I'm good with option number 1 if we work on number 2 later. @cuducos
I'm good with option number 1 if we work on number 2 later
I agree with this approach ;)
For some reason it has less 7% companies than the last one (no idea why)
I think I ran it using only the reimbursements dataset. Another reason could be that the last script were filling lines with blank info only with the message error for "CNPJ inválido".
Renaming it, opening an issue to add geolocation… and closing this issue! Hell yeah ; ) Thank you so much @marcusrehm 🎉
Closed by #218
I think this is mostly related to the partner list. I'm pondering on two issues about this dataset before bringin it to S3:
So before making it available I would like to know about best practices in versioning (arguably similar) datasets:
companies-no-geolocation
?What do you think @Irio?