Closed yy closed 6 years ago
Thanks – these can be fixed by creating classes for each of these datasets in https://github.com/altair-viz/vega_datasets/blob/master/vega_datasets/core.py with appropriately modified values of _pd_read_kwds
If you're interested in submitting a pull request to fix these, I'd be happy to help you get started.
Fixed in #17 and #18
Expected:
dtype('O')
or ratherCategoricalDtype(categories=['00501', '00544', ...
.Actual:
dtype('int64')
Some ZIP codes starts with "0" and
zipcodes = data.zipcodes()
removes all preceding zeros. The following works, but I think it's better to return with the correct dtypes by default.Also found that
data.unemployment()
cannot correctly parse the data. One should specify the separatordata.unemployment(sep='\t')
.