socialfoundations / folktables

Datasets derived from US census data
MIT License
240 stars 19 forks

Preprocess census #23

Closed: bargavj closed this 2 years ago

bargavj commented 2 years ago

Added comments.

mrtzh commented 2 years ago

Hi! I appreciate the use of folktables and the work you put into this.

Is there anything specific to this preprocessing that you think should be part of the main codebase?

From reading it, this looks to me like one reasonable way to preprocess the data, but many others could be suitable depending on the application. The main codebase is not the right place to collect them. My sense is that this code is better kept in the repository of whatever project you're using it for.

If you're looking for a dataset that's very similar to UCI Adult, you may want to consider adult_reconstruction.csv.