jmbejara / comp-econ-sp18

Main Course Repository for Computational Methods in Economics (Econ 21410, Spring 2018)
16 stars 23 forks source link

Which samples to select for the IPUMS data? #31

Closed vuphan95 closed 6 years ago

vuphan95 commented 6 years ago

I know which variables to select but forgot which samples to select. Right now I have selected only 18 ASEC samples for the years 2000-2017 (I deselect all the monthly samples). This in total is about 130 MB which seems wrong because the sample output of df.info() seems to suggest that the correct data has to be about 207 MB. Please remind me about which samples to select. Thank you so much!

jmbejara commented 6 years ago

Yep. Those look like the right samples! DataFrame.info() gives the memory usage. This will often be different from the total storage size of the file. It could often be larger.