larsvilhuber / MobZ

https://larsvilhuber.github.io/MobZ/
Question re CBP data #30

Closed larsvilhuber closed 4 years ago

larsvilhuber commented 4 years ago

https://github.com/larsvilhuber/MobZ/blob/e64870c460238bc35fc43ff83a9be1c7fb76cf60/programs/07_adh/00_03_cbp_readin.do#L28

This line just renames emp to manuf_emp (manufacturing employment)?

But the incoming data (if it is the same file) covers all industries. Shouldn't there be a subsetting step here somewhere?

andrewfoote commented 4 years ago

@larsvilhuber The data sets only include the manufacturing data. Somewhere in the past, I apparently extracted it so that I only took the observations where NAICS = "31----" in the raw data. Despite looking for 30 minutes, I have no idea where or how I did that.
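
For reference, a minimal sketch of what that subset might look like in Stata. It assumes the raw CBP file is already in memory and that the industry code is stored in a string variable called naics (that variable name is an assumption, not taken from the repository); emp and manuf_emp are the names used in the linked do-file.

```stata
* Sketch only: assumes the raw CBP data are already loaded and that
* the industry code is a string variable named naics (assumed name).

* Keep only the manufacturing observations
keep if naics == "31----"

* Same rename as in 00_03_cbp_readin.do
rename emp manuf_emp

* Check how many observations remain per year
tab year
```

The final tab year is only a sanity check on the number of observations kept per year.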

larsvilhuber commented 4 years ago

@andrewfoote OK, I will implement that in the subsetting code.

larsvilhuber commented 4 years ago

@andrewfoote Could you check whether a58ae0c fixes the problem? At the end, I get:

. tab year 

       Year |      Freq.     Percent        Cum.
------------+-----------------------------------
       1990 |      3,114       33.62       33.62
       2000 |      3,067       33.12       66.74
       2007 |      3,080       33.26      100.00
------------+-----------------------------------
      Total |      9,261      100.00

andrewfoote commented 4 years ago

@larsvilhuber That matches my numbers.