rjweiss / CaliforniaGreatRegister

Extract gender for 1912 #19

Open rjweiss opened 8 years ago

rjweiss commented 8 years ago

Gender is introduced in the 1912 data for Alameda. Parse it out.

rjweiss commented 8 years ago

Will try to add "gender" as a new column, even though for most years it will be NA.

rjweiss commented 8 years ago

I can add gender as a new column, but for some reason this causes a severe drop in rows for both Alameda and SB. Need to track down why this is happening.

bspahn commented 8 years ago

FYI, using the file currently in working data, gender is still going into the occupation field.

    dat %>%
      filter(yr == 1912) %>%
      select(recordnum, pagenum, occupation, name, address) %>%
      head %>%
      tail(5)

      recordnum pagenum      occupation           name       address
    2     50774       6  fern housewife   Bragg Willia 761 Willow st
    3     50775       6    male laborer   Bridge James 421 Willow st
    4     50776       6   fem housewife   Bridge Viola 421 Willow st
    5     50777       6 fem housekeeper    Brown Ellen   1714 5th st
    6     50778       6     male barber Brown Samuel T   1735 7th st
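
One possible way to pull the leading gender token out of `occupation`, sketched in base R. This is not code from the repository: the helper name `split_gender`, the token list (`male`, `fem`, and `fern`, which looks like an OCR misread of `fem`), and the mapping of `fem` to `"female"` are all assumptions based only on the five rows shown above. Rows without a recognized token keep their occupation and get `NA` for gender.

```r
# Hypothetical helper: split a leading gender token off the occupation field.
split_gender <- function(df) {
  # Tokens seen in the sample rows; "fern" is assumed to be an OCR variant of "fem".
  pattern <- "^(male|fem|fern)[[:space:]]+"
  has_gender <- grepl(pattern, df$occupation)

  # First word of occupation where a token is present, NA otherwise.
  token <- ifelse(has_gender,
                  sub("[[:space:]].*$", "", df$occupation),
                  NA_character_)

  # Assumed normalization: "male" stays "male", "fem"/"fern" become "female".
  # ifelse() propagates NA, so rows without a token stay NA.
  df$gender <- ifelse(token == "male", "male", "female")

  # Strip the token so occupation holds only the occupation.
  df$occupation <- sub(pattern, "", df$occupation)
  df
}

# Occupations copied from the output above, plus one made-up row ("farmer")
# to show what happens when no gender token is present.
sample_rows <- data.frame(
  occupation = c("fern housewife", "male laborer", "fem housekeeper", "farmer"),
  stringsAsFactors = FALSE
)

out <- split_gender(sample_rows)
print(out)
```

Because the function only rewrites columns and never filters, the row count before and after should match, which may help when checking for the row drop mentioned earlier in this thread.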