ropensci / gendercoder

Creating R package to code free text gender responses
https://docs.ropensci.org/gendercoder/
Other
46 stars 12 forks source link

Add more options to the dictionary #16

Open ekothe opened 5 years ago

ekothe commented 5 years ago

We'd love feedback or pull requests to add terms to the dictionary.

You can do this by adding entries to GenderDictionary.csv

kylehamilton commented 5 years ago

Pull request #17 included additional terms for common misspellings of androgynous. I'll add more over time.

KyleHaynes commented 5 years ago

Any interest in extending the dictionaries to include colloquial terms, such as Dude, Chick, Guy, Gal, Bloke, Fella?

A lot of records potentially missing hyphen variations as well. e.g.:
"Brauer, who identifies as gender-queer and prefers the"
https://edition.cnn.com/2016/04/13/living/gender-fluid-feat/index.html

ekothe commented 5 years ago

@KyleHaynes So far this has mostly been based on sample data that we have in terms of how people have answered this question in our "wild" data. We've just run a study with some participants who did identify as "gender-queer" and "gender-fluid" so I would be inclined to add those (feel free to put in a pull request to add them if you want - it will probably take me a bit longer to get to it).

I'm not sure about colloquial terms since I've not had them come up in real data. If people have thoughts on potential downsides to adding them I'd be interested to know.

ekothe commented 5 years ago

New terms reported by Nils Reimer to be added to the dictionary (via email)

  gender_dictionary <- list(
    email = "female",
    `my sex is female` = "female",
    makle = "male",
    femalew = "female",
    `female only` = "female"
  )
Christian-T commented 3 years ago

Please add "masculine" = "male"; "feminine"= "female" works already