JuliaStats / RDatasets.jl

Julia package for loading many of the data sets available in R
GNU General Public License v3.0
160 stars 56 forks source link

In Bwages the "sex" factor is loaded as all NAs #33

Closed pkofod closed 9 years ago

pkofod commented 9 years ago

When using dataset("Ecdat", "Bwages"), the factor "Sex" is loaded as NAs instead of "male"/"female".

garborg commented 9 years ago

Thanks for the report! I've got some other fixes almost ready to go -- I'll try to get to them and this today or tomorrow and push a patch to METADATA.

pkofod commented 9 years ago

Great, the data sets are nice for presenting statistics in Julia to people.

[off-topic: I used Housing to show the use of colwise(), and by() instead.]

garborg commented 9 years ago

I'm seeing this upstream (checked Ecdat latest, 0.2-0, and 0.1-5) -- please reopen if you find a source with valid gender data.

Re: using Housing -- @cgoldammer has been improving the DataFrames.jl docs, but more good, salient examples always helps -- if you're free to put in a PR to make your examples the examples for colwise() and by() (assuming they're just a couple/few lines each), that would be greatly appreciated.

pkofod commented 9 years ago

Does that mean, that you can actually write datasets("Ecdat","Bwages") and get the sex-column filled with something else than NAs?

The data set can be found here http://eu.wiley.com/legacy/wileychi/verbeek2ed/datasets.html

StoneCypher commented 9 years ago

i think he means the place the data is coming from is where the breakage is

garborg commented 9 years ago

Yep, that's what I meant, but I'll bring in the right data from the link you provided if I can.