Open aterenin opened 6 years ago
How big is the dataframe? Can you upload it here?
Can you give us the output of eltypes(d_p2)
? Also, does using a random DataFrame of the same size cause the problem or it's just this specific DataFrame when large enough?
i.e. does using DataFrame(rand(size(d_p2)))
cause the problems too?
It's kinda hard to diagnose with the current info. :/
Here you go.
https://www.dropbox.com/s/q4wdo6qplvt9d3a/logmargpost-enron-10.csv?dl=1 https://www.dropbox.com/s/8g69br7rsfos1jz/logmargpost-enron-100.csv?d=1
Import with
d_p2 = vcat(
CSV.read("experiments/logmargpost-enron-10.csv", nullable=false) |> x -> begin x[:dataset] = repeat("Enron\nK=10", size(x)[1]); x end,
CSV.read("experiments/logmargpost-enron-100.csv", nullable=false) |> x -> begin x[:dataset] = repeat("Enron\nK=100", size(x)[1]); x end
)
eltypes(d_p2)
String
Int64
Float64
Float64
Int64
Int64
String
String
String
Hi @aterenin, do you still have these CSVs available? The links appear to be dead.
Re-uploaded.
https://www.dropbox.com/s/lbreovxavndy8gv/logmargpost-enron-10.csv?dl=0 https://www.dropbox.com/s/07mfq6fbnxtqncv/logmargpost-enron-100.csv?dl=0
Please let me know when you've grabbed them so I can take the links down.
Can someone retest this on Gadfly 1.0.1?
i would re-test, but the links are dead again.
I'm porting some ggplot code that works in R to Gadfly. With a sufficiently large DataFrame, I run the following code.
This throws the following error with a rather unhelpful stack trace.
I tried to come up with a minimum working example, but was unable to reproduce the problem for small data frames. There shouldn't be anything weird about my CSV - it works correctly in R. I'd be happy to provide it for further examination.