Open r2evans opened 6 years ago
This can become tricky if the sample data exceeds the 30K question size limit, as can easily happen with database or statistics questions.
@pinobatch I don't disagree that that corner case is problematic. But in my experience (which is likely just a small speck in the big scheme), it is also the exception.
But I don't think that that alone is a problem, for a couple of reasons:
If your problem truly needs so much sample data to demonstrate the problem, then perhaps pastebin or some other large-file-sharing method can be used. But this should still be used to augment, not replace, having sample copy-able data in the question. I think it's a fair thing to say in the data something like:
My data looks like *this*:
# mpg cyl disp hp drat wt qsec vs am gear carb
# Mazda RX4 21.0 6 160 110 3.90 2.620 16.46 0 1 4 4
# Mazda RX4 Wag 21.0 6 160 110 3.90 2.875 17.02 0 1 4 4
# Datsun 710 22.8 4 108 93 3.85 2.320 18.61 1 1 4 1
but since the data is 2B rows, the full dataset can be found *here* (some link).
I think most rules have rational exceptions. I think leaning towards copy-able data as a default and first-attempt is much preferred. There are definitely questions where I find a picture of the data is sufficient to get the point across, but I find it rare that I can provide an answer to questions like that without having to generate my own data to replace that which I don't have readily available (lacking the .NORM
file format).
I keep wanting to link here when a user posts a screen capture of a dataframe. +1 for adding this images of data page.
Strongly related to "images of code", it is equally frustrating to have data (for testing or demonstration) included as an image vice something copy/paste-able. The wording could be related: