Closed by hermands 5 years ago
I think we should follow the R for Data Science terminology and refer to columns as variables and rows as observations.
My personal preference would be column. I think that's what our audience would call them. R/tidyverse isn't really consistent about these things: select() does use variables, but what about cbind() and bind_cols()?
R4DS says something like: in tidy data, each variable must have its own column and each observation its own row. So to me, variable and observation are ideas in the data, while a column is that vertical thingee in a data frame.
Anyone convinced?
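A minimal sketch of that distinction in base R, assuming a made-up data frame (the names `visits`, `patient_id`, `glucose`, and `sodium` are hypothetical, just for illustration):

```r
# Hypothetical tidy data: each variable has its own column,
# each observation (here, one patient) has its own row.
visits <- data.frame(
  patient_id = c(1, 2, 3),
  glucose    = c(90, 110, 85)
)

ncol(visits)  # 2 columns, one per variable
nrow(visits)  # 3 rows, one per observation

# cbind() talks about the physical structure: it attaches a new column.
# In tidy data that column also happens to hold one new variable.
wider <- cbind(visits, sodium = c(140, 138, 142))
names(wider)  # "patient_id" "glucose" "sodium"
```

The data frame only knows about columns and rows; whether a column counts as a "variable" depends on the data being tidy in the first place.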
I think it's fine to use both, and I agree that column is the physical description of the data structure and variable is an idea. I think we just want to avoid using multiple terms for the same idea.
That works for me as well.
Sounds good.
I changed my language to variables & columns, rows & observations.
@jwrudolf @amromeo Sorry if I missed this earlier.
I was going to use 'variables'... What is our convention?