Probably easiest to keep it simple, e.g. a dataset is a nested array of a single datatype, all double. Rows (first index) = observations, columns (second index) = columns.
Also likely easiest to include response as a column in the dataset, not as a separate vector.
As a convention, response is always rightmost/last column.
I keep having to define variables of type std::vector<std::vector<double>> every time a new dataset is used or created. Maybe "dataset" should be a class?
Probably easiest to keep it simple, e.g. a dataset is a nested array of a single datatype, all double. Rows (first index) = observations, columns (second index) = columns.
Also likely easiest to include response as a column in the dataset, not as a separate vector. As a convention, response is always rightmost/last column.