Open juliasilge opened 4 years ago
Agreed that this would be helpful! An example of where I might use this is if I wanted to model the probability a member/customer takes a certain action each month for the next 12 months. Using the binomial framework above, this gives me the probability for each month (assuming independence), and I can also estimate the number of months they will take an action. Without this option, I can model the probability that they will take an action in at least one of the months, but that doesn't quite get me what I'd need. I hope that explanation helps.
In situations where users need to model an outcome that is a proportion (such as clicks out of impressions, registrations out of visits, etc) a useful approach is to use a generalized linear model with
family = binomial
(i.e. just likeparsnip::logistic_regression()
) but in the formula, instead of a factor, specify the response as a "two-column matrix with the columns giving the numbers of successes and failures", according to the docs.We don't currently support this. This is what it looks like using the underlying
glm()
function, and this is the error we currently get trying to use parsnip:Created on 2020-02-24 by the reprex package (v0.3.0)