Closed sami-ka closed 2 years ago
Yes, I think it would be best to add the missing months; if you're using something like DoubleML, then in the first stage we'll fit price and quantity models and you'd want the quantity model training process to have access to the fact that 0 units were sold in those months, and you'd also want those months to be represented in the second stage when we're using the residuals from those predictions to train the final model.
Thanks for your answer !
I am currently using econml in a pricing use case.
In my dataset I have the transactions aggregated by months, indicating which product and how much a customer consumes every month. If customer 1 buys 10 units of product A in January and 5 in March, I will have something like this :
Is it ok to leave it as is?
Should I complete the dataset and have an additional row for each period of time without transactions? I have illustrated my point in the table below.