Closed sn3fru closed 6 years ago
Thank you! And thanks for initiating interesting discussions here.
There's a few things to keep in mind when introducing what I call global features. One such example features is 'mean number of commits yesterday among the whole population' or 'temperature yesterday'. In short : go for it! But be careful.
First thing is wtte/survival specific issue. Don't disclose to the algo things that helps it know if a timestep of a seq. is censored or not. The second one is a more general forecasting problem.
In my experience, adding this type of feature unintuitively decreased sequence-specific overfitting and hence only had moderate to no effect on the type-1 problem above. One explanation is that the algo dont need to focus big chunk of the network to inferring the global time from the features when it gets it for free.
Check out the feature mean_commits_global
in data pipeline example.
First of all, congratulations on the great work with wtte.
My question is about different periods in time. We know that the behavior is not constant, that in some months or during the dawn the flow is naturally lower if the learning in creating its curves of theoretical time series takes this behavior into account.