Bug in train/val split for data in long format

SalesforceAIResearch / uni2ts

[ICML2024] Unified Training of Universal Time Series Forecasting Transformers

Apache License 2.0

611 stars 49 forks source link

Bug in train/val split for data in long format #27

Closed Mo-Abdelhameed closed 1 month ago

Mo-Abdelhameed commented 2 months ago

While checking the code for splitting the data into train/val datasets, I came across this code.

I think this logic of splitting works well for wide format but not long format. For long format we should group by the series_id column, do the filtering and then concatenate the data again.

Can you please look into this?

Thanks