Closed adrfantini closed 4 years ago
Quick tip with your data cleaning - lubridate::round_date()
is a nice way to fix minor irregularities in the measurement time.
It looks to me that the plots are working as intended. They're plotting all of the data you've provided, using the visualisation method specified (season plots, subseries plots, etc).
If the information is overwhelming, you can consider using a different plot type (such as calendar plots with sugrrants) or aggregating/summarising/focusing your data in some way.
Quick tip with your data cleaning -
lubridate::round_date()
is a nice way to fix minor irregularities in the measurement time.
Oooh, thanks!
It looks to me that the plots are working as intended.
Yes, I agree. Everything is working well, however feasts
could aim at implementing additional tools that could be applied to larger datasets, such as this one. One example could be using geom_smooth
for gg_season
, maybe?
aggregating/summarising/focusing your data in some way.
Could feasts
maybe facilitate this?
Yes, I agree. Everything is working well, however
feasts
could aim at implementing additional tools that could be applied to larger datasets, such as this one. One example could be usinggeom_smooth
forgg_season
, maybe?
As feasts
is using ggplot2
to produce the graphics, you should be able to add geom smooth to the result. i.e. data %>% gg_season(y) + geom_smooth()
.
We have some people in our research group thinking about how time series graphics can be designed for larger datasets, and automatic identification of informative graphics for time series. New methods and tools will be designed to work with these packages.
Could
feasts
maybe facilitate this?
tsibble
supports this by providing temporally aware versions of dplyr verbs.
Aggregate would be to summarise()
to combine multiple time series (keys) together.
Summarising (or perhaps more accurately, temporal aggregation) can be done with dt %>% index_by(...) %>% summarise()
.
Focusing on a certain time period can be achieved with filter()
data %>% gg_season(y) + geom_smooth()
This smooths every single line, I guess because it is missing a group aesthetic for smoothing the different lines. If I am not mistaken this is something that can't be done outside of gg_season
since grouping needs to happen on the the internally defined id variable.
We have some people in our research group thinking about how time series graphics can be designed for larger datasets
That's excellent to know! Feel free to close this or use it as a possible usecase / discussion zone for this.
I am testing
feasts
with a long time series: 5 years, hourly, 45k observations. The defaultfeasts
functions seem to struggle a bit with all this information: despite the fact that the series exhibits strong seasonalities (daily, weekly and yearly), these are hard to understand from thefeasts
output. This might be the case of me using the functions in a wrong way, or maybe the functions could be improved to handle better large datasets.Here is an example (I'll not upload all plots, just a couple as examples):
Here are a couple of examples of the above plots (season and subseries):
Can something be done to improve the visualisation of large(r) datasets?