asardaes / dtwclust

R Package for Time Series Clustering Along with Optimizations for DTW
https://cran.r-project.org/package=dtwclust
GNU General Public License v3.0
252 stars 29 forks source link

Definition of multivariate time series #54

Closed vidarsumo closed 2 years ago

vidarsumo commented 2 years ago

I was just reading Vignette with theory and I'm a bit confused when you talk about multivariate time series.

Do you differentiate between many time series and multivariate time series?

I have a data set of 6,000 time series (sales data from one client) and I want to create clusters. These can be very different products like cosmetics, beverages and candy. I simply look at this as many time series, some are related, like a subgroup of the beverages, but many are not, like lipstick and a soda. The products are of very different length, many only with 1 or 2 observation (1-2 months of monthly sales data) while other have many months of data.

I'm not sure if I need to do anything special after reading this vignette because I got a bit confused when you mentioned multivariate time series.

asardaes commented 2 years ago

Have you checked the sample data included in CharTrajMV? I'd say that considering series as multivariate can be subject to interpretation. In said data, every multivariate series contains 3 univariate series that were recorded for the same individual doing the same character, so there are many series, but some of them can be considered together as one because they came from the same "entity".

vidarsumo commented 2 years ago

Ok that's what I thought but wasn't sure. I was a bit confused since in a field like economics you talk about multivariate model like VAR but there you might have few time series from different "entity", i.e. wage, inflation, exchange rate, unemployment etc.