dmbee / seglearn

Python module for machine learning time series:
https://dmbee.github.io/seglearn/
BSD 3-Clause "New" or "Revised" License

question on ts_data #55

Closed by BrannonKing 3 years ago

BrannonKing commented 3 years ago

I have a simple 2D array of data with samples along the first dimension (rows) and sensors along the second (columns). I understand this is called the "wide" format. It's fairly large: roughly 2 million samples from 100+ sensors. The samples are in order, but there is no date/time column. I also have a separate set of labels with the same number of samples. I don't understand how to convert this into the required format; I get the error "object has no attribute 'ts_data'". What value is supposed to go in the to-be-created ts_data column?

dmbee commented 3 years ago

The dataset format is a list or object array of time series. If your dataset contains just one time series, you need to wrap it in a list (put [ ] around it) when passing it to the segmenter or pipeline objects. This requirement is explained in the user guide.

dmbee commented 3 years ago

https://dmbee.github.io/seglearn/user_guide.html
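
The wrapping described in the reply above can be sketched roughly as follows. This is a minimal illustration using only NumPy. The array shapes are small stand-ins for the data described in the question, and wrapping the labels in the same way as the data is an assumption rather than something the reply states. The result would then be passed to the segmenter or pipeline as explained in the user guide.

```python
import numpy as np

# Stand-in for the real data (the question describes roughly 2 million
# samples from 100+ sensors); these small shapes are assumptions.
n_samples, n_sensors = 1000, 5
data = np.random.rand(n_samples, n_sensors)        # one multivariate time series
labels = np.random.randint(0, 2, size=n_samples)   # one label per sample

# Wrap the single series in a list so the dataset becomes
# "a list of time series" that happens to contain one element.
X = [data]

# Assumption: per-sample labels are wrapped the same way, so that
# y[i] lines up with X[i].
y = [labels]

print(len(X), X[0].shape)  # 1 (1000, 5)
print(len(y), y[0].shape)  # 1 (1000,)
```

Each element of `X` is a 2D array of shape (samples, sensors), so a dataset with several recordings would simply be a longer list, and the recordings would not need to have the same length.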