uncharted-distil / distil-primitives

Distil AutoML primitives
Apache License 2.0
2 stars 1 forks source link

Create timeseries re-binning common primitive #126

Closed cdbethune closed 4 years ago

cdbethune commented 5 years ago

Add a primtive to common primitives to allow for timeseries data to be re-binned using different time buckets. At minimum, we should support seconds, minutes, hours, days, weeks, months, years.

cdbethune commented 4 years ago

Timeseries data will be passed into the binning primitive in the agreed upon D3M long format, where data is stored in a single dataframe, with each series having a unique identifier associated with it. The series values themselves will be a combination of a date/time or sequence value, and a set of measures.

The primitive should use the following parameters:

Column Parameters:

Binning Parameters:

The binning process will bin the data for a given series ID to the granularity requested using the supplied operation. All bins must be returned - if no data is present for a given bin it should be set to NaN.