databrickslabs / tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, downsampling, and interpolation
https://pypi.org/project/dbl-tempo

Functions to transform into label / feature-vector format for model training & scoring #1

Closed tnixon closed 3 years ago

tnixon commented 4 years ago

We need helper functions that group time series and roll them into a tensor/vector format suitable for use as feature vectors in model training & scoring.

The functions should allow for collection of values / aggregation across 2 dimensions:

This function may be a special case of a more generic rolling-window aggregation.
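To make the request concrete, here is a minimal sketch in plain Python of what such a helper might do. It is not tempo's API: the function name `to_feature_vectors`, the `(series_key, timestamp, value)` row layout, and the choice of "previous `window` values as features, current value as label" are all assumptions made for illustration.

```python
from collections import defaultdict


def to_feature_vectors(rows, window):
    """Hypothetical helper: roll each series into (features, label) pairs.

    rows   -- iterable of (series_key, timestamp, value) tuples (assumed layout)
    window -- number of preceding observations used as the feature vector
    """
    # Group observations by series key.
    by_key = defaultdict(list)
    for key, ts, value in rows:
        by_key[key].append((ts, value))

    out = []
    for key in sorted(by_key):
        # Order each series by timestamp before windowing.
        series = sorted(by_key[key], key=lambda pair: pair[0])
        values = [v for _, v in series]
        # Emit one record per observation that has a full window behind it.
        for i in range(window, len(series)):
            features = values[i - window:i]
            label = values[i]
            out.append((key, series[i][0], features, label))
    return out


rows = [
    ("a", 1, 10.0), ("a", 2, 11.0), ("a", 3, 12.0), ("a", 4, 13.0),
    ("b", 1, 1.0), ("b", 2, 2.0),
]
vectors = to_feature_vectors(rows, window=2)
```

Series `"b"` produces no records here because it has no observation with two predecessors. A Spark version would presumably express the same idea with a window partitioned by series key and ordered by timestamp, which is why this can be seen as one case of a general rolling-window aggregation.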

tnixon commented 3 years ago

Resolved by #13