splicemachine / pysplice

Apache License 2.0
5 stars 6 forks source link

Dbaas 5440 - Support training sets as pandas dataframes #149

Closed Ben-Epstein closed 3 years ago

Ben-Epstein commented 3 years ago

Support returning training sets as pandas dataframes dataframes (under 50MB)

Description

This enables users to use the featurestore without spark and get small datasets as pandas dataframes

Motivation and Context

More flexibility with the feature store usage, no need for spark on small datasets.

Dependencies

ml-workflow

How Has This Been Tested?

Against K8s

Screenshots (if appropriate):

Changelog Inclusions

Additions

Support training sets as JSOn and Pandas (under 50MB)

Changes

Fixes

Deprecated

The return_sql parameter is deprecated and will be removed in a future release

Removed

Breaking Changes