Open ElenaKhaustova opened 3 weeks ago
This seems very specific to the SnowparkTableDataset
, so I personally wouldn't tackle this as part of the other catalog work. I'll move it to the kedro-plugins
repo under the individual dataset improvements milestone.
Description
SnowparkTableDataset
dataset configuration does not have a query endpoint, so running database-level SQL queries is not possible at the catalog level. Thus users have to make it at the level of the database - at first, execute query to filter data and only after run a Kedro pipeline. Users expect it to work similar toSQLQueryDataset
andGBQQueryDataset
where they have a query endpoint.https://docs.kedro.org/projects/kedro-datasets/en/kedro-datasets-3.0.1/api/kedro_datasets.snowflake.SnowparkTableDataset.html
We propose to:
SQL
queries withIbis
in such cases instead: https://kedro.org/blog/sql-data-processing-in-kedro-ml-pipelines.Context