kedro-org / kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
https://kedro.org
Apache License 2.0

[DataCatalog]: Interface to request specific properties from catalog and datasets #3939

Open ElenaKhaustova opened 3 weeks ago

ElenaKhaustova commented 3 weeks ago

Description

Users want more efficient access to data through the public API. They suggest a GraphQL-like interface that lets them request specific properties from the catalog and its datasets, so that only the needed data is transmitted.

We propose exploring the feasibility of implementing such an interface and integrating it into the public API.
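To make the request concrete, here is a minimal sketch of what "requesting specific properties" could look like. It is not an existing Kedro API: `select_properties` and `CATALOG_CONFIG` are hypothetical names, and the catalog is modelled as a plain dictionary shaped like `catalog.yml` entries rather than a real `DataCatalog` object.

```python
from __future__ import annotations

from typing import Any, Iterable

# Hypothetical catalog configuration, shaped like entries in a catalog.yml file.
# The dataset names and file paths are illustrative only.
CATALOG_CONFIG: dict[str, dict[str, Any]] = {
    "companies": {
        "type": "pandas.CSVDataset",
        "filepath": "data/01_raw/companies.csv",
    },
    "model": {
        "type": "pickle.PickleDataset",
        "filepath": "data/06_models/model.pkl",
        "versioned": True,
    },
}


def select_properties(
    config: dict[str, dict[str, Any]],
    fields: Iterable[str],
    datasets: Iterable[str] | None = None,
) -> dict[str, dict[str, Any]]:
    """Return only the requested fields for the requested datasets.

    Unknown dataset names and fields missing from a dataset are skipped,
    so the response contains nothing the caller did not ask for.
    """
    wanted = list(fields)
    names = list(config) if datasets is None else list(datasets)
    return {
        name: {field: config[name][field] for field in wanted if field in config[name]}
        for name in names
        if name in config
    }


# Ask only for the dataset type of every catalog entry.
types_only = select_properties(CATALOG_CONFIG, fields=["type"])
```

A client that only needs dataset types would receive one small mapping per dataset instead of the full configuration, which is the reduction in transmitted data the description refers to.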

Partially related to https://github.com/kedro-org/kedro/issues/143

Context

merelcht commented 3 weeks ago

The server part of this task is way out of scope for the DataCatalog API improvements, but I have a feeling that if we tackle:

we make it easier for those who have a server setup to get the information they need from the catalog.