kedro-org / kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
https://kedro.org
Apache License 2.0
9.47k stars 875 forks source link

[DataCatalog]: Improve the way to access namespaced datasets with `_FrozenDataset` API #3928

Open ElenaKhaustova opened 3 weeks ago

ElenaKhaustova commented 3 weeks ago

Description

Users struggle with the _FrozenDataset's API when accessing namespaced datasets because it uses double underscores instead of dots, which they find unintuitive and cumbersome. Some prefer referring to the dataset by its original name, so they use the private _get_dataset() method instead.

We propose to:

  1. Explore the feasibility of modifying the _FrozenDataset's API to use dots instead of double underscores for namespaces, aligning with users' expectations.
  2. Provide an opportunity to call datasets by their exact names - get dataset by name function.

Relates to https://github.com/kedro-org/kedro/issues/3926

Context

User feedback:

merelcht commented 3 weeks ago

FYI the PR in which the double underscore syntax was introduced: https://github.com/kedro-org/kedro/pull/487