kedro-org / kedro-viz

Visualise your Kedro data and machine-learning pipelines and track your experiments.
https://demo.kedro.org
Apache License 2.0

Add Dataset Factory Patterns to Experiment Tracking #1824

Closed: ravi-kumar-pilla closed this 3 months ago

ravi-kumar-pilla commented 3 months ago

Description

Resolves #1689

Development notes

NOTE:

Earlier we used catalog.exists() on the datasets for discovery, which caused the following issues -

  1. Users could not use Kedro Viz if a dataset defined in the catalog did not exist (this was resolved using exception handlers).
  2. Users hit time-outs when the datasets resided remotely and in a distributed cluster.
  3. exists() calls _get_dataset internally and then also calls AbstractDataset.exists. We are removing this additional call by calling _get_dataset directly.
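The difference between the two discovery paths can be sketched with stand-in classes. `FakeRemoteDataset` and `FakeCatalog` below are hypothetical and only mimic the call structure described in point 3; they are not Kedro's implementation, and the counter stands in for what could be a slow remote check.

```python
class FakeRemoteDataset:
    """Hypothetical dataset whose exists() stands in for a slow remote check."""

    def __init__(self):
        self.exists_calls = 0

    def exists(self):
        # In a real deployment this could be a network round trip.
        self.exists_calls += 1
        return True


class FakeCatalog:
    """Hypothetical catalog mirroring the call structure described above."""

    def __init__(self, datasets):
        self._datasets = datasets

    def _get_dataset(self, dataset_name, suggest=True):
        # Lookup only: nothing here touches the underlying storage.
        # `suggest` is accepted for signature parity and ignored in this sketch.
        return self._datasets[dataset_name]

    def exists(self, name):
        # Lookup plus an existence check on the dataset itself.
        try:
            dataset = self._get_dataset(name)
        except KeyError:
            return False
        return dataset.exists()


metrics = FakeRemoteDataset()
catalog = FakeCatalog({"metrics": metrics})

catalog.exists("metrics")  # old discovery path: triggers the existence check
calls_after_exists = metrics.exists_calls

catalog._get_dataset("metrics", suggest=False)  # new path: lookup only
calls_after_get = metrics.exists_calls
```

In this sketch the counter only increases on the `exists()` path, which is the extra call the PR removes.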

Example patterns -

[Image: Screenshot 2023-10-18 at 1 11 46 PM]

QA notes

Checklist

ravi-kumar-pilla commented 3 months ago

CircleCI Build fix - https://github.com/kedro-org/kedro-viz/pull/1819

ravi-kumar-pilla commented 3 months ago

Hi @iamelijahko,

I know you are working on refactoring the catalog API, so I wanted to bring this to your attention in case it helps -

At the moment, we rely on a private method of DataCatalog as a workaround for resolving dataset factory patterns in one of our use cases: catalog._get_dataset(dataset_name, suggest=False)
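One way to limit the exposure to that private method is to keep the call in a single helper, so a future public API only needs to be swapped in one place. The sketch below is an assumption-laden illustration: `resolve_dataset` and `PatternCatalog` are hypothetical names, the pattern matching is a simplified regex stand-in rather than Kedro's own resolution logic, and the dataset type strings are illustrative labels only.

```python
import re


def resolve_dataset(catalog, dataset_name):
    """Hypothetical helper that keeps the private call in one place.

    Returns the resolved dataset, or None if the name cannot be resolved.
    """
    try:
        return catalog._get_dataset(dataset_name, suggest=False)
    except Exception:
        # Broad catch so the sketch does not depend on a specific
        # not-found exception class.
        return None


class PatternCatalog:
    """Hypothetical stand-in for a catalog with dataset factory patterns."""

    def __init__(self, explicit, patterns):
        self._explicit = explicit  # dataset name -> dataset description
        self._patterns = patterns  # pattern string -> dataset type label

    def _get_dataset(self, dataset_name, suggest=True):
        # Explicit catalog entries take priority over patterns.
        if dataset_name in self._explicit:
            return self._explicit[dataset_name]
        for pattern, dataset_type in self._patterns.items():
            # Turn each "{placeholder}" into a named regex group.
            regex = re.sub(r"\\\{(\w+)\\\}", r"(?P<\1>.+)", re.escape(pattern))
            match = re.fullmatch(regex, dataset_name)
            if match:
                return {"type": dataset_type, "placeholders": match.groupdict()}
        raise KeyError(dataset_name)


catalog = PatternCatalog(
    explicit={"model_input": {"type": "pandas.CSVDataset"}},
    patterns={"{name}_metrics": "tracking.MetricsDataset"},
)

resolved = resolve_dataset(catalog, "company_metrics")  # matched by the pattern
missing = resolve_dataset(catalog, "unknown")  # no entry and no matching pattern
```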

Thank you