Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
When having parameters in conf/base/sub_folder/parameters_A.yml structure where sub_folder is symlinked, the pipeline gives the following error, however, it is able to read the parameters from the following structure: conf/base/parameters_A.yml
ValueError: Pipeline input(s) {'params:length', 'params:width'} not found in the DataCatalog
Context
The issue is that fsspec.filesystem.glob() which we use to find the paths recursively doesn’t find symlinks.
Description
When having parameters in
conf/base/sub_folder/parameters_A.yml
structure wheresub_folder
is symlinked, the pipeline gives the following error, however, it is able to read the parameters from the following structure:conf/base/parameters_A.yml
ValueError: Pipeline input(s) {'params:length', 'params:width'} not found in the DataCatalog
Context
The issue is that
fsspec.filesystem.glob()
which we use to find the paths recursively doesn’t find symlinks.https://github.com/kedro-org/kedro/blob/adfc593bcd2f1b74676e7ab7c1a3b9c168b7257f/kedro/config/omegaconf_config.py#L295
Steps to Reproduce
For a default spaceflights-pandas project create a symlink folder and place![Screenshot 2024-06-28 at 14 55 13](https://github.com/kedro-org/kedro/assets/157851531/976a94ba-b8f0-4559-8a4c-05d90a48119a)
parameters_data_science.yml
in the linked folder.Run the pipeline.
Expected Result
Symlinks are found when loading configuration files.
Your Environment
pip show kedro
orkedro -V
): kedro, version 0.19.6python -V
): Python 3.11.9