Status: Open. picousse opened this issue 10 months ago.
Also, datatypes have to be pyarrow datatypes in the consumes. This was not clear to me based on https://github.com/ml6team/fondant/tree/main/components/load_from_parquet
- Data path. This is the path in the Docker container (/data/...). This is unclear based on the documentation (or I might have missed it).
This is the path on your local (or remote) file system, which will be mounted into the Docker container. Is that how you understood it, or did you understand it differently?
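To make the host path versus container path distinction concrete, here is a minimal sketch in plain Python. It does not use Fondant and does not claim to show how Fondant mounts volumes. The mount point `/data` comes from the example above, while the host directory and the helper name `to_container_path` are hypothetical.

```python
from pathlib import PurePosixPath


def to_container_path(host_path: str, host_mount: str, container_mount: str) -> str:
    """Translate a path on the host into the path seen inside the container.

    Assumes host_mount is bind-mounted at container_mount
    (as in a Docker volume spec "host_mount:container_mount").
    Raises ValueError if host_path is not located under host_mount.
    """
    relative = PurePosixPath(host_path).relative_to(PurePosixPath(host_mount))
    return str(PurePosixPath(container_mount) / relative)


# Hypothetical host directory, mounted at /data inside the container.
print(to_container_path(
    "/home/me/project/data/train.parquet",
    "/home/me/project/data",
    "/data",
))
```

The point of the sketch is that the same file has two names: the one you give on your own file system, and the one the code inside the container sees.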
- For load_from_parquet, the produces values are crucial. There is no type inference.
Indeed, I think this is documented both in our general documentation and the component documentation.
- Also, datatypes have to be pyarrow datatypes in the consumes. This was not clear to me based on https://github.com/ml6team/fondant/tree/main/components/load_from_parquet
This is indeed not clearly documented in the component documentation. Would be good to add.
Hi, some minor things I encountered when running locally.
What was unclear to me:
- For load_from_parquet, the produces values are crucial. There is no type inference.
I read https://fondant.ai/en/latest/pipeline/ and neither issue seemed clear to me from it.