mlcommons / croissant

Croissant is a high-level format for machine learning datasets that brings together four rich layers.
https://mlcommons.org/croissant
Apache License 2.0
456 stars 41 forks

Cache the result of each operation. #741

Closed marcenacp closed 2 months ago

marcenacp commented 2 months ago

This aims to make it easier to generate datasets in Beam, even when they contain joins.
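
For readers unfamiliar with the idea, here is a minimal sketch of what caching the result of each operation could look like. It is not the code in this PR and does not use Croissant's or Beam's APIs: `Operation`, `execute`, `join_on_user_id`, and the sample records are all hypothetical names made up for illustration. The point it tries to show is that once every operation stores its output under its own name, a join can read both of its inputs from the cache instead of recomputing them.

```python
from typing import Any, Callable, Dict, List, Sequence


class Operation:
    """Hypothetical operation node: a name, a function, and its upstream operations."""

    def __init__(
        self,
        name: str,
        fn: Callable[..., Any],
        parents: Sequence["Operation"] = (),
    ) -> None:
        self.name = name
        self.fn = fn
        self.parents = tuple(parents)


def execute(op: Operation, cache: Dict[str, Any], log: List[str]) -> Any:
    """Run `op` after its parents, storing each result in `cache` by operation name."""
    if op.name in cache:
        # Already computed: reuse the stored result.
        return cache[op.name]
    inputs = [execute(parent, cache, log) for parent in op.parents]
    log.append(op.name)  # Record that this operation actually ran.
    cache[op.name] = op.fn(*inputs)
    return cache[op.name]


def join_on_user_id(left: List[dict], right: List[dict]) -> List[dict]:
    """Inner join of two lists of records on the (assumed) `user_id` key."""
    by_id = {row["user_id"]: row for row in left}
    return [{**by_id[r["user_id"]], **r} for r in right if r["user_id"] in by_id]


# Two source operations (made-up data) feeding a join and a second consumer.
users = Operation(
    "read_users",
    lambda: [{"user_id": 1, "name": "Ada"}, {"user_id": 2, "name": "Lin"}],
)
ratings = Operation(
    "read_ratings",
    lambda: [{"user_id": 1, "rating": 5}, {"user_id": 2, "rating": 3}],
)
joined = Operation("join", join_on_user_id, [users, ratings])
names = Operation("names", lambda rows: [r["name"] for r in rows], [users])

cache: Dict[str, Any] = {}
log: List[str] = []
joined_rows = execute(joined, cache, log)
name_list = execute(names, cache, log)  # `read_users` is served from the cache.
```

In this sketch `read_users` runs only once even though both `join` and `names` depend on it. How the actual change maps this onto Beam transforms is not described in this thread.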
