Currently ParquetDataset can receive partition as a parameter, but it's not possible to set a partition_base_dir. We need to expose this in the constructor parameters and pass it in the ds.dataset call.
This is useful when we're loading datasets structured with DirectoryPartitioning, and passing in a list of files
Describe the enhancement requested
Currently ParquetDataset can receive partition as a parameter, but it's not possible to set a partition_base_dir. We need to expose this in the constructor parameters and pass it in the ds.dataset call.
This is useful when we're loading datasets structured with DirectoryPartitioning, and passing in a list of files
Component(s)
Parquet