Open AngeloFrigeri opened 2 days ago
I tried the snippet below and could not reproduce this locally, so it may be specific to S3. @AngeloFrigeri, can you check that the partition columns in your code are correct and that the issue is not there?
from deltalake import convert_to_deltalake
import pyarrow as pa
import pandas as pd
df = pd.DataFrame(data={'blaaPara': ['a', 'a', 'b'],
                        'year': [2020, 2020, 2021],
                        'month': [1, 12, 2],
                        'day': [1, 31, 28],
                        'value': [1000, 2000, 3000]})
df.to_parquet('./mydf', partition_cols=['blaaPara'])
convert_to_deltalake(
    './mydf',
    partition_by=pa.schema(
        [
            pa.field("blaaPara", pa.string()),
        ]
    ),
    partition_strategy="hive"
)
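One way to check the partition columns, as asked above, is to read the keys straight from the directory names of a local copy of the table. The sketch below is only an illustration: `hive_partition_layouts` is a hypothetical helper, not part of deltalake, and it assumes hive-style `key=value` directories with data files ending in `.parquet`. It uses only the standard library and does not talk to S3.

```python
from pathlib import Path
from typing import Dict, Tuple


def hive_partition_layouts(root: str) -> Dict[Tuple[str, ...], int]:
    """Count the parquet files under `root` per partition-key layout.

    Hypothetical helper: assumes hive-style `key=value` directory names.
    """
    layouts: Dict[Tuple[str, ...], int] = {}
    root_path = Path(root)
    for file in root_path.rglob("*.parquet"):
        # Directory components between the table root and the file itself.
        parts = file.relative_to(root_path).parts[:-1]
        # Keep the key of every `key=value` directory, in path order.
        keys = tuple(p.split("=", 1)[0] for p in parts if "=" in p)
        layouts[keys] = layouts.get(keys, 0) + 1
    return layouts


if __name__ == "__main__":
    # './mydf' is the table written by the snippet above.
    for keys, count in hive_partition_layouts("./mydf").items():
        print(keys, count)
```

If more than one layout is reported, or the keys differ from the fields passed in `partition_by`, the partition definition and the files on disk disagree, which would be worth ruling out before looking at S3-specific causes.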
Environment
Delta-rs version: 0.18.1
Binding: python 3.8.19
Environment:
Bug
What happened: When converting a partitioned Parquet table to a Delta table, I got the following error:
What you expected to happen: A delta log folder to be created on our S3 path
How to reproduce it:
More details: