Closed arpit94 closed 1 week ago
Hi @arpit94, thanks for opening this issue. There are many moving parts here:
It seems to me that the Athena and Postgres are not required to reproduce the issue. Instead, the issue is that the following instruction works:
CREATE OR REPLACE TABLE tbl AS FROM read_parquet('https://duckdb.org/data/prices.parquet', union_by_name = true);
But this one doesn't:
COPY tbl FROM 'prices.parquet' (FORMAT PARQUET, UNION_BY_NAME true);
Not implemented Error: Unsupported option for COPY FROM parquet: UNION_BY_NAME
You can work around your issue by first using a CREATE [OR REPLACE] TABLE
command to create an interim table, then manually copy the content of that table to your target Postgres table.
What happens?
I am trying to run a COPY FROM command to pull data from Parquet files in S3 and load it into a PostgreSQL database. The S3 directory contains multiple files.
I am encountering the following error:
Additionally
To Reproduce
Issue : There is no way to add union_by_name=True to the COPY FROM command, and this is already a COPY command.
OS:
linux
DuckDB Version:
1.0.0
DuckDB Client:
python
Full Name:
Arpit Aggarwal
Affiliation:
Candor Health
What is the latest build you tested with? If possible, we recommend testing with the latest nightly build.
I have tested with a stable release
Did you include all relevant data sets for reproducing the issue?
No - I cannot share the data sets because they are confidential
Did you include all code required to reproduce the issue?
Did you include all relevant configuration (e.g., CPU architecture, Python version, Linux distribution) to reproduce the issue?