duckdb / duckdb_azure

Azure extension for DuckDB
MIT License

Checksum error when querying a .db file on Azure Blob Storage with the azure extension and DuckDB threads set to more than 1 #87

Closed rupeshtej closed 4 weeks ago

rupeshtej commented 1 month ago

What happens?

I get a checksum error when querying a .db file stored on Azure Blob Storage through the azure extension with DuckDB threads set to more than 1.

The query works fine when DuckDB threads is set to 1, but fails with the following error when threads is set to more than 1:

Error in executing query: IO Error: Corrupt database file: computed checksum 10704450356758579597 does not match stored checksum 8071570260309647359 in block at location 2371584

I previously had a problem with ATTACH. That issue was fixed and is tracked here: https://github.com/duckdb/duckdb_azure/issues/70

The remaining problem is with query execution when threads is set to more than 1.

To Reproduce

Start the DuckDB CLI and run the following commands:

INSTALL azure; LOAD azure;

CREATE SECRET secret ( TYPE AZURE, CONNECTION_STRING '***' );

SET threads = 4;

Attach the .db file: ATTACH 'az://samplecontainer/sample_test.db' AS sample_test (READ_ONLY);

Then run a sample query: SELECT column_1 FROM sample_test.sample_test;
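
For convenience, the steps above can be collected into a single script. This is only a sketch that restates the commands from this report: the connection string stays redacted, and the container, file, table, and column names are the ones used above. The comments describe the behavior as reported here, not a confirmed diagnosis.

```sql
-- Install and load the Azure extension.
INSTALL azure;
LOAD azure;

-- Credentials for the storage account (connection string redacted).
CREATE SECRET secret (TYPE AZURE, CONNECTION_STRING '***');

-- Reported to fail with more than 1 thread; with SET threads = 1 the query succeeds.
SET threads = 4;

-- Attach the remote database file in read-only mode.
ATTACH 'az://samplecontainer/sample_test.db' AS sample_test (READ_ONLY);

-- Reported result: "IO Error: Corrupt database file: computed checksum ... does not match stored checksum ..."
SELECT column_1 FROM sample_test.sample_test;
```

Changing only the `SET threads` line to 1 and rerunning the same script should isolate the thread count as the variable.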

OS:

Linux

DuckDB Version:

1.1.2

DuckDB Client:

CLI

Hardware:

No response

Full Name:

Rupesh Tejapuriya

Affiliation:

NA

What is the latest build you tested with? If possible, we recommend testing with the latest nightly build.

I have tested with a nightly build

Did you include all relevant data sets for reproducing the issue?

Yes

Did you include all code required to reproduce the issue?

Did you include all relevant configuration (e.g., CPU architecture, Python version, Linux distribution) to reproduce the issue?

duckdblabs-bot commented 1 month ago

Thanks for opening this issue! Based on our automated check, it seems that your post contains some code but it does not use code blocks to format it.

Please double-check your post and revise it if necessary. To employ syntax highlighting, it's recommended to use code blocks with triple backticks, e.g.:

```sql
SELECT ...
```

If this is a false positive, feel free to disregard this comment.