xlang-ai / Spider2

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
https://spider2-sql.github.io
Apache License 2.0
157 stars 14 forks source link

Where is the full dataset? #1

Open moore269 opened 2 months ago

moore269 commented 2 months ago

I see there are 178 examples in this file https://github.com/xlang-ai/Spider2/blob/main/spider2/examples/spider2.jsonl

However, the paper says there are 600 examples? Where are the rest? Also, is it possible to have the correctly labeled sql as another field in the jsonl as well?

Lastly, is there a place where I can easily download all the referenced tables?

lfy79001 commented 2 months ago

Hi,

We are working on 600 examples, and we have currently only released part of the data. It is expected to take another week.

Some of the tables are on the cloud, please refer to Bigquery Guideline, so you don’t need to download them. There is another portion of the tables that need to be downloaded, which you can access via this link

sethsiddharth commented 1 month ago

Hello @lfy79001! It's been about 3 weeks since the last update on the full dataset release. Any news on the progress?

Thank you for your work on this project and for keeping the community informed.

lfy79001 commented 1 month ago

Thank you for your interest in Spider 2.0. We have been busy with paper writing and data validation. In about 10 days, we will release the paper and all the data.

sethsiddharth commented 1 month ago

Thank you for the update. Looking forward to the release!