xlang-ai / Spider2

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
https://spider2-sql.github.io
Apache License 2.0

Where is the full dataset? #1

Open moore269 opened 2 weeks ago

moore269 commented 2 weeks ago

I see there are 178 examples in this file https://github.com/xlang-ai/Spider2/blob/main/spider2/examples/spider2.jsonl

However, the paper says there are 600 examples. Where are the rest? Also, would it be possible to include the correctly labeled SQL as another field in the JSONL?

Lastly, is there a place where I can easily download all the referenced tables?
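For anyone who wants to reproduce the example count above on a local copy of the file, here is a minimal sketch. It assumes the file is standard JSON Lines (one JSON object per line); the helper names `count_jsonl_records` and `list_jsonl_fields` are hypothetical and not part of the Spider2 repository.

```python
import json
from pathlib import Path


def count_jsonl_records(path):
    """Count the non-empty lines in a JSONL file, parsing each as JSON."""
    count = 0
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            json.loads(line)  # raises ValueError if a record is malformed
            count += 1
    return count


def list_jsonl_fields(path):
    """Return the sorted set of top-level keys seen across all records."""
    keys = set()
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            record = json.loads(line)
            if isinstance(record, dict):
                keys.update(record.keys())
    return sorted(keys)
```

Calling `count_jsonl_records("spider2.jsonl")` on a downloaded copy gives the number of released examples, and `list_jsonl_fields("spider2.jsonl")` shows whether a SQL field is present. The local path here is an assumption.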

lfy79001 commented 2 weeks ago

Hi,

We are still working on the full set of 600 examples and have released only part of the data so far. The rest is expected to take about another week.

Some of the tables are hosted in the cloud, so you don't need to download them; please refer to the BigQuery Guideline. The remaining tables do need to be downloaded, which you can do via this link.