taoyds / spider

scripts and baselines for Spider: Yale complex and cross-domain semantic parsing and text-to-SQL challenge
https://yale-lily.github.io/spider
Apache License 2.0
848 stars 193 forks source link

preprocessor returns different data than released #16

Closed nweir127 closed 5 years ago

nweir127 commented 5 years ago

I ran the preprocess_train_dev_data.py script as directed in the README, but got a training set that is incompatible with the training pipeline. It is missing gt_col fields for every item and has string labels instead of numbers. Is this a bug on my end or is the preprocessing script different than the one that produced the generated datasets?

op_dataset.zip

nweir127 commented 5 years ago

nevermind, resolved