salesforce / WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.
BSD 3-Clause "New" or "Revised" License
1.62k stars 322 forks source link

Phase 1 vs. phase 2 #43

Closed aagohary closed 5 years ago

aagohary commented 5 years ago

Hi, I am confused by phase 1 and phase 2 annotations in the dataset files. The paper says phase 1 is a paraphrasing phase while phase 2 is a verification phase. As far as I understand, phase 2 is just about discarding wrong paraphrases. So what do you mean by a given example was collected in phase 1 vs. phase 2? Thanks,

--Ahmed

vzhong commented 5 years ago

This is rather unfortunate naming but the phase number in the actual dataset indicates which batch the data was collected in. It is different than the phase in the paper.