Model > Seq2SQL : DNN for translating Natural Language Questions to SQL Queries
Dataset > WikiSQL : 80654 questions, SQL queries corpus from wikipedia.
aggregation operator classifier
Compute attention score over aggregation operations (COUNT, MIN, MAX, ...) for input tokens.
SELECT column pointer
Produce the representations for the columns using LSTM.
Compute attention score for each column and make a distribution over possible columns.
1,2 can be trained using cross entropy loss.
WHERE clause generator
Apply reinforcement learning
Instead of teacher forcing at each step of query generation, sample from the output distribution to obtain the next token.
Execute the generated SQL query against the database to obtain a reward. (compute by the result)
Model > Seq2SQL : DNN for translating Natural Language Questions to SQL Queries Dataset > WikiSQL : 80654 questions, SQL queries corpus from wikipedia.
aggregation operator classifier Compute attention score over aggregation operations (COUNT, MIN, MAX, ...) for input tokens.
SELECT column pointer Produce the representations for the columns using LSTM. Compute attention score for each column and make a distribution over possible columns. 1,2 can be trained using cross entropy loss.
WHERE clause generator Apply reinforcement learning
Instead of teacher forcing at each step of query generation, sample from the output distribution to obtain the next token.
Execute the generated SQL query against the database to obtain a reward. (compute by the result)