shijx12 / KQAPro_Baselines

Pytorch implementation of baseline models of KQA Pro, a large-scale dataset of complex question answering over knowledge base.
http://thukeg.gitee.io/kqa-pro/
MIT License
127 stars 22 forks source link
bart dataset kbqa seq2seq sparql

KQA Pro Baselines

KQA Pro is a large-scale dataset of complex question answering over knowledge base, which provides strong supervision of SPARQL and program for each question. Here is its homepage website. This dataset is licensed under a Creative Commons Attribution-ShareAlike 4.0 International.

This repo implements several baselines for the dataset:

Instructions of how to run these models are described in their README files. Before trying them, you need to first download the dataset and unzip it into the folder ./dataset. The file tree should be like

.
+-- dataset
|   +-- kb.json
|   +-- train.json
|   +-- val.json
|   +-- test.json
+-- GRU
|   +-- preprocess.py
|   +-- train.py
|   +-- ...
+-- KVMemNN
+-- RGCN
...