JinheonBaek / KALMV

Official Code Repository for Knowledge-Augmented Language Model Verification (EMNLP 2023)

questions about the KGQA datasets #5

Closed: ROSC263 closed this issue 6 months ago

ROSC263 commented 6 months ago

Hi, JinheonBaek,

Thanks for your great work! I would like to ask how the KGQA datasets were generated. If I want to work with the WebQSP and Mintaka QA datasets based on Freebase, how can I generate datasets with the same structure as the Wikidata-based ones?

JinheonBaek commented 6 months ago

Thank you for your interest!

You can first download the full Freebase dump (Freebase itself is now deprecated) and then match the entities in the dataset with the nodes in the Freebase knowledge graph. I no longer have the code for preprocessing the Freebase version of WebQSP, but I hope you can reproduce it easily.
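A minimal sketch of that matching step, for illustration only; it is not the original preprocessing code, which is no longer available. It assumes the dump is a gzipped, tab-separated RDF file whose entity URIs begin with `http://rdf.freebase.com/ns/`, and that the dataset already provides Freebase MIDs (such as `m.02mjmr`) for its question entities. The function names and the output layout (entity ID mapped to a list of `(relation, object)` pairs) are hypothetical and would still need to be adapted to the structure of the Wikidata-based files in this repository.

```python
import gzip
from collections import defaultdict

# Assumed namespace prefix of entity and relation URIs in the Freebase RDF dump.
FB_NS = "http://rdf.freebase.com/ns/"


def strip_ns(term):
    """Turn '<http://rdf.freebase.com/ns/m.02mjmr>' into 'm.02mjmr'.

    Literals and URIs from other namespaces are returned without the
    Freebase prefix removed (literals are returned unchanged).
    """
    if term.startswith("<") and term.endswith(">"):
        term = term[1:-1]
        if term.startswith(FB_NS):
            return term[len(FB_NS):]
    return term


def parse_triple(line):
    """Parse one tab-separated triple line; return None if it is malformed."""
    parts = line.rstrip("\n").split("\t")
    if len(parts) < 3:
        return None
    return strip_ns(parts[0]), strip_ns(parts[1]), strip_ns(parts[2])


def collect_facts(lines, entity_ids):
    """Keep only the triples whose subject is one of the dataset's entities."""
    facts = defaultdict(list)
    for line in lines:
        triple = parse_triple(line)
        if triple is None:
            continue
        subj, rel, obj = triple
        if subj in entity_ids:
            facts[subj].append((rel, obj))
    return dict(facts)


def collect_facts_from_dump(path, entity_ids):
    """Stream a gzipped dump from disk so it never has to fit in memory."""
    with gzip.open(path, "rt", encoding="utf-8") as f:
        return collect_facts(f, entity_ids)
```

To use it, gather the set of entity MIDs annotated in the QA dataset, pass that set together with the path to the downloaded dump to `collect_facts_from_dump`, and then attach the returned facts to each question. Streaming the file line by line is a deliberate choice here, since the full dump is far too large to load at once.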

ROSC263 commented 6 months ago

Thank you so much!