microsoft / ContextualSP

Multiple paper open-source codes of the Microsoft Research Asia DKI group
MIT License
374 stars 62 forks source link

LogiGAN: dataset creation #36

Closed Golovneva closed 2 years ago

Golovneva commented 2 years ago

Hi! Thank you for sharing the code for LogiGAN paper. I'm having troubles creating training set. In particular:

  1. Here code refers to non-exiting script. I have replaced commands with "python corpus_construction.py --start 0 --end 500 --indicator_type conclusion &" - is it the right way to do?
  2. elastic_search/build_gen_train/ver_train refers to files that do not exist in the bookcorpus, and there are no instructions how to create them. Is there a script/link to generate gan_corpus_new/beta/gen_train_B.jsonl and gan_corpus_new/beta/ver_train.jsonl files?
SivilTaram commented 2 years ago

Hi Wanjun, can you help answer the question? @zhongwanjun

zhongwanjun commented 2 years ago

Hi. Thanks. I will upload the created data files, and model checkpoints ASAP. Please stay tuned.

zhongwanjun commented 2 years ago

Hi, I have already uploaded the data and code in the released data in this repo.

Golovneva commented 2 years ago

Can you please point me where I can find it? If you are talking about "ContextualSP/logigan/" than that's precisely what my original question is about - there is no data, and scripts to generate data are not runnable

SivilTaram commented 2 years ago

@Golovneva You can find it here.

Golovneva commented 2 years ago

Thanks a lot!