GeorgeVern / lmcor

Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"

LLM-generated candidates #3

Open PCguai opened 1 month ago

PCguai commented 1 month ago

Hi, I am a beginner in NLP and I would like to ask: should I download the dataset myself (e.g., XSum), run the big model on it myself to get the outputs, and put them into the train_[llm_name] and validation_[llm_name] files? What should the format be?

GeorgeVern commented 2 weeks ago

Hi,

thank you for your interest in our work, and apologies for the late response. Yes, indeed, you should sample the predictions from the big model yourself and then place them in the files data/xsum/train_[llm_name] and data/xsum/validation_[llm_name]. You can check this code for reference on sampling from an LLM. In our approach, we generated two types of outputs from the LLM: one using greedy decoding and another using sampling. The corresponding filenames are indicated by FILE_SAMPLE and FILE_GREEDY in the code, which should give you an idea of how to structure your output files. In each case, we wrote each prediction on a separate line.
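To make the format concrete, here is a minimal, hypothetical sketch using Hugging Face `transformers`: it produces one greedy and one nucleus-sampled candidate per XSum example and writes each prediction on its own line, one file per decoding strategy. The model name (`gpt2`), prompt template, and output filenames are placeholders, not the repo's actual setup; match the filenames to whatever FILE_GREEDY and FILE_SAMPLE expect in the repo's code.

```python
from pathlib import Path

import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"          # placeholder; substitute the LLM you are sampling from
OUT_DIR = Path("data/xsum")  # where the repo expects the candidate files
OUT_DIR.mkdir(parents=True, exist_ok=True)

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()

# Depending on your `datasets` version, this id may need trust_remote_code=True.
dataset = load_dataset("EdinburghNLP/xsum", split="validation")


def generate(prompt: str, sample: bool) -> str:
    """Return one candidate summary: greedy if sample=False, nucleus-sampled otherwise."""
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=768)
    gen_kwargs = dict(max_new_tokens=64, pad_token_id=tokenizer.eos_token_id)
    if sample:
        gen_kwargs.update(do_sample=True, top_p=0.95)
    else:
        gen_kwargs.update(do_sample=False)
    with torch.no_grad():
        output_ids = model.generate(**inputs, **gen_kwargs)
    # Keep only the newly generated tokens, not the echoed prompt, and
    # collapse whitespace so each prediction stays on a single line.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    text = tokenizer.decode(new_tokens, skip_special_tokens=True)
    return " ".join(text.split())


# One prediction per line, one file per decoding strategy (the filenames
# below are illustrative; align them with FILE_GREEDY / FILE_SAMPLE).
with open(OUT_DIR / "validation_gpt2_greedy", "w") as f_greedy, \
        open(OUT_DIR / "validation_gpt2_sample", "w") as f_sample:
    for example in dataset.select(range(10)):  # small slice for illustration
        prompt = f"Summarize the article in one sentence.\nArticle: {example['document']}\nSummary:"
        f_greedy.write(generate(prompt, sample=False) + "\n")
        f_sample.write(generate(prompt, sample=True) + "\n")
```

Since the files are read line by line, make sure every prediction is flattened to a single line (as the whitespace-collapsing step above does) and that the i-th line of each file corresponds to the i-th example of the split.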