Open forkliu0826 opened 10 months ago
I'm sorry, I misunderstood some information. In this work we used four 3090 GPUs; one training epoch took about 16 hours on the CNNDM dataset and about 10 hours on the XSum dataset.
Hello author, I see that the default number of epochs is 100. Does training really take that long?
In fact, it's not necessary to train for that many epochs; a few (< 10) are enough.
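For a rough sense of total cost, the per-epoch times quoted in this thread can be combined with the epoch count. This is a minimal sketch, assuming 10 epochs as an upper bound (the reply says fewer than 10) and assuming the quoted per-epoch times are wall-clock times with all four GPUs in use. The function name is hypothetical and not part of the repository.

```python
def estimate_training_cost(hours_per_epoch, epochs, num_gpus):
    """Return (wall-clock hours, GPU-hours) for a training run."""
    wall_clock_hours = hours_per_epoch * epochs
    gpu_hours = wall_clock_hours * num_gpus
    return wall_clock_hours, gpu_hours


# Per-epoch times as quoted in this thread (four 3090 GPUs).
HOURS_PER_EPOCH = {"CNNDM": 16, "XSum": 10}
MAX_EPOCHS = 10  # assumption: upper bound, since "< 10" epochs are needed
NUM_GPUS = 4

for dataset, hours in HOURS_PER_EPOCH.items():
    wall, gpu = estimate_training_cost(hours, MAX_EPOCHS, NUM_GPUS)
    print(f"{dataset}: under {wall} h wall-clock, under {gpu} GPU-hours")
```

Under these assumptions, a full run stays below roughly 160 hours on CNNDM and 100 hours on XSum, far less than the default of 100 epochs would imply.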
Sorry, author, I have one more question. Which option did you run to produce the test results in your experiments: do_reranking, do_generation, or something else?
do_ranking is used for the experiment that selects candidate summaries, while do_generation is what we use to evaluate our approach. The author is currently occupied with other tasks; the code will be organized further and the README.md updated later.
Hello author, I am very interested in your article. I see that it does not report computing resource requirements or running time. Could you share that information?