Open lxx909546478 opened 2 years ago
Good questions.
Good questions.
- I think the main difference between BRIO and the abstractive version of COLO may lie in the loss function. The loss function in BIRO is a reinforcement learning loss which is used to learn the order. While COLO also follows a contrastive learning paradigm. And BRIO may not suitable for extractive methods.
- COLO has already used the predicted score(modeled by BCEloss) to clip the sentence number of the document to K sentences. Because most summaries in CNNDM has 2~3 sentences, and then we further get the Candidates size by C(K, 2) + C(K,3)
Thanks for your reply.
Thanks a lot! COLO is an inspiring work. I would try in other experiment settings.
Great work! And there were a few points I wanted to know after reading.
Looking forward to your reply.