Closed anton164 closed 2 years ago
In the paper you say:
For training the CMLM, we use both XSUM and the CNN/Dailymail dataset.
Do you train the CMLM separately for each dataset depending on the downstream task, or do you train one CMLM using both datasets?
Hi Anton, we train the CMLM separately for each dataset.
Thanks @mcao516 !
In the paper you say:
Do you train the CMLM separately for each dataset depending on the downstream task, or do you train one CMLM using both datasets?