Andrew03 / transformer-abstractive-summarization

Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"
71 stars 14 forks source link

Model weights after Adaption #1

Open orionw opened 5 years ago

orionw commented 5 years ago

This repo is super great - I hope all goes well with the paper submission!

I was looking to take a model like this and play with it but doing the Adaptive training myself (although you have good instructions) would require 20 epochs of training, if I read the paper correctly. This wouldn't be too bad except that each epoch takes over 12 hours on the best GPUs I have access to. I could go through all the work to get the results but it would be expensive and I'd prefer not to.

Is there anyway you could upload the weights you used for the summarization tasks? If that's not feasible I would understand but I thought I would ask.

Thanks!

atcbosselut commented 5 years ago

Hello,

That should be possible. Is there a particular dataset you're looking for the adapted weights from?

orionw commented 5 years ago

Thank you for the response!

The newsroom weights would be awesome if that would be possible.

peter-xbs commented 5 years ago

Hello,

That should be possible. Is there a particular dataset you're looking for the adapted weights from?

Could you also provide the can_dm weights?

CreateRandom commented 4 years ago

Hi,

I'd also be interested in obtaining the weights for cnn_dm. If possible and still available, could you also provide the generated summaries for validation and test set?

vprelovac commented 4 years ago

@atcbosselut Are you going to release pre-trained weights for Newsroom? Would make evaluating the model much easier! Thanks!