salesforce / ctrl-sum

Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper
https://arxiv.org/abs/2012.04281
BSD 3-Clause "New" or "Revised" License
146 stars 24 forks source link

package for CTRLsum (easy to use) #5

Closed hyunwoongko closed 3 years ago

hyunwoongko commented 3 years ago

Thanks for cool research. The results of this research were quite appealing to me, so I made the package to make it easier to use. The package is called summarizers. I ported the model you provided to Hugging Face Transformers, and I made it for use with codes 2 to 3 lines. If anyone wants to try this model more easily, it would be good to use.

https://github.com/hyunwoongko/summarizers

Thanks for great research again.

jxhe commented 3 years ago

Thank you very much! I added references to your repo as well as the huggingface interface into README.

hyunwoongko commented 3 years ago

I ported the following name on the Hugging Face.

Thanks a lot!

jxhe commented 3 years ago

Thank you! I have a question for the unconditional summarization example: the unconditional summaries are generated with empty keywords/prompts in the package, right? I am asking just for clarification since it is different from the paper which uses automatic keywords, while I think it is still good and simple without an additional tagger.

hyunwoongko commented 3 years ago

Yeah. because It is just first draft version. I will add BERT tagger later (next version) Thanks :)

jxhe commented 2 years ago

Hi, the pretrained model checkpoints are already avaiable at huggingface, but we don't have a training script to train CTRLsum using Huggingface.

However, you can train CTRLsum with huggingface easily -- just run our preprocessing script to prepare the data, then run huggingface's summarization example script here following our training hyperparameters.