Closed JohnGiorgi closed 1 year ago
@JohnGiorgi thanks for your interest! I will add a running example in 1-2 days :))
Awesome! Thanks a lot
@JohnGiorgi,
I have added an example in a way as in BARTScore. The code supports 6 discourse metrics, including DiscoScore. The details of these metrics are provided in Appendix A.1 in the paper.
Note that if system and reference texts do not contain coherence phenomena (e.g., no word repetition), then the discourse metrics would return 0.
Awesome! Thank you @andyweizhao. A couple of questions if that's okay!
model_name="bert-base-uncased"
the currently recommended pre-trained model to use to get best results?disco_scorer.DS_SENT_NN(s, refs)
is different (but valid) from disco_scorer.DS_SENT_NN(refs, s)
disco_scorer.DS_SENT_NN(s, source_doc)
. It looks like the source document text is retrieved in your code but never used.@JohnGiorgi
Hi!
Is there an example of how to use this metric for evaluation? I would be interested in instantiating some objects and computing the DisoScore score between a reference and generated summary. Ideally a usage something like BERTScore or BARTScore