Open dennlinger opened 2 years ago
Combine the different approaches into a single benchmark, similar to GLUE. Also investigate whether evaluation can be extended beyond ROUGE.
Possibly a duplicate of #35?