EducationalTestingService / rstfinder

Fast Discourse Parser to find latent Rhetorical STructure (RST) in text.
MIT License
121 stars 24 forks source link

How to get the corpora? #49

Closed zhongluwang closed 6 years ago

zhongluwang commented 6 years ago

How to get ~/corpora/rst_discourse_treebank?

aoifecahill commented 6 years ago

It's available through the LDC: https://catalog.ldc.upenn.edu/ldc2002t07

Syauri commented 4 years ago

This link does not work anymore.

desilinguist commented 4 years ago

Looks like the ID just needs to be uppercase and then it works fine: https://catalog.ldc.upenn.edu/LDC2002T07

kaushal18 commented 4 years ago

Do I need to pay to download the corpora?

desilinguist commented 4 years ago

You or the organization you belong to (university, employer) need to be a member of the LDC to get the corpus. See more details here.