EducationalTestingService / rstfinder

Fast Discourse Parser to find latent Rhetorical STructure (RST) in text.
MIT License
121 stars 24 forks source link

reduce directions seem redundant with nuclearity #15

Closed mheilman closed 10 years ago

mheilman commented 10 years ago

The current parser separates reduce direction and nuclearity (e.g., there can be a L:nucleus:span and R:nucleus:span). It would be better to just have one type of binary reduce action per relation type (e.g., just B:nucleus:span) and then infer the head of the resulting subtree from the nuclearity and position of its children. This would reduce the size of the model by a large amount. About 30% of the parser actions (i.e., model classes) are redundant in this way.

mheilman commented 10 years ago

dece1d827fcc6a3b9a67dec2d8904e4a6cc6fb08 starts to address this. I still need to test it more before merging it in with the develop branch.