nyu-mll / spinn

NYU ML² work on sentence encoding with tree structure and dynamic graphs
MIT License
107 stars 38 forks source link

Scripts for collapsing/binarizing ptb trees #99

Closed bo-son closed 4 years ago

bo-son commented 4 years ago

Hi, could you make public the scripts for unary-collapsing and binarizing the original PTB trees? I want to make sure that I use the same trees as you did. Thanks a lot!

sleepinyourhat commented 4 years ago

I don't really know Java, but I cobbled this script together a while back, and it contains the core operations that were used for both SNLI and MultiNLI. You'll probably have to re-do the IO to fit whatever data format you're working with, but otherwise, you should just be able to run this in a directory that contains the 04/20/15 release of Stanford CoreNLP.

ParseImageFlickrOneCol.java.zip

bo-son commented 4 years ago

Thank you very much! Helped me a lot.