shashiongithub / Sentence-Simplification-ACL14

State-of-the-art Supervised Sentence Simplification System from ACL 2014
BSD 3-Clause "New" or "Revised" License

First time deploy instructions #2

Open eyalrosen opened 7 years ago

eyalrosen commented 7 years ago

Hi, it would be great if you could tell me which steps I should take to run this software for the first time. I've cloned the code and I have a bunch of sentences I wish to simplify. Now what? Which commands should I run? Thanks!

shashiongithub commented 7 years ago

I have added some instructions now. You should have a look.

eyalrosen commented 7 years ago

Thank you so much! :)

NoahRothchild commented 7 years ago

Can we run the program without training it beforehand?

shashiongithub commented 7 years ago

Unfortunately, I don't have pretrained models from the paper anymore.

xylophone234 commented 7 years ago

@eyalrosen Have you got Boxer 1.00? I want to set up the experiment too, but I don't have Boxer.

shashiongithub commented 7 years ago

http://www.cl.cam.ac.uk/~sc609/candc-1.00.html

shankha117 commented 5 years ago

Has anyone got the Boxer output (XML format) of the complex sentences? Or does anyone at least know what the XML file should look like for a complex sentence?

shashiongithub commented 5 years ago

I ran boxer-1.0 with the following command.

candc-boxer/candc-1.00/bin/candc --input $1 --models /disk/gpfs/tools/candc-boxer/candc-1.00/models/boxer > $INPUT_FILE.ccg
candc-boxer/candc-1.00/bin/boxer --input $INPUT_FILE.ccg --semantics drs --output $2 --resolve true --instantiate true --format xml --elimeq true
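A minimal sketch of how these two steps might be wrapped in a reusable shell function. The names `CANDC_HOME`, `DRY_RUN`, and `run_boxer_pipeline` are hypothetical and not part of the repository, and the sketch assumes that `bin/` and `models/boxer` sit under the same install directory, which may not match every setup. Only the flags quoted in the command above are used.

```shell
# Sketch only: CANDC_HOME, DRY_RUN and run_boxer_pipeline are hypothetical names.
# Assumed layout: $CANDC_HOME/bin/{candc,boxer} and $CANDC_HOME/models/boxer.
CANDC_HOME="${CANDC_HOME:-candc-boxer/candc-1.00}"

# Usage: run_boxer_pipeline <sentences.txt> <output.xml>
# With DRY_RUN=1 the two commands are printed instead of executed.
# The body runs in a subshell so its variables do not leak into the caller.
run_boxer_pipeline() (
    input="$1"
    output="$2"
    ccg="${input}.ccg"

    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$CANDC_HOME/bin/candc --input $input --models $CANDC_HOME/models/boxer > $ccg"
        echo "$CANDC_HOME/bin/boxer --input $ccg --semantics drs --output $output --resolve true --instantiate true --format xml --elimeq true"
    else
        # Step 1: parse the sentences into CCG derivations.
        # Step 2: build DRSs from the derivations and write them as XML.
        "$CANDC_HOME/bin/candc" --input "$input" --models "$CANDC_HOME/models/boxer" > "$ccg" &&
        "$CANDC_HOME/bin/boxer" --input "$ccg" --semantics drs --output "$output" \
            --resolve true --instantiate true --format xml --elimeq true
    fi
)
```

Setting `DRY_RUN=1` and calling `run_boxer_pipeline complex.txt complex.xml` (file names are placeholders) prints the two commands, so the paths can be checked before anything is run.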

The xml output for a complex sentence:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE xdrs-output SYSTEM "xdrs.dtd">

The Baku Turkish Martyrs Memorial , located in Azerbaijan , is dedicated to the Ottoman Army soldiers killed in the Battle of Baku .
DT NNP NNP NNP NNP , VBN IN NNP , VBZ VBN TO DT NNP NNP NNS VBN IN DT NN IN NNP .
I-LOC I-ORG I-ORG I-DAT I-DAT I-DAT I-DAT
i1009 i1001 i1007 i1023 i1020 i1014 i1018 i1009 i1002 i1003 i1004 i1007 i1007 i1008 i1005 i1023 i1021 i1022 i1015 i1016 i1018 i1018 i1019 i1017 i1012 i1012 i1012 i1013
shankha117 commented 5 years ago

Thank you very much for the reply, I really appreciate it. After many failed attempts, I have just resolved the Boxer problem and now have the XML output.

FILES::
input file: snse003.txt
prolog_file: prolog.cgg
boxer_xml file: box_output

COMMANDS::
./t --input snse003.txt --output token.tok
./candc --input token.tok --models /home/shankha/Desktop/ext/candc/models/boxer --candc-printer boxer --output prolog.cgg
./boxer --input prolog.cgg --semantics drs --output box_output.xml --resolve true --instantiate true --format xml

However, it is somewhat different from the format you sent me. I have all the requirements for the hybrid simplification model except the Boxer output. Could you please tell me whether the XML file I generated (box_output) will work with the hybrid model?

shashiongithub commented 5 years ago

As mentioned, I am using Boxer-1.00. The XML parser is written for that version. If your XML is different, then you may have to update the parser to handle the new XML files.
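A rough first check, sketched here as a shell helper, is to see whether a generated file declares the same DOCTYPE as the Boxer 1.00 output quoted earlier in this thread. The helper name `has_xdrs_doctype` is hypothetical. A matching DOCTYPE does not guarantee that the repository's parser accepts the file; it only rules out an obviously different output format.

```shell
# Sketch only: has_xdrs_doctype is a hypothetical helper name.
# Succeeds (exit status 0) if the file contains the xdrs-output DOCTYPE
# declaration that appears in the Boxer 1.00 output quoted in this thread.
has_xdrs_doctype() {
    grep -q '<!DOCTYPE xdrs-output SYSTEM "xdrs.dtd">' "$1"
}
```

For example, `has_xdrs_doctype box_output.xml && echo "same DOCTYPE"` prints the message only when the declaration is present.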

Crista23 commented 5 years ago

I am using the same commands posted above to retrieve the xml output from Boxer 1.00, however I am getting errors:

bin/candc --input test.txt --models models/boxer > test.ccg
this file was generated by the following command(s): bin/candc --input /srv/disk01/ggarbace/TS/dataset/zhu_acl2010/test.txt --models models/boxer

1 parsed at B=0.075, K=20
1 coverage 100%
1 stats 10.8077 1479 2488 comb 2300 1438 0 0

bin/boxer --input test.ccg --semantics drs --output test.out --resolve true --instantiate true --format xml --elimeq true

ERROR: test.ccg:24: Syntax error: Undefined character escape in quoted atom or string: \N'
ERROR: test.ccg:27: Syntax error: Undefined character escape in quoted atom or string:('

If I escape the backslashes (replacing \N with \\N and \( with \\() and run the command again, what I get is:

ERROR: unable to preprocess derivation 1

Can you please let me know what I am doing wrong here? How can I get Boxer to work? Thank you!
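Since the Boxer errors name specific line numbers in test.ccg (24 and 27), one way to see exactly which quoted atoms are involved is to print just those lines. A minimal sketch, where `show_lines` is a hypothetical helper name and only standard `sed` and `printf` are used:

```shell
# Sketch only: show_lines is a hypothetical helper name.
# Usage: show_lines <file> <line-number>...
# Prints each requested line prefixed with "file:line: ", mirroring the
# location format used in the Boxer error messages.
# The body runs in a subshell so its variables do not leak into the caller.
show_lines() (
    file="$1"
    shift
    for n in "$@"; do
        printf '%s:%s: %s\n' "$file" "$n" "$(sed -n "${n}p" "$file")"
    done
)
```

Running `show_lines test.ccg 24 27` prints the two lines reported in the errors above, which makes it easier to see which characters follow the backslash in each quoted atom.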