Closed pasmargo closed 6 years ago
Hi! As C&C parser does not support N best parsing, I do not know how the output xml should look like when using N-best output and C&C xml format. Do you have any suggestion?
Sorry to interrupt.
@pasmargo I suspect you are now incorporating depccg into ccg2lambda. Sorry for my long silence after saying I will incorporate various CCG parsers into Jigg. Actually, the latest Jigg already supports them, but without k-best mode, which I've not tried on depccg yet. It also doesn't support k-best mode for easyccg for some technical reasons (I think it is possible, though).
Maybe the output format would be changed after the discussion on this and other threads. If the format is decided I want to soon implement k-best mode on Jigg for depccg, as well as easyccg.
Do you have any suggestion?
I don't know of a good solution unfortunately. One possibility is to annotate the <ccg>
node with the sentence ID. E.g. <ccg ID=1>
where the ID would be the equivalent to EasyCCG' ID.
@hiroshinoji Thank you very much for your consideration! I think Jigg is a great solution for the integration of several CCG parsers and I am looking forward to seeing it implemented with n-best support. At the moment I am only drafting some temporary solutions until Jigg is ready. This is not so urgent for me at the moment so please do not work too hard on it! Thank you!!
OK, I adopt <ccg sentence=1 id=1>
,
where the first one is sentence id and the latter is the number of N best parses.
I am willing to change it if there is some standard format
but I close this as this does not cause the assertion error anymore.
When I request N-best parsing and XML output, I get an assertion error:
For 1-best parsing:
For 2-best parsing: