Open kastein opened 2 years ago
@TheresaSchmidt, I've just had a meeting with @kastein about this and we figured out that the problem could be solved by adding the function that you wrote for splitting the input data for the tagger (in case there are more recipes in a single input file) by adding white lines also for the parser. Could you also document the arguments that can be used (e.g., --single-sentences)?
This isn't as urgent as other matters (since we usually parse single recipes and not different ones at a time), but we may forget about it in the future. Thank you in advance!
When converting the json output files of the tagger or parser to conllu using read_prediction.py, then different recipes are not separated by an empty line and the IDs in the first column do not restart at 1 when a new recipe starts. When converting the output of the tagger the format looked correct when I added the argument --single-sentences in addition to the arguments listed in the main Readme but this option did not work for converting the parser output.