Open mufeili opened 3 years ago
+1, please help
+1, please help
I found that the parser input looks like the output of this project, https://github.com/jiyfeng/DPLP (after running "python segmenter ./data"); you need to add one column with the paragraph id.
@mufeili and @chenzhutian Thank you for your interest. I'm the first author of this paper.
@wangwang110 Thank you for answering the question. As you said, we need a paragraph index to parse a raw document.
@mufeili and @chenzhutian If you can obtain a paragraph index and attach it to the DPLP format, you can parse a raw document. This input format is the same as the one used by the Two-Stage Parser.
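For anyone attempting this, here is a minimal sketch of attaching a paragraph index to DPLP-style output. It is not code from this repository. It assumes the segmenter output is tab-separated with one token per row, that the sentence index is in the first column, and that the paragraph id belongs in a new last column; please check these assumptions against your own files. The function name `add_paragraph_column` and the `sentence_to_paragraph` mapping are hypothetical, and you would have to build the mapping yourself from the raw document's paragraph breaks.

```python
from typing import Dict, Iterable, List


def add_paragraph_column(
    rows: Iterable[str],
    sentence_to_paragraph: Dict[int, int],
    sentence_col: int = 0,  # assumption: sentence index is the first column
) -> List[str]:
    """Append a paragraph id as a new last column to each token row."""
    out: List[str] = []
    for row in rows:
        line = row.rstrip("\n")
        if not line.strip():
            # Keep blank separator lines untouched.
            out.append(line)
            continue
        fields = line.split("\t")
        sent_id = int(fields[sentence_col])
        # Look up the paragraph this sentence belongs to and append it.
        fields.append(str(sentence_to_paragraph[sent_id]))
        out.append("\t".join(fields))
    return out


if __name__ == "__main__":
    # Toy rows, not real DPLP output: sentence index, token index, token, EDU index.
    example = ["0\t1\tHello\t1", "", "1\t1\tWorld\t2"]
    for line in add_paragraph_column(example, {0: 1, 1: 2}):
        print(line)
```

A sentence index that is missing from the mapping raises a `KeyError`, which seems safer than silently assigning a default paragraph.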
Thank you for your reply and the great work!
Hello, I was wondering whether there is a limit on the length of the raw document.
I guess I can get the EDUs of a raw document using a trained model from rstfinder. Are we supposed to use
python src/main.py parse
for parsing a new raw document? If so, can you provide an example? What kind of input data file should I prepare?