lcswillems closed this issue 5 years ago
Hmm... what is the motivation for not stripping the whitespace?
Unfortunately, the root cause of this problem is in the CoreNLPServer. Text sent to the server is trimmed out of an abundance of caution, to remove any whitespace added through the URL encoding/decoding process. It should be OK to simply not do this, and the easiest way to do so would be through a local checkout of CoreNLP.
Because there is no reason why CoreNLP would strip it... If I want to strip it, I do it. I may want to give it some text that is not stripped.
As @arunchaganty notes, the trimming was happening in the Java CoreNLPServer. But it seems to me that there is no good reason why we were doing this, and it is in principle wrong, as @lcswillems notes. So, I've removed it. 🙂 Whenever we next make a CoreNLP release (probably end of (northern) summer), or if you cherry-pick CoreNLP commit 082ed17b04bdb7ed1ae613916d713942f7c24dfb, this should be fixed.
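For anyone stuck on a server build that still trims, a client-side workaround may be enough in the meantime. The sketch below is only an illustration, not part of CoreNLP or its Python client: `leading_trim_len` and `restore_offsets` are hypothetical helper names, and it assumes the server trims with Java `String.trim()` semantics (removing leading and trailing characters at or below U+0020) and that tokens arrive as dicts carrying the `characterOffsetBegin` / `characterOffsetEnd` keys from the JSON output. The token list here is hand-written rather than real server output.

```python
def leading_trim_len(text: str) -> int:
    """Count leading characters that Java's String.trim() would remove.

    Assumption: the server trims with String.trim(), which drops
    characters whose code point is <= U+0020 from both ends.
    """
    n = 0
    while n < len(text) and ord(text[n]) <= 0x20:
        n += 1
    return n


def restore_offsets(text: str, tokens: list) -> list:
    """Return copies of the token dicts with character offsets shifted
    so that they index into the original, untrimmed text."""
    shift = leading_trim_len(text)
    return [
        {
            **tok,
            "characterOffsetBegin": tok["characterOffsetBegin"] + shift,
            "characterOffsetEnd": tok["characterOffsetEnd"] + shift,
        }
        for tok in tokens
    ]


original = "  hi "
# Hand-written stand-in for what a trimming server would report for "hi".
tokens = [{"word": "hi", "characterOffsetBegin": 0, "characterOffsetEnd": 2}]

fixed = restore_offsets(original, tokens)
begin = fixed[0]["characterOffsetBegin"]
end = fixed[0]["characterOffsetEnd"]
print(repr(original[begin:end]))
```

This only realigns offsets with the original string; the text stored in the returned document would still be the trimmed version, so it is a stopgap rather than a substitute for the server-side fix.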
That's great!! Thanks a lot! :)
Hi,
If I do:
I get:
The text is "hi" instead of "hi" with its surrounding whitespace intact. How is it possible to make CoreNLP stop stripping the input text?