Open KevinDanikowski opened 1 year ago
This result is expected.
first "
is missing and sentence-splitter can not parse it correctly.
natural language does not have parse error. This makes it difficult to correct implicit errors.
It is possible to issue a warning if one of the pairs is missing, but the use case is difficult.
Hey @azu , do you know of a potentially recommended fill solution to add the quote in this scenario? Otherwise, I'm thinking to just remove quotes if it appears the text was not split properly, and retry.
Describe the bug The sentence with obvious splits doesn't get split, it appears to be due to a missing first double quote at the beginning.
Text
Actual Result
Code:
Expected Result (too long to share) - split into sentences.
Additional context It's missing a double. quote in the beginning, but this shouldn't stop the sentences from being split.