Open raivisdejus opened 1 year ago
Adding another replacement option, that can process regexs. This can be used to split longer sentences into smaller chunks.
In my tests for Latvian, this can yield ~25% more sentences in the final output.
Thanks for submitting this! I will have a closer look at this PR tomorrow.
Adding another replacement option, that can process regexs. This can be used to split longer sentences into smaller chunks.
In my tests for Latvian, this can yield ~25% more sentences in the final output.