Open danielweck opened 1 month ago
A good test for large sections of text (which would normally result in far-too-long speech utterances, and therefore benefit from sentence detection) is Georgia: https://idpf.github.io/epub3-samples/30/samples.html#georgia
https://www.npmjs.com/package/sentence-splitter
https://github.com/textlint-rule/sentence-splitter/issues/28#issuecomment-2110632032
Edge cases to test: poetry, quotation marks and punctuation that make it hard to determine boundaries. Example: Alice in Wonderland (there are several editions, i think this one is useful for testing https://www.gutenberg.org/ebooks/28885 )