Improve splitting behaviour by only splitting on newlines, carriage returns, Unicode spaces, and Unicode "other" punctuation followed by spaces
If the selection contains any open/close bracket or initial/final quote punctuation, assume it is too complicated to sensibly split (for now) and skip the splitting phase
Use full Unicode category names in the regexps for maintainability
Fix #153