Closed wginsberg closed 4 months ago
hey Will, ya, you're right. we have a list of hard-coded abbreviations, and no way to augment it on the fly right now, and that's a lousy solution. Happy to take a look at this. A proper solution may require a breaking change. A quicker one may just be adding 'ste' to the list. cheers
hey Will, I've added ste
and a known abbreviation, and am putting augmenting this list as a table-stakes feature for v11.
you should see the change in 10.3.2
.
cheers
First - thanks for this project!
I noticed that there is an issue parsing the sentences on the Wikivoyage page for Sault Sainte Marie (Ontario)
Using the following code:
I am getting the result
Obviously looks like the period is being interpreted as the end of the sentence, but i do not have the same problem with other pages. E.g. St. Clairsville (Ohio) comes back as expected:
This is the only instance of the issue I have found so far, so I guess it is not a huge deal. Though I wonder if it would be a nice feature to have a set of strings to pass to
sentences()
that would prevent breaking?I.e. calling
Would guarantee that no sentence would break across that string? Just a thought.