Closed dakinggg closed 4 years ago
Thanks..looks good so far.
Yes, something broke for that the Alice in Wonderland test. It seems there is issues with a newline character offsets for 2nd index in expected list
I haven't looked at the cleaner code, but it seems like something about how ❦ is used is wrong when it follows another punctuation. I think something also might still be broken for char_span=True when there are double new lines. but I'll leave it to you whether those things need fixing before/after merging this pr
@danielkingai2 Thanks for your contribution😃
Opted for another approach. You can have a look at PR #63
Had a go at fixing the offsets, take a look and see what you think. I built on your WIP branch so this has those commits as well. All the tests pass (although I am wondering why you xfailed the Alice in Wonderland test in your second commit? Is something broken there?)