ubermichael / isetools

Tools for parsing data for the Internet Shakespeare Editions
GNU General Public License v2.0
2 stars 3 forks source link

ellipsis #20

Closed telic closed 8 years ago

telic commented 9 years ago

I stumbled accross a ticket in Trac (#1051) that would be more appropriate here.

Unicode ellipses, three consecutive periods, and three periods separated by spaces should all be replaced by the texts toolchain with an element (<ellipsis>?) containing a single unicode ellipsis character so they can be rendered identically.

I'm not convinced that this is something that belongs in code rather than editorial guidelines though.

If this is done in code, I don't see any reason to create a new tag for them. Just replace all ellipsis-like stuff with a real unicode ellipsis (…).

ubermichael commented 8 years ago

Refactor the code to do this from RTFWriter to a transformer.