usfm-bible / tcdocs

Technical Committee Documents

Regex in diagrams and Glossary #46

Closed KentSpiel closed 6 months ago

KentSpiel commented 1 year ago

We have agreed that we want to minimize the amount of regex in diagrams by using Terms as defined in the Glossary of Terms. For example, the white space Term could be defined with the simplified class

[\t\n\v\f\r\p{Zs}] 

instead of

[\u0009-\u000D\u0020\u00A0\u1680\u2000-\u200B\u2028\u2029\u202F\u205F\u3000]

Note: \u200B is a Zero Width Space, which is a formatting character rather than a white space character, so I exclude it from the simplified version.
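To see exactly which code points the two classes disagree on, here is a minimal Python sketch. It assumes that `\p{Zs}` means Unicode general category `Zs` as reported by Python's `unicodedata` module (Python's built-in `re` does not support `\p{...}`, so the sets are built by hand). The function names are hypothetical and only for illustration.

```python
import unicodedata

def explicit_class():
    """Code points matched by the explicit \\u-escape class quoted above."""
    ranges = [
        (0x0009, 0x000D), (0x0020, 0x0020), (0x00A0, 0x00A0),
        (0x1680, 0x1680), (0x2000, 0x200B), (0x2028, 0x2029),
        (0x202F, 0x202F), (0x205F, 0x205F), (0x3000, 0x3000),
    ]
    return {cp for lo, hi in ranges for cp in range(lo, hi + 1)}

def simplified_class():
    """Code points matched by [\\t\\n\\v\\f\\r\\p{Zs}], taking \\p{Zs} to be
    general category Zs in the Unicode data bundled with this Python."""
    controls = {ord(c) for c in "\t\n\v\f\r"}
    zs = {cp for cp in range(0x110000)
          if unicodedata.category(chr(cp)) == "Zs"}
    return controls | zs

if __name__ == "__main__":
    explicit = explicit_class()
    simplified = simplified_class()
    # Code points the explicit class accepts but the simplified one does not.
    for cp in sorted(explicit - simplified):
        print(f"only in explicit:   U+{cp:04X} {unicodedata.category(chr(cp))}")
    # Code points the simplified class accepts but the explicit one does not.
    for cp in sorted(simplified - explicit):
        print(f"only in simplified: U+{cp:04X} {unicodedata.category(chr(cp))}")
```

With the Unicode data shipped in recent Python 3 releases, the only differences this reports are U+200B (category Cf) plus U+2028 and U+2029 (categories Zl and Zp). So the simplified class drops the line and paragraph separators as well as the Zero Width Space, which may or may not be intended.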

@mhosken says there is a reason we cannot make chapter and verse numbers a Term. However, I would still like to find a way to simplify the definition of the constraints on chapter and verse (C:V) numbers so that we do not need regex in the diagrams.