ubsicap / usfm

Unified Standard Format Markers
37 stars 18 forks source link

USFM test documents #119

Open grahame opened 3 years ago

grahame commented 3 years ago

I'm working on an implementation of USFM for Python. My implementation reads in the Paratext stylesheets so hopefully it will be easy to update it as USFM is developed and new markers are added.

I've found a few areas of ambiguity, mostly in trying to understand how paragraph markers interact with character environments – does a new paragraph marker implicitly close all character environments?

Beyond that specific case, I'm wondering if there are any test documents which exercise implementations? I'm thinking things such as the interaction of the different marker types, nested character markers, the interaction of character markers and footnotes, and whitespace handling.

Thanks for any help.

mhosken commented 2 years ago

BTW ptxprint has a python USFM parser driven by stylesheets, in case you are interested. Thanks to Tim Eves for writing it.