ubermichael / isetools

Tools for parsing data for the Internet Shakespeare Editions
GNU General Public License v2.0

disallow RULE on a line with content #11

Closed: telic closed this issue 8 years ago

telic commented 9 years ago

RULE should only be allowed on a line that has no text content other than whitespace (i.e., it is not an "inline" tag).

ubermichael commented 8 years ago

This is not valid and should not happen:

./input/withTitlePage/Pur/Pur_F3.txt:{{s}h}all be {s}oon dried up, and a better world then ever<RULE n="4"/>

ubermichael commented 8 years ago

I've assumed that this is OK:

<!-- comment --> <RULE n="3"/> <!-- another comment -->

Which is to say that the comments and whitespace are acceptable.

telic commented 8 years ago

Yep. Any other tagging is fine, as long as there's no text content. E.g.

<L n="1"/> <RULE/> <!-- comment -->  </SCENE>

should be okay too.
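
The rule agreed on in this thread could be sketched roughly as below. This is only an illustration in Python, not the isetools implementation: the function name `rule_has_text_content` and the regular expressions are hypothetical, and the sketch assumes that comments open and close on the same line and that ISE brace markup such as `{s}` counts as text content.

```python
import re

# Hypothetical patterns for illustration only; not taken from isetools.
COMMENT_RE = re.compile(r"<!--.*?-->")       # a comment that opens and closes on one line
TAG_RE = re.compile(r"</?[A-Za-z][^<>]*>")   # any start, end, or self-closing tag
RULE_RE = re.compile(r"<RULE\b[^<>]*/?>")    # a RULE tag, with or without attributes


def rule_has_text_content(line: str) -> bool:
    """Return True if the line holds a RULE tag together with text content."""
    without_comments = COMMENT_RE.sub("", line)
    if not RULE_RE.search(without_comments):
        return False  # no RULE tag on this line, so nothing to report
    # Whatever remains after removing every tag is text content
    # (brace markup like {s} is assumed to count as text).
    remainder = TAG_RE.sub("", without_comments)
    return remainder.strip() != ""


invalid = '{{s}h}all be {s}oon dried up, and a better world then ever<RULE n="4"/>'
with_comments = '<!-- comment --> <RULE n="3"/> <!-- another comment -->'
with_other_tags = '<L n="1"/> <RULE/> <!-- comment -->  </SCENE>'

for sample in (invalid, with_comments, with_other_tags):
    print(rule_has_text_content(sample), sample)
```

Stripping comments before stripping tags keeps any tag-like text inside a comment from being mistaken for markup. A comment that spans several lines would need a parser that tracks state across lines, which this sketch does not attempt.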