Source regulation RegML seems to contain section titles with inconsistent formatting, for example:
1003.2 Something
§ 1003.2 Something
Something
§ Something
In all of these cases, the section sublabel should be properly extracted as "Something". This change makes the sublabel extraction logic somewhat more robust and adds a few unit tests to verify functionality.
A real example of this kind of thing can be found in this Reg E RegML file.
Source regulation RegML seems to contain section titles with inconsistent formatting, for example:
In all of these cases, the section sublabel should be properly extracted as "Something". This change makes the sublabel extraction logic somewhat more robust and adds a few unit tests to verify functionality.
A real example of this kind of thing can be found in this Reg E RegML file.