Closed GraylinKim closed 11 years ago
I ported some changes between branches that should mostly take care of the newlines. Diffs now include a pass through clean_line_formatting, which wraps a few regexes I suggested (replace runs of multiple spaces with a single space, and delete space-padded numbers up to 6 digits long at the start of a line), a conversion of empty lines to <br>, and some logic to strip page footers (which seems to miss them at the moment).
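For anyone following along, here is a rough sketch of what that cleaning pass might look like. It is not the code that was actually ported: only the name clean_line_formatting comes from this thread, while the regexes, the clean_text helper, and the choice of Python are assumptions made for illustration. Footer stripping is left out.

```python
import re

# Assumed pattern: optional leading spaces, a 1-6 digit line number,
# then at least one space of padding before the bill text.
_LINE_NUMBER = re.compile(r"^ *\d{1,6} +")
# Assumed pattern: any run of two or more spaces.
_MULTI_SPACE = re.compile(r" {2,}")


def clean_line_formatting(line):
    """Strip a leading line number, then collapse repeated spaces."""
    line = _LINE_NUMBER.sub("", line)
    line = _MULTI_SPACE.sub(" ", line)
    return line


def clean_text(text):
    """Hypothetical helper: clean each line and turn empty lines into <br>."""
    cleaned = []
    for line in text.splitlines():
        line = clean_line_formatting(line)
        cleaned.append("<br>" if line.strip() == "" else line)
    return "\n".join(cleaned)
```

One caveat with a pattern like this: a line of real text that happens to begin with a short number (a year, say) would lose that number too, so it probably needs to run only on text known to carry line numbers.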
(I'm closing this for now since I think the problem is solved, but if the current version doesn't fix the problem, feel free to reopen the issue.)
Reopening because the example I gave when I opened the issue still isn't solved.
This specific issue has now been resolved but there are other weird new line issues around.
The extra newlines at the beginning of S7033 are one such example. Likely there are others as well.
Hmm, it actually seems like the S7033 example went away at some point. I guess I'll close this.
Because bills follow a fixed character width rule, when the text changes all the new lines in the paragraph shift as well. This leads to a lot of choppy new line display in the diff, which we need to figure out how to handle.
Example from S39-2011
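Separately from the S39-2011 text itself, one possible way to approach the wrapping problem is to undo the fixed-width wrapping before diffing and then compare word by word, so that a shifted line break no longer counts as a change. The sketch below assumes paragraphs are separated by blank lines; reflow and word_changes are hypothetical names, and this is not the approach the project is known to use.

```python
import difflib
import re


def reflow(text):
    """Join each blank-line-separated paragraph into a single line."""
    paragraphs = re.split(r"\n\s*\n", text.strip())
    return [" ".join(p.split()) for p in paragraphs]


def word_changes(old_paragraph, new_paragraph):
    """Return (tag, old_words, new_words) for each non-equal region."""
    a = old_paragraph.split()
    b = new_paragraph.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    return [
        (tag, a[i1:i2], b[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
```

With this, inserting one word near the top of a paragraph should show up as a single insertion rather than as a change on every re-wrapped line below it.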