This would allow the use of clean: :loose to only remove non-printable characters from the middle of the cell, in addition to whitespace at either end of the cell.
There are a few changes that are somewhat surprising (moving where sanitization happens), as the tests that only verified a lack of "blowing up" were not getting useful, testable values.
This was motivated by the fact that I am parsing a formatted Excel spreadsheet where we need to preserve the structure, but we have received unprintable characters in the past. clean: true is too aggressive; for most people, clean: :loose is likely to be more useful.
This would allow the use of
clean: :loose
to only remove non-printable characters from the middle of the cell, in addition to whitespace at either end of the cell.There are a few changes that are somewhat surprising (moving where sanitization happens), as the tests that only verified a lack of "blowing up" were not getting useful, testable values.
This was motivated by the fact that I am parsing a formatted Excel spreadsheet where we need to preserve the structure, but we have received unprintable characters in the past.
clean: true
is too aggressive; for most people,clean: :loose
is likely to be more useful.