Closed by brokkr 3 years ago
One way to skirt the issue would be to see if it would be sufficient to create a hash of the entire sub. If the only purpose is to catch changes, that should be sufficient.
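A rough sketch of what hashing the entire sub could look like. This is only an illustration: it uses the standard library's `xml.etree.ElementTree` rather than LXML so that it runs on its own, and `sub_fingerprint` plus the sample XML are made-up names, not anything from the codebase.

```python
import hashlib
import xml.etree.ElementTree as ET


def sub_fingerprint(sub_element):
    """Return a SHA-256 hex digest of the serialized element (hypothetical helper)."""
    # Serialize the whole subtree to UTF-8 bytes, then hash those bytes.
    # Non-ASCII element names are just bytes at this point.
    serialized = ET.tostring(sub_element, encoding="utf-8")
    return hashlib.sha256(serialized).hexdigest()


# Made-up sample data: two versions of a sub that differ in one value.
old = ET.fromstring("<sub><title>Example</title><tåg>x</tåg></sub>")
new = ET.fromstring("<sub><title>Example</title><tåg>y</tåg></sub>")

# Only the digests are stored and compared, never the elements themselves.
changed = sub_fingerprint(old) != sub_fingerprint(new)
```

Since the digest is taken over the serialized text, a whitespace-only edit to the sub would also register as a change. That is probably fine if the only purpose is to catch changes.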
Of course, if Unicode characters in element names are banned, that should be caught at the XML parsing stage. But I don't think they are; cf. the sanitization efforts.
On the other hand: Are there any good reasons to allow them?
We can't match against a list because Vorbis comment tags can be anything (in ASCII), so how would we do this? Loop over a list and try encode('ascii') on each name? Seems primitive...
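For reference, the primitive version would be something like the sketch below. `non_ascii_names` and the sample names are hypothetical, and where the names would come from in the real code is left open.

```python
def non_ascii_names(names):
    """Return the names that cannot be encoded as ASCII (hypothetical helper)."""
    offenders = []
    for name in names:
        try:
            # encode() raises UnicodeEncodeError on any non-ASCII character.
            name.encode("ascii")
        except UnicodeEncodeError:
            offenders.append(name)
    return offenders


# Made-up sample names; only the middle one contains a non-ASCII character.
flagged = non_ascii_names(["artist", "título", "album"])
```

On Python 3.7 and later, `name.isascii()` does the same check without the try/except.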
Maybe do some testing, see if it's an LXML or a pickle issue. Maybe the wrong character set is assumed? If it is an upstream bug, we should report it.
Note that if #139 is implemented, this becomes irrelevant.
Neither pickle nor LXML is used in v2. Closing.
How to replicate: