riggsd / guano-spec

The GUANO bat acoustic metadata format specification
http://guano-md.org
8 stars 6 forks source link

Specify That No UTF-8 BOM Is Allowed #4

Open riggsd opened 7 years ago

riggsd commented 7 years ago

Byte Order Mark (BOM) is technically allowed in UTF-8 encoding, but neither required nor recommended. Specify that the BOM is illegal in GUANO UTF-8 so that reading implementations don't have to care.

See: https://en.wikipedia.org/wiki/Byte_order_mark#UTF-8

riggsd commented 7 years ago

Or, it sounds like BOM may be required for a UTF-8 plaintext file on Windows... perhaps the spec should simply warn readers that it may be present?