metanorma / pubid-bsi

BSI Publication Identifiers
BSD 2-Clause "Simplified" License

Parse "Expert Commentary" identifiers #11

Closed: ronaldtse closed this issue 1 year ago

ronaldtse commented 1 year ago

BSI publishes "Expert Commentary" documents for certain standards. Their identifiers follow this pattern:

{base identifier} ExComm

e.g.

BS 7273-4:2015+A1:2021 ExComm
BS EN ISO 13485:2016+A11:2021 ExComm
BS EN 55011:2016+A2:2021 ExComm
BS EN 61000-3-3:2013+A2:2021 ExComm
BS 5250:2021 ExComm
BS EN ISO 2692:2021 ExComm
BS EN ISO 22301:2019 ExComm
BS EN IEC 61439-2:2021 ExComm
BS 60080:2020 ExComm
BS 1722-2:2020 ExComm
BS EN IEC 60947-1:2021 ExComm
BS EN IEC 61000-6-3:2021 ExComm
BS EN ISO/IEC 80079‑34:2020 ExComm
BS 10008-1:2020 ExComm
BS EN IEC 62115:2020+A11:2020 ExComm
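
The `{base identifier} ExComm` pattern above can be sketched as a simple suffix split. This is a minimal Python illustration under stated assumptions, not how pubid-bsi implements it: `EXCOMM_SUFFIX` and `split_expert_commentary` are hypothetical names, and the base identifier is treated as an opaque string rather than parsed.

```python
import re
from typing import Tuple

# Hypothetical pattern (not from pubid-bsi): a base identifier followed by
# whitespace and the literal marker "ExComm" at the end of the string.
EXCOMM_SUFFIX = re.compile(r"^(?P<base>.+?)\s+ExComm$")


def split_expert_commentary(identifier: str) -> Tuple[str, bool]:
    """Return (base_identifier, is_expert_commentary)."""
    text = identifier.strip()
    match = EXCOMM_SUFFIX.match(text)
    if match:
        # The base identifier is left untouched for a downstream parser.
        return match.group("base"), True
    return text, False


print(split_expert_commentary("BS 7273-4:2015+A1:2021 ExComm"))
print(split_expert_commentary("BS 5250:2021"))
```

Keeping the suffix check separate from base-identifier parsing means the existing rules for the base identifier would not need to change.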
mico commented 1 year ago

BS EN ISO/IEC 80079‑34:2020 ExComm

Every example in the list uses hyphen-minus "-" (the standard minus symbol) except this one, which uses a "UNICODE HYPHEN":

U+2010 ‐ HYPHEN, called "UNICODE HYPHEN" here (HTML &#8208; or &hyphen;)
U+2011 ‑ NON-BREAKING HYPHEN

We have never needed it for other identifiers, except "JIS" identifiers. Should I add "UNICODE HYPHEN" support for ISO/IEC identifiers?

ronaldtse commented 1 year ago

For parsing, let's also accept "UNICODE HYPHEN". Thanks!
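
One way to accept these characters is to normalize them to hyphen-minus before parsing. The sketch below is a Python illustration only, not the change made in pubid-bsi: `HYPHEN_TRANSLATION` and `normalize_hyphens` are hypothetical names, and it assumes that only the two code points listed in this thread (U+2010 and U+2011) need handling.

```python
# Hypothetical translation table: maps the two hyphen variants named in this
# thread to ASCII hyphen-minus (U+002D). Other dash characters are left alone.
HYPHEN_TRANSLATION = str.maketrans({
    "\u2010": "-",  # HYPHEN
    "\u2011": "-",  # NON-BREAKING HYPHEN
})


def normalize_hyphens(identifier: str) -> str:
    """Replace U+2010 and U+2011 with hyphen-minus; leave the rest unchanged."""
    return identifier.translate(HYPHEN_TRANSLATION)


# The part number below is written with U+2011 (non-breaking hyphen).
raw = "BS EN ISO/IEC 80079\u201134:2020 ExComm"
print(normalize_hyphens(raw))
```

Normalizing up front keeps the grammar itself ASCII-only. The alternative is to accept the extra characters directly in the part-number rule, which preserves the original spelling but touches more of the parser.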