cnabio / cnab-spec

Cloud Native Application Bundle Specification
https://cnab.io
Other
957 stars 99 forks source link

Clarify character set constraints on name (Add a link to the Unicode spec) #273

Closed ndeloof closed 4 years ago

ndeloof commented 5 years ago

spec says

name MUST contain only characters from the Unicode graphic characters The Unicode graphic character set allowed for name includes letters, numbers, punctuation, symbols, and spaces; it does not include newlines or tabs.

I'm not sure this definition is clear enough on supported characters. Typically, what does symbols mean ? Can I use Klingon characters? emojis? Unicode Character 'BACKSPACE' (U+0008) etc.

Maybe clarify using explicit Unicode General categories Letter (L), Number (N) ... and explictely exclude Controls & al (C)

technosophos commented 5 years ago

"Unicode Graphics Characters" is a defined set of Unicode characters.

In Unicode, Graphic characters are those with General Category Letter, Mark, Number, Punctuation, Symbol or Zs=space. Other code points (General categories Control, Zl=line separator, Zp=paragraph separator) are Format, Control, Private Use, Surrogate, Noncharacter or Reserved (unassigned)

https://www.unicode.org/versions/Unicode5.2.0/ch02.pdf#G25564

technosophos commented 5 years ago

Oops... the link was to the spec, and the quote was from the wikipedia entry on graphic characters.

https://en.wikipedia.org/wiki/Graphic_character

ndeloof commented 5 years ago

Oh ok. I googled for it but didn't found anything obvious. Then a link would avoid stupids like me to make such a mistake ;)

technosophos commented 5 years ago

Thanks! I updated the title and will do a PR next week.

technosophos commented 4 years ago

The PR was merged a long time ago, so I am closing this issue.