digital-preservation / csv-schema

CSV Schema
http://digital-preservation.github.io/csv-schema
Mozilla Public License 2.0
98 stars 33 forks source link

Regex ignored? #24

Closed mhGLEIF closed 5 years ago

mhGLEIF commented 5 years ago

TEST SCHEMA:

version 1.1
@quoted
@totalColumns 2
@permitEmpty
@ignoreColumnNameCase
diacritics-allowed: length(*, 500) and regex("\S+( \S+)*")
ascii-only:  length(*, 500) // and regex("(!|"|%|&|'|\(|\)|\*|\+|,|-|\.|/|0|1|2|3|4|5|6|7|8|9|:|;|<|=|>|\?|A|B|C|D|E|F|G|H|I|J|K|L| |M|N|O|P|Q|R|S|T|U|V|W|X|Y|Z|_|a|b|c|d|e|f|g|h|i|j|k|l|m|n|o|p|q|r|s|t|u|v|w|x|y|z)+")

TEST CSV:

diacritics-allowed,ascii-only
èûîôà,èûîôà

RESULT: PASS

NOTE:

This is based on regex that's currently in use in XML schemas, where it appears to work OK. The intention is to filter out strings with diacritics (transliterated versions only)

mhGLEIF commented 5 years ago

oops. commented out the regex. my bad.