NHMDenmark / Mass-Digitizer

Common repo for the DaSSCo team
Apache License 2.0
1 stars 0 forks source link

Avoid bad input in Storage and taxonomic name fields. #385

Open jlegind opened 11 months ago

jlegind commented 11 months ago

I believe it would benefit us in the long run to restrict the types of characters that can go into the input text fields.

If storage location was restricted to a alpha-numeric (with certain symbolic characters included) list, then we would end up with less messy data.

jlegind commented 11 months ago

This would be non-trivial since we would have to accommodate almost all Unicode characters except control chars and most symbols. The standard Unicode table + Latin-1 Supplement should fit well initially since it has all Scandinavian and European diacritic characters.

PipBrewer commented 10 months ago

@jlegind As both taxon and storage fields use lists drawn from Specify, it is hard to understand how we are ending up with messy data. Please can you provide specific examples of these?