SpeciesFileGroup / taxonworks

Workbench for biodiversity informatics.
http://taxonworks.org
MIT License

Allow specifying format details when uploading TXT files in DwC importer #3894

Closed. LocoDelAssembly closed this issue 8 months ago

LocoDelAssembly commented 8 months ago

Because multi-line data causes problems, we have to change our TSV output format to quote text. Not every text value is quoted, only values that would otherwise break the dataset if not escaped or quoted correctly.
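To illustrate the "quote only when needed" behaviour, here is a minimal sketch using Python's standard `csv` module. It is not the TaxonWorks exporter (which is not shown in this issue); the column names and sample rows are made up for the example.

```python
import csv
import io

# Hypothetical sample rows; the second data row contains a line break.
rows = [
    ["catalogNumber", "locality"],
    ["A-1", "Plain text stays unquoted"],
    ["A-2", "First line\nsecond line"],
]

# QUOTE_MINIMAL quotes a field only when it contains the delimiter,
# the quote character, or a line break.
buffer = io.StringIO()
writer = csv.writer(
    buffer,
    delimiter="\t",
    quotechar='"',
    quoting=csv.QUOTE_MINIMAL,
    lineterminator="\n",
)
writer.writerows(rows)
tsv_text = buffer.getvalue()

# Reading the text back with the same settings restores the original rows,
# including the embedded line break.
parsed = list(csv.reader(io.StringIO(tsv_text), delimiter="\t", quotechar='"'))
```

Only the multi-line value ends up wrapped in quotes, so a reader that is told the string delimiter can reassemble the record instead of splitting it into two broken rows.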

Required changes

Frontend

When a tsv, txt, or csv (new) file is uploaded, the user should be asked for the field delimiter and the string delimiter before the file is sent to the backend. The field delimiter options are Tab, Comma, Semicolon, Space, and Other. When Other is selected, a text box must be available for the user to type in the character. Only ONE option can be selected. The string delimiter options are ', ", and None (perhaps shown in italics or otherwise differentiated from the actual values).
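The selection rules above could be modelled roughly as follows. This is a sketch only: the function and constant names are hypothetical, and the actual frontend is not written in Python. It assumes that "Other" must supply exactly one character, which is how the text box requirement is read here.

```python
from typing import Optional

# Named field delimiter options from the issue; "Other" is handled separately.
FIELD_DELIMITER_OPTIONS = {"Tab": "\t", "Comma": ",", "Semicolon": ";", "Space": " "}

# Allowed string delimiters; None means "no string delimiter".
STRING_DELIMITER_OPTIONS = ("'", '"', None)


def resolve_field_delimiter(option: str, other: Optional[str] = None) -> str:
    """Turn the single selected option into the delimiter character."""
    if option == "Other":
        # Assumption: the text box must contain exactly one character.
        if other is None or len(other) != 1:
            raise ValueError("'Other' requires exactly one character")
        return other
    if option not in FIELD_DELIMITER_OPTIONS:
        raise ValueError(f"Unknown field delimiter option: {option!r}")
    return FIELD_DELIMITER_OPTIONS[option]


def resolve_string_delimiter(option: Optional[str]) -> Optional[str]:
    """Accept only ', " or None as the string delimiter."""
    if option not in STRING_DELIMITER_OPTIONS:
        raise ValueError(f"Unknown string delimiter option: {option!r}")
    return option
```

For example, `resolve_field_delimiter("Other", "|")` yields `"|"`, while `resolve_field_delimiter("Other")` raises `ValueError` because no character was supplied.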

Defaults for each format are these:

| Format | Field delimiter | String delimiter |
|--------|-----------------|------------------|
| csv    | Comma           | "                |
| tsv    | Tab             | "                |
| txt    | Tab             | "                |
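As a rough sketch of how these defaults might be applied, the snippet below picks the default pair from a file extension and parses text with Python's standard `csv` module. The names `DEFAULTS`, `default_format`, and `parse` are hypothetical, and the backend behaviour is not specified in this issue, so treat this as one possible reading rather than the importer's implementation.

```python
import csv
import io
from typing import List, Optional, Tuple

# Defaults from the table: extension -> (field delimiter, string delimiter).
DEFAULTS = {
    "csv": (",", '"'),
    "tsv": ("\t", '"'),
    "txt": ("\t", '"'),
}


def default_format(filename: str) -> Tuple[str, Optional[str]]:
    """Look up the defaults by file extension (KeyError if unsupported)."""
    extension = filename.rsplit(".", 1)[-1].lower()
    return DEFAULTS[extension]


def parse(text: str, field_delimiter: str,
          string_delimiter: Optional[str]) -> List[List[str]]:
    """Parse delimited text with the chosen field and string delimiters."""
    source = io.StringIO(text)
    if string_delimiter is None:
        # "None" option: quote characters are treated as ordinary text.
        reader = csv.reader(source, delimiter=field_delimiter,
                            quoting=csv.QUOTE_NONE)
    else:
        reader = csv.reader(source, delimiter=field_delimiter,
                            quotechar=string_delimiter)
    return list(reader)


# A txt upload using its defaults keeps the quoted multi-line value in one field.
records = parse('a\t"b\nc"\n', *default_format("occurrences.txt"))
```

The user's choices would override the defaults simply by passing different arguments to `parse`; with the string delimiter set to `None`, quote characters are kept as part of the value.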

Backend

LocoDelAssembly commented 8 months ago

Missing frontend functionality:

LibreOffice dialog: image