NHMDenmark / Mass-Digitizer

Common repo for the DaSSCo team
Apache License 2.0
1 stars 0 forks source link

Review and refinement of OpenRefine GREL script #506

Closed AstridBVW closed 3 weeks ago

AstridBVW commented 1 month ago

The OpenRefine post-processing GREL script needed to be reviewed and refined.

The following changes have been made:

  1. For the column Publish, I have changed the value from “yes” to “True”
  2. I have corrected the remark source and remark date columns so they only contain a value (“DaSSCo digitisation” and “catalogeddate”) if remarks contain a value (ticket #461 and ticket #492 )
  3. I have added a column for “storedunder” with value “True” for all records (ticket #490 )
  4. I have added columns for “datafile_remark”, “datafile_source”, “datafile_date” with the following values for all records:
    1. datafile_remark: “insert filename”
    2. datafile_source: “DaSSCo data file”
    3. datafile_date: the value from catalogeddate
  5. I have added a step, you have to change the value in column “datafile_remark” from “insert filename” to the filename of the processed file (suffix “processed” and extension tsv)

The 4th and 5th change replace the monitoring script that Jan made for the time being (ticket #461 ).

AstridBVW commented 3 weeks ago

The new version of the GREL post-processing script is now available on Github.