kbrbe / beltrans-data-integration

Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020
https://www.kbr.be/en/projects/beltrans/
MIT License
5 stars 0 forks source link

Processing of identified publishers is broken #249

Closed SvenLieber closed 5 months ago

SvenLieber commented 8 months ago

Data about publishers that we identify in KBR MARC fields 264 are not properly processed. We have a bash script that extracts lines of a contribution file in which the second column (supposedly the KBR identifier) is longer than 8 characters, KBR identifiers have length 8, but self created identifiers are a hash sum which is longer.

However, due to a change that we introduced a while ago, the column with the identifier is no longer in the second column, but in the third. The second column is now the type of contributor, i.e. person or org.

Hence the script does not produce any output.