kbrbe / beltrans-data-integration

Creating a FAIR Linked Data corpus for the BELTRANS research project about Belgian book translations NL-FR and FR-NL between 1970 and 2020
https://www.kbr.be/en/projects/beltrans/
MIT License
5 stars 0 forks source link
belgium csv data-integration dutch-language europe french-language history knowledge-graph linked-data marc21 rdf rml sparql translation-studies

BELTRANS data integration

This repository contains code for the data integration of the BELTRANS project which studies Intra-Belgian translation flows between French and Dutch in the period 1970-2020. Data from different heterogenous data sources is integrated to create a FAIR corpus, this includes XML files from different sources but also existing large RDF dumps.

Preprocessing scripts for the different data sources are stored in the respective data-source folder. This mainly includes Python scripts but also RML mapping documents. The data integration is currently controlled by a bash script in the data-integration folder.

BELTRANS data integration overview