Currently we have a sheet with translations (one row per edition), and each translation carries the identifier of its cluster. For cleaning data and filling gaps per cluster, it is often handier to show only the data most relevant to the project, which is the first edition.
We should add a new sheet with information per cluster (with linked information about the first edition).
[x] initial working SPARQL query to fetch this information (if necessary with possible postprocessing in Python)
[x] adapt pipeline to add the new CSV as a sheet to the corpus Excel
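The Python postprocessing mentioned above could look roughly like the sketch below. It is not the actual pipeline code. It assumes the translations are available as CSV rows, that the columns are named `translation_id`, `cluster_id`, `title` and `year` (all hypothetical names), and that "first edition" means the row with the earliest known year in a cluster.

```python
import csv
import io


def first_edition_per_cluster(rows, cluster_key="cluster_id", year_key="year"):
    """Return one row per cluster: the edition with the earliest known year.

    The column names are assumptions for this sketch, not the real schema.
    """
    best = {}
    for row in rows:
        cluster = row.get(cluster_key)
        if not cluster:
            continue  # skip translations that have no cluster identifier
        year_text = (row.get(year_key) or "").strip()
        # Rows without a usable year sort last, so they are only chosen
        # when the cluster contains no dated edition at all.
        year = int(year_text) if year_text.isdigit() else float("inf")
        # Strict comparison: on a tie, the row seen first is kept.
        if cluster not in best or year < best[cluster][0]:
            best[cluster] = (year, row)
    return [row for _, row in best.values()]


# Hypothetical sample data standing in for the translations sheet.
SAMPLE_CSV = """translation_id,cluster_id,title,year
t1,c1,Example A,1950
t2,c1,Example A (reprint),1948
t3,c2,Example B,
t4,c2,Example B,1961
t5,,Unclustered,1970
"""

clusters = first_edition_per_cluster(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for row in clusters:
    print(row["cluster_id"], row["translation_id"], row["year"])
```

The resulting rows can be written to a CSV file with `csv.DictWriter` and then added to the corpus Excel by the pipeline step from the checklist. If the SPARQL query already returns one row per cluster, this step is unnecessary.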