InTaVia / idm-rdf

Intavia Data Model for RDF data
1 stars 2 forks source link

can/should we add the biographical text in the IKG #19

Open sennierer opened 2 years ago

sennierer commented 1 year ago

@biktorrr @yoge1 and @joahim could you please do some research if it is (license wise) allowed to also publish the texts of the biographies? If not we still need to decide whether we want to include the text to allow for full text searches across the biographies (or run some NLP pipelines on it)

sennierer commented 1 year ago

update in meeting 2022-12-16: looks like finnish biographies can not be added as fulltext, the others can. Is that correct @yoge1 ?

any ideas on how to add the fulltext to the model? @biktorrr @yoge1 @CarlaVS ?

related to InTaVia/InTaVia-Backend#77

yoge1 commented 1 year ago

I just read the agreement we have done with the Finnish biograhy data provider, and as far as I understand we are allowed to republish the biographies as fulltext for deceased persons in Intavia (the agreement allows us to publish the data for deceased persons in our university's data service, however it doesn't state a license, but in our data service we have stated CC BY 4.0, see: https://www.ldf.fi/dataset/nbf which allows anyone to republish the data with attribution).

sennierer commented 1 year ago

Great, so we can also publish the Finnish data. Are there any still living people that we need to filter out in your dataset?

yoge1 commented 1 year ago

No, there are no still living people in our dataset.

yoge1 commented 1 year ago

Having now had further discussions with the Finnish biograhy data provider, it seems clear that we can actually publish only the lead paragraphs of the biography fulltexts (of deceased persons), the other paragraphs cannot be published openly not even in our university's data service. This is due to copyright issues related to the individual biography authors.