Riksarkivet / dataplattform

Documentation and examples for Riksarkivet's API and data services

Wikidata:Property proposal/National Archives of Sweden persistent identifier #13

Closed salgo60 closed 1 year ago

salgo60 commented 1 year ago

Some questions need to be answered in Wikidata land:

1) Do we need both this Wikidata:Property_proposal/National_Archives_of_Sweden_persistent_identifier and Property:P9713 "Swedish National Archive agent ID"? Fewer is better, I guess.
2) Is there an API or a list of the NAD entries available?
3) Are Riksarkivet's NAD entries mapped to Wikidata?
4) Have you classified them, so we can tell whether they are notable enough for Wikidata?

nilsw-ra commented 1 year ago

This is a bit tricky, as archival objects, agents and concepts (e.g. topographic entities) all have the same kind of persistent identifier. This id is a Base62-encoded UUID. So P9713 is rather superfluous; I would prefer there to be one property for all these PIDs. BUT, things have moved since I wrote the proposal, and I have realized that it would be better to use the RDF URIs for these objects (all three kinds). We don't serve RDF yet, but it is in the works now. To complicate things further, I thought we had settled on a URI pattern, but it turns out there is one more issue to consider. The National Archives of Finland are planning to use URNs for PIDs, so the question is whether we should too. If we decide to do that, these URNs should also be used for RDF URIs rather than the http URIs I have considered so far:

http://data.riksarkivet//
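To illustrate what a "Base62-encoded UUID" could look like in practice, here is a minimal Python sketch. The alphabet order (digits, then upper case, then lower case) and the zero-padding to 22 characters are assumptions, since the thread does not say which Base62 variant Riksarkivet uses, so real PIDs may well decode to different UUIDs than this sketch would give.

```python
import uuid

# ASSUMPTION: digit order 0-9, A-Z, a-z. The thread does not specify the
# Base62 alphabet Riksarkivet uses, so real PIDs may decode differently.
ALPHABET = "0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz"
PID_LENGTH = 22  # 62**22 > 2**128, so 22 digits can hold any UUID


def uuid_to_pid(value: uuid.UUID) -> str:
    """Encode the 128-bit integer of a UUID as a fixed-width Base62 string."""
    n = value.int
    digits = []
    while n:
        n, remainder = divmod(n, 62)
        digits.append(ALPHABET[remainder])
    # Pad with the zero digit so every PID has the same length.
    return "".join(reversed(digits)).rjust(PID_LENGTH, ALPHABET[0])


def pid_to_uuid(pid: str) -> uuid.UUID:
    """Decode a Base62 string back to a UUID (ValueError on bad input)."""
    n = 0
    for ch in pid:
        n = n * 62 + ALPHABET.index(ch)
    return uuid.UUID(int=n)


example = uuid.UUID("12345678-1234-5678-1234-567812345678")
assert pid_to_uuid(uuid_to_pid(example)) == example
```

The round trip works for any alphabet; only interoperability with the real PIDs depends on matching Riksarkivet's actual encoding.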

salgo60 commented 1 year ago

Thanks, keep us updated. I guess most things are moving targets when we are trying to connect things...

It would be excellent if we could use Wikidata as a start, as the Welfare State Analytics project is doing (they use students such as Emil and me to fix the data quality).

My understanding is that in the future they will be the trusted source for Wikidata, but as a first step WD is better than nothing...

salgo60 commented 1 year ago

@nilsw-ra FYI: changing the formatter URL (P1630) for a property in WD takes 2 seconds and is DRY, since it is changed in just one place, so don't let that make you hesitate.

  • my guess is that we will find much more difficult things when we open the Pandora's jar of the National Archives 🚀
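The DRY point can be sketched in a few lines of Python. Wikidata statements store only the bare identifier, and the formatter URL (P1630) holds a `$1` placeholder that is replaced by that identifier when a link is rendered. The two formatter URLs and the identifiers below are hypothetical placeholders, not real Riksarkivet or Wikidata values.

```python
def expand_formatter_url(formatter_url: str, identifier: str) -> str:
    """Substitute an identifier for the $1 placeholder of a formatter URL."""
    return formatter_url.replace("$1", identifier)


# Hypothetical stored identifier values; statements hold only these strings.
stored_ids = ["abc123", "def456"]

# Hypothetical formatter URLs before and after a change of URI scheme.
old_formatter = "https://example.org/old-scheme/$1"
new_formatter = "https://example.org/new-scheme/$1"

# Switching the formatter changes every rendered link at once;
# the stored identifiers themselves are untouched.
old_links = [expand_formatter_url(old_formatter, i) for i in stored_ids]
new_links = [expand_formatter_url(new_formatter, i) for i in stored_ids]
```

This only covers a change of URL pattern. If the identifiers themselves changed (for example from Base62 PIDs to URNs), the stored values would need updating too.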


nilsw-ra commented 1 year ago

I see that the tentative URI scheme didn't come out as expected...

http://data.riksarkivet.se/{object type}/{pid}

e.g.

http://data.riksarkivet.se/agent/HX8UA8hDrH646m3GjpvwY3 (person authority record, Axel Munthe)

http://data.riksarkivet.se/archive/Tp1bCTomaKMVT0c9t1XLA0 (archive record, Axel Munthe's archive)
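A small Python sketch of how the tentative `{object type}/{pid}` pattern could be built and parsed. The scheme was explicitly not final at this point in the thread, and the helper names `build_uri` and `parse_uri` are made up for illustration; only the `agent` and `archive` types appear in the examples, so no other type names are assumed.

```python
from urllib.parse import urlsplit

# Host taken from the tentative scheme in the thread; it was not final.
HOST = "data.riksarkivet.se"


def build_uri(object_type: str, pid: str) -> str:
    """Build a URI following the tentative {object type}/{pid} pattern."""
    return f"http://{HOST}/{object_type}/{pid}"


def parse_uri(uri: str) -> tuple:
    """Split a URI of the tentative pattern into (object_type, pid)."""
    parts = urlsplit(uri)
    segments = parts.path.strip("/").split("/")
    if parts.netloc != HOST or len(segments) != 2 or not all(segments):
        raise ValueError(f"not a {HOST} object URI: {uri!r}")
    object_type, pid = segments
    return object_type, pid


object_type, pid = parse_uri(
    "http://data.riksarkivet.se/agent/HX8UA8hDrH646m3GjpvwY3"
)
```

Because the PID alone is meant to be unique across all three kinds of object, the type segment carries no extra identifying information; it only tells a reader what kind of record to expect.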

salgo60 commented 1 year ago

Now I don't follow you. Did you change something in WD? Don't hesitate to call me: 0735152802

The question is whether it adds value to have the type in the URI scheme; in Wikidata we don't...

I guess persons are stable, but archives are split and merged and should maybe have a more advanced model, such as "part of". I guess parts of an archive can also be physically moved to other parts...

nilsw-ra commented 1 year ago

I have not done anything with the Wikidata property proposal yet. I need to think this over in order to get something that is viable in the long term. I should probably put it on hold, actually. All three kinds (archive data, person/organization authorities, and concepts, including topographic entities) have persistent ids. (There is an unfortunate exception, but that is outside the scope of this.) There is a "has part" / "is part of" relation between archives and their subordinate elements, which will be included in the RDF representation, e.g. Fonds (archive) -> Series -> Volume.
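As a rough picture of the "has part" / "is part of" relation, here is a minimal in-memory Python sketch. It is not the planned RDF representation, which the thread does not describe in detail; the class name `ArchivalUnit` and the PIDs are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ArchivalUnit:
    pid: str    # persistent identifier (hypothetical values below)
    level: str  # e.g. "Fonds", "Series", "Volume"
    # "is part of" link; excluded from repr/eq to avoid recursive cycles
    parent: Optional["ArchivalUnit"] = field(
        default=None, repr=False, compare=False
    )
    # "has part" links
    parts: List["ArchivalUnit"] = field(
        default_factory=list, repr=False, compare=False
    )

    def add_part(self, child: "ArchivalUnit") -> "ArchivalUnit":
        """Record both directions of the relation and return the child."""
        child.parent = self
        self.parts.append(child)
        return child

    def ancestry(self) -> List[str]:
        """Levels from the top of the hierarchy down to this unit."""
        chain = []
        node: Optional["ArchivalUnit"] = self
        while node is not None:
            chain.append(node.level)
            node = node.parent
        return chain[::-1]


fonds = ArchivalUnit("hypothetical-pid-1", "Fonds")
series = fonds.add_part(ArchivalUnit("hypothetical-pid-2", "Series"))
volume = series.add_part(ArchivalUnit("hypothetical-pid-3", "Volume"))
print(" -> ".join(volume.ancestry()))  # Fonds -> Series -> Volume
```

Splits, merges and physical moves, as raised earlier in the thread, would amount to re-parenting units; because each unit keeps its own PID, its identity would not depend on where it sits in the hierarchy.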

salgo60 commented 1 year ago

ok

  • "unfortunate exception": we use redirects en masse in Wikidata; see objects with NAD P5324 that have been merged in the last 1000 days / SBL P3217

    • people add new objects that are already there, or they add them in another language and someone later realizes it is the same object that is already in WD
    • when I check Riksarkivet's August Strindberg records, I feel you have a lot of duplicates that should be merged... maybe that is a bigger problem than the URL scheme...
    • SE/GUB/2432 Strindberg, August (1849 – 1912)
    • SE/GUB/REA000053729 STRINDBERG, AUGUST (1849 – 1912)
    • SE/UUB/REA000137171 STRINDBERG, JOHAN AUGUST (AUGUST) (1849 – 1912)
    • SE/LUB/722 Strindberg, August (1849 – 1912)
    • it looks like most of the data is 4-star data with no external references (DBpedia example below) --> to make the data useful it should be 5-star data
    • some feedback: the Riksarkivet SBL dataset is strings and not things
  • would be cool if you archives started using Wikibase and delivered more trusted data than Wikidata....
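The duplicate problem with the four Strindberg records listed above can be made concrete with a small Python sketch. The matching rule (same surname plus same birth and death year) is a deliberately crude assumption for illustration, not a method Riksarkivet or Wikidata uses, and it would also match relatives who share surname and life dates, so the result is a list of candidates for human review rather than automatic merges.

```python
import re
from collections import defaultdict

# The four reference codes and labels quoted in the thread.
RECORDS = [
    ("SE/GUB/2432", "Strindberg, August (1849 – 1912)"),
    ("SE/GUB/REA000053729", "STRINDBERG, AUGUST (1849 – 1912)"),
    ("SE/UUB/REA000137171", "STRINDBERG, JOHAN AUGUST (AUGUST) (1849 – 1912)"),
    ("SE/LUB/722", "Strindberg, August (1849 – 1912)"),
]

# "Surname, given names (born – died)", with either an en dash or a hyphen.
LABEL = re.compile(
    r"^(?P<surname>[^,]+),.*\((?P<born>\d{4})\s*[–-]\s*(?P<died>\d{4})\)\s*$"
)


def match_key(label: str):
    """Return (surname, born, died) or None if the label does not parse."""
    m = LABEL.match(label)
    if m is None:
        return None
    return (m["surname"].strip().casefold(), int(m["born"]), int(m["died"]))


def candidate_duplicates(records):
    """Group reference codes whose labels share the same match key."""
    groups = defaultdict(list)
    for code, label in records:
        key = match_key(label)
        if key is not None:
            groups[key].append(code)
    return {key: codes for key, codes in groups.items() if len(codes) > 1}


candidates = candidate_duplicates(RECORDS)
```

With these four labels, all of the reference codes end up in a single group keyed on `("strindberg", 1849, 1912)`.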


Interesting that SE/LUB/722 had a "same as" link to DBpedia; it feels like Riksarkivet had some linked data ambitions that died...

salgo60 commented 1 year ago

OT: at the Friday session with Riksarkivet SBL I asked them, at 47 min, whether they were part of designing the future of the National Archives archive, and they said no one asks them. Maybe use them as use cases (see my thoughts on a structured requirement process, #25).

nilsw-ra commented 1 year ago

I will get back on the issue on wikidata.org once we have decided what's best.

salgo60 commented 11 months ago

Any status?

DavidHaskiya commented 11 months ago

I'm afraid @nilsw-ra is no longer working at the Swedish National Archives. I will need to read up myself and check with a colleague before getting back to you all. That won't be until next week at the earliest, since that colleague is on holiday this week.

salgo60 commented 11 months ago

Sad, I feel you were moving in the right direction... I guess that if he has left, nothing will happen now that he has Kultur IT as "sponsor"....