Open nathangibson opened 3 years ago
Okay @hannafriedel , I finally figured it out, there is a better way to add OpenRefine columns from another spreadsheet! (Of course.) It uses the cross
function.
cells["Affiliation (all)"]
to match the name of the column you want to import from the other sheet: cell.cross("persons text v0.6.0-dev 2021-07-26","ID").cells["Affiliation (all)"].value[0]
. How this works: The cross
function matches your current column (cell
) to the one from the project and column specified ("persons text v0.6.0-dev 2021-07-26","ID")
. Then cells["Affiliation (all)"]
gets the column you want as an object and value[0]
extracts the value from it.
If you have any trouble let me know. I can always upload my version if that helps.
I forgot to mention, some columns you might want to import for matching are
The Arabic names especially might help with ones you weren't able to match otherwise.
Hi Nathan, I tried to reconcile the arabic names, but it only resulted in a few more matches (I only reconciled the Arabic names, when I had not already matched the English name).
@hannafriedel great work!
I think the next major step will be matching against smaller, specialized databases that don't have OpenRefine APIs. We'll need to talk about how to do that.
I think that in both cases some can be assigned a singular match by doing intensive research including usaybia's writing. I tried to avoid this in hopes that you have a more elegant less time-consuming solution but can start doing it any time. I do not really know with which cells you could help so maybe wait until I have shrunk down the number of affected cells.
1. a) I ran the reconciliation with every cell so the ones which do not have any matches at all, could not be matched by the reconciliation. b) For the ones which have multiple matches I did a basic lookover, if I could find the correct match by death date/occupation/relation/etc. and it did not work due to insufficient information.
I think that in both cases some can be assigned a singular match by doing intensive research including usaybia's writing. I tried to avoid this in hopes that you have a more elegant less time-consuming solution but can start doing it any time. I do not really know with which cells you could help so maybe wait until I have shrunk down the number of affected cells.
OK, this makes sense. It may be that as we reconcile against specialized databases we will find some of the info (or links) we need for the unmatched persons. I'll post info here about how to reconcile with other databases. But tagging ch. 8 #162 is also a priority, so you can work on that for now and come back to this when you are bored :-)
1. I included these colums and also work location and "educated at". The VIAF IDs I immediately converted into VIAF matches so they are not visible anymore.
Perfect! 👍
Next step:
Please work on the unmatched persons to try to find matches. You can try
You could work on these as 2 or more subsets such as
If you find useful information but not enough to decide you can add it to a notes column.
Note to self: We may be able to use https://github.com/cmharlow/isni-reconcile to get ISNIs (where not available from VIAF/WikiData).
@hannafriedel Just to document what we already discussed:
Use the branch content/uri-matching
In OpenRefine:
You can find the reconciliation service for VIAF at http://refine.codefork.com/. (Please duplicate the LHOM Name column for each additional reconciliation service.)
OpenRefine documentation: https://openrefine.org/documentation.html (you can also check out youtube videos)