Open nickynicolson opened 2 years ago
Thanks @nickynicolson for the suggestion. In most cases, this would work but then there are cases like this:
Input: A. & B. Smith
Output:
[{"family":"Smith","given":"A.","suffix":null,"particle":null,"dropping_particle":null,"nick":null,"appellation":null,"title":null},{"family":"Smith","given":"B.","suffix":null,"particle":null,"dropping_particle":null,"nick":null,"appellation":null,"title":null}]
So, the original for the first item in the array is merely A.
I suppose this is acceptable, but I'll have to sort-out how to pass along the original bits to the output. Some of the bits are actually parsed whereas others flow through some regex routines.
Input:
Friis I., Getachew A., Rasmussen F. & Vollesen K.
Current output:
Proposed: Add an extra property "original" to each agent entry in the list:
It would then be possible to surmise that this string consists of "agent, agent, agent & agent". Coupled with metadata hints about the source of the recordedBy data, this could help direct the parse strategy for more difficult examples.