Closed jasdeepgosal closed 9 years ago
The parser takes lower- and upper case into account; this mode would detect something like 'van Beethoven'. Is it legitimate, in your case, that both names are lower case?
It is legitimate that both names are lower case.
Hmm, I'll make a note of this, but off the top of my head, I have no good idea how to support all lower case names without having to resort to pattern matching for particles. Do you have any suggestions?
If you know for a fact that there are no particles in your data-set you could patch the parser to turn particle into given after the parsing, perhaps?
I ran across this name (though this is an anonymized version, obviously) that isn't parsing correctly:
I would've expected: