Closed jar398 closed 2 years ago
This behavior is specified in the tests, but if it's not intentional then this change in grammar.peg "fixes" it:
- CapAuthorWord <- AuthorUpperChar AuthorLowerChar*
+ CapAuthorWord <- AuthorUpperChar (AuthorLowerChar / AuthorUpperChar)*
The only tests it breaks are the ones showing the behavior of splitting e.g. McDunnough
into two author words, but I guess it might have unintended consequences that are not currently tested for -- but figuring that out is some way beyond my talents...
@jar398 this is not intentional, another good catch!
I was hoping to reconstruct a canonicalized scientificName from CanonicalFull + Authorship, but it looks like gnparser inserts a space after 'Mc' in names starting with 'Mc', which doesn't give a good result. I don't know whether this gnparse behavior is intentional or not so perhaps this is not a bug.