Other attempt to solve issue #23:
instead of using a black-box library I just accept defined simple cases:
one of the two full names is a subset of the other (eg. 'Timothy Head' subset 'Timothy Daniel Head')
ignore difference in accents, lower cases
All the spelling mistakes or different spelling for Russian names are still a problem but I cannot solve it without allowing wrong matches to be made (eg. Atlas' member Liqing Zhang being confused with LHCb's member Liming Zhang).
I guess that the simpler solution is or to change the database or to "hard code" known differences
Other attempt to solve issue #23: instead of using a black-box library I just accept defined simple cases:
All the spelling mistakes or different spelling for Russian names are still a problem but I cannot solve it without allowing wrong matches to be made (eg. Atlas' member Liqing Zhang being confused with LHCb's member Liming Zhang).
I guess that the simpler solution is or to change the database or to "hard code" known differences