diging / tethne

Python module for bibliographic network analysis.
http://diging.github.io/tethne/
GNU General Public License v3.0

the way wos reader module analyzes the names of the authors of the references #89

Closed yzou20 closed 8 years ago

yzou20 commented 9 years ago

For authors whose last name consists of two words, the wos reader module splits the last name into two parts and treats them as first name and last name. For example, for the author "van Kampenng NG", the tethne module reads the last name as "van" and the first name as "Kampenng", and ignores the real first name "NG".

nischalsamji commented 9 years ago

Hi Yawen, so is this the format that you are expecting?

First Name: NG
Last Name: Kampenng

erickpeirson commented 9 years ago

This is hard. One thing that you might consider is using a whitelist of affixes in the name parsing logic. E.g., see Wikipedia: https://en.m.wikipedia.org/wiki/List_of_family_name_affixes
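
A rough sketch of what the whitelist idea could look like, assuming the input is a "Surname Initials" string like the example above. This is not tethne's actual parsing code: `AFFIXES` and `split_name` are hypothetical names, and the affix set here is a small illustrative sample rather than a complete list.

```python
# Hypothetical whitelist of family-name affixes (illustrative, far from complete).
AFFIXES = {"van", "von", "de", "der", "den", "del", "la", "le", "di", "da", "ten", "ter"}


def split_name(raw):
    """Split a 'Surname Initials' string into (last, first).

    Leading tokens found in AFFIXES are kept as part of the surname.
    """
    tokens = raw.replace(",", " ").split()
    if not tokens:
        return "", ""

    i = 0
    # Consume leading affixes, but always leave one token for the surname core.
    while i < len(tokens) - 1 and tokens[i].lower() in AFFIXES:
        i += 1

    last = " ".join(tokens[: i + 1])    # affixes plus the surname core
    first = " ".join(tokens[i + 1 :])   # whatever remains (initials or given names)
    return last, first


print(split_name("van Kampenng NG"))
```

With this approach "van Kampenng NG" would come out as last name "van Kampenng" and first name "NG". Names whose affix is not in the whitelist would still be split after the first token, so the quality of the result depends on how complete the list is.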


Erick Peirson Postdoctoral Scholar ASU-SFI Center for Biosocial Complexity Arizona State University


yzou20 commented 9 years ago

Thanks for the info. I will think about that!
