Open JohnSmithDev opened 5 years ago
OK, that particular script isn't going through the same function path as the reports that use gender_analysis.py
I've done some refactoring that will allow it to use the code from gender_analysis.py (now moved to title_related.py) that should address this issue (and maybe a couple of the preceding ones.
I am slightly bothered that the gender determination code this code was calling isn't doing the depseudonymization, which I thought it was. Hopefully I can get to a point where there is a single clear entry point to the gender determination, that covers all the bases.
Reopening because the commit that claims to fix this had the wrong number in the commit message :-(
As per prior comment, the original reported issue has been fixed, but I'm not 100% certain this is solved everywhere it needs to me, hence keeping it open for now.
Output from a new (not yet committed script):
./bulk_title_authors.py 2019 ... WARNING:root:No Twitter link(s) for http://www.isfdb.org/cgi-bin/title.cgi?2515634 http://www.isfdb.org/cgi-bin/ea.cgi?133814
This should be coming from Wikipedia.
Especially galling seeing this is an author_id I use in my tests :-(