Closed lfoppiano closed 8 years ago
Hi! Can you try the following:
data = np.load("data/author-disambiguation.npz", encoding="latin1")
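For context, a minimal sketch of what the `encoding` argument does: it only affects object arrays that were pickled under Python 2 and are being read under Python 3. The archive, the `X` key, and the `author_name` field below are hypothetical stand-ins, not the real contents of `author-disambiguation.npz`. Recent NumPy releases also need `allow_pickle=True` to read object arrays, which should only be used on files you trust.

```python
import os
import tempfile

import numpy as np


def load_npz(path, encoding="latin1"):
    """Load every array from an .npz archive into a plain dict.

    `encoding` only matters for object arrays pickled under Python 2.
    `allow_pickle=True` is needed by recent NumPy versions to read
    object arrays; only use it on files you trust.
    """
    with np.load(path, encoding=encoding, allow_pickle=True) as archive:
        return {name: archive[name] for name in archive.files}


# Hypothetical stand-in data: an object array of dict records.
records = np.empty(2, dtype=object)
records[0] = {"author_name": "Doe, John"}
records[1] = {"author_name": "Smith, Jane"}

# Write a small archive to a temporary directory, then read it back.
path = os.path.join(tempfile.mkdtemp(), "example.npz")
np.savez(path, X=records)

data = load_npz(path)
print(data["X"][0]["author_name"])
```

If loading succeeds but later steps still fail, the problem may lie in how the records are consumed rather than in how the file is read.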
@lfoppiano I recommend trying out the example here: https://github.com/inspirehep/beard/tree/master/examples/applications/author-disambiguation
@glouppe I've also tried that, and I got another error:
Traceback (most recent call last):
  File "/Users/lfoppiano/development/inria/disambiguation/paper-author-disambiguation/beard/examples/author_disambiguation.py", line 86, in <module>
    block_clusterer.fit(X)
  File "/Users/lfoppiano/development/inria/inria-virtualenv/lib/python3.5/site-packages/beard-0.0-py3.5.egg/beard/clustering/blocking.py", line 319, in fit
  File "/Users/lfoppiano/development/inria/inria-virtualenv/lib/python3.5/site-packages/beard-0.0-py3.5.egg/beard/clustering/blocking.py", line 178, in _validate
  File "/Users/lfoppiano/development/inria/inria-virtualenv/lib/python3.5/site-packages/beard-0.0-py3.5.egg/beard/clustering/blocking_funcs.py", line 385, in block_last_name_first_initial
TypeError: string indices must be integers
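As a general illustration of this error message (not a diagnosis of beard's code): Python raises it when a `str` is indexed with a non-integer key, typically because a plain string arrived where a dict-like record was expected. The `first_initial` helper and the `author_name` key below are hypothetical.

```python
def first_initial(record):
    # Hypothetical helper: expects a dict with an "author_name" key.
    return record["author_name"][0]


# Works when the record is a dict.
print(first_initial({"author_name": "Doe, John"}))

# Fails when the record is a plain string, because a str cannot be
# indexed with a string key.
try:
    first_initial("Doe, John")
except TypeError as exc:
    print("TypeError:", exc)
```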
@natsheh I haven't seen that page. I'll try it out.
In principle, I thought that author_disambiguation.py was a ready-made script for quickly running the whole pipeline. Isn't it?
author_disambiguation.py is a very simplified version of what is described in the paper. To reproduce our results, you should check examples/applications/author-disambiguation instead.
@glouppe OK, thanks :)
And an example of the input data is available here: https://github.com/inspirehep/beard/tree/master/examples/data
Indeed. I've managed to run sampling.py (using Python 2). How should I generate ethnicity_estimator.pickle?
I'm closing this issue, as the problem was caused by my using Python 3.
Dear all, after reading the paper and finding it very interesting, I wanted to try the application, so I cloned the repository. I ran the setup (using virtualenv) and launched
python author_disambiguation.py
But then I get the following error:
I've tried to use different encodings when loading the file
data = np.load("data/author-disambiguation.npz")
but with no success.
What am I missing?