inspirehep / beard

Bibliographic Entity Automatic Recognition and Disambiguation
Other
66 stars 36 forks source link

where is the dataset for the example of sampling.py ? #101

Closed SeekPoint closed 5 years ago

SeekPoint commented 5 years ago

python sampling.py \ --input_signatures input/signatures.json \ --input_clusters input/clusters.json \ --balanced 1 \ --sample_size 1000000 \ --output_pairs pairs/1M_nysiis_balanced.json \ --use_blocking 1 \ --blocking_function block_phonetic \ --blocking_threshold 1 \ --blocking_phonetic_alg nysiis \ --verbose 1

MSusik commented 5 years ago

I believe you are looking for https://github.com/glouppe/paper-author-disambiguation