olivettigroup / synthesis-database-public

Codebase for compiling a database of materials syntheses
MIT License
6 stars 2 forks source link

No such file or directory: 'data/extraction_lexicon.json' #1

Open wallian opened 5 years ago

wallian commented 5 years ago

Thank you so much for sharing these source codes.

I tried them, and got following error.

File "/home/username/scripts/python_dir/synthesis_project/synthesis-database-public-master/synthesisdatabase/classifiers/token_classifier.py", line 214, in _load_lexicon with open(lexicon_path) as f: IOError: [Errno 2] No such file or directory: 'data/extraction_lexicon.json'

Could you share this "extraction_lexicon.json"? Is this already exists in somewhere? Or should I create it by myself?

Thank you!

eddotman commented 5 years ago

Just realized I forgot to reply to this before -- sorry about that @wallian!

You'd have to make/include the JSON file yourself, but tbh these models are somewhat out of date. You'll find much better models at https://github.com/olivettigroup/materials-synthesis-generative-models which is a release that corresponds to a newer paper.

There is a far better token classifier here https://github.com/olivettigroup/materials-synthesis-generative-models/blob/master/models/token_classifier.py and the repo includes the annotated data required to train said model.