It seems that fetch_process_wbm_dataset.py and compute_struct_fingerprints.py expect different fingerprint paths:
> python fetch_process_wbm_dataset.py
[...]
fingerprints_path='[...]/data/wbm/site-stats.json.gz' not found, run scripts/compute_struct_fingerprints.py to generate
> python compute_struct_fingerprints.py
Loading 'wbm_summary' from cached file at '~/.cache/matbench-discovery/1.0.0/wbm/2022-10-19-wbm-summary.csv.gz'
out_path='~/.local/opt/anaconda3/envs/discovery/lib/python3.10/site-packages/data/wbm/site-stats-000.json.gz' already exists, exciting early
It seems that
fetch_process_wbm_dataset.py
andcompute_struct_fingerprints.py
expect different fingerprint paths: