optakt / flow-dps

Flow Data Provisioning Service
Apache License 2.0

Regenerate dictionaries using mainnet checkpoint for samples #514

Open Ullaakut opened 2 years ago

Ullaakut commented 2 years ago

Description

The current dictionaries were generated by sampling data from our snapshot, which contains realistic data but with less variety than a proper mainnet checkpoint would provide.

It would probably improve compression further to either index a mainnet checkpoint completely and then regenerate the dictionaries from that index, or change the dictionary sample generation to read directly from checkpoints instead of from our index.