allenai / bff

Apache License 2.0
37 stars 8 forks source link

Barebones python/S3 implementation #10

Open revbucket opened 6 months ago

revbucket commented 6 months ago

Added bff_v0.py which is a simple python script to: 1) download all .jsonl.gz's from a specified S3 directory 2) Run BFF on ^ 3) Upload the outputs back to S3

Also modified the main.rs to use the faster init for bloom filters